DUKE
MATHEMATICAL 
JOURNAL 


EDITED BY 


ARTHUR BYRON COBLE DAVID VERNON WIDDER 
JOSEPH MILLER THOMAS 
Managing Editor 


WITH THE COOPERATION OF 


H. E. BRAY L. R. FORD E. J. McSHANE J. A. SHOHAT
L. W. COHEN E. P. LANE C. C. MacDUFFEE G. T. WHYBURN 
R. E. LANGER OYSTEIN ORE 


AND THE MATHEMATICS DEPARTMENT OF DUKE UNIVERSITY 


VOLUME 3 


1937 


DUKE UNIVERSITY PRESS 
DURHAM, N. C. 


CONTENTS

VOLUME 3, 1937
ALLENDOERFER, CARL B. The imbedding of Riemann spaces in the . . .
BAER, REINHOLD. Abelian groups without elements of finite order
BECKENBACH, E. F. Remarks on the problem of Plateau
BELL, E. T. Certain ternary cubic arithmetical forms
BIRKHOFF, GARRETT. An extended arithmetic
I, COE. TS OE IB oii ood wo ded mec nw edceaesiones xs 
BOAS, R. P., JR. Asymptotic relations for derivatives
BOCHNER, S. Analytic mapping of compact Riemann spaces into Euclidean . . .
BOCHNER, S. Completely monotone functions of the Laplace operator for . . .
BOCHNER, S. Remark on the theorem of Green
BOCHNER, S. Stable laws of probability and completely monotone . . .
BROWN, ARTHUR B. Critical curvatures in Riemannian spaces
CAMERON, R. H. Analytic functions of absolutely convergent generalized . . .
CARLITZ, LEONARD. An analogue of the von Staudt-Clausen theorem
CARLITZ, LEONARD. Sums of squares of polynomials
COBLE, ARTHUR B. A class of linear groups with integral coefficients
COHEN, L. W. Uniformity properties in topological space satisfying the . . . 610
COHEN, L. W., and DUNFORD, NELSON. Transformations on sequence . . . 689
CORAL, MAX. On the necessary conditions for the minimum of a double . . . 585
DUNFORD, NELSON, and COHEN, L. W. Transformations on sequence . . . 689
FRAME, J. SUTHERLAND. The degrees of the irreducible components of simply transitive permutation groups
GERGEN, J. J. Summability of double Fourier series
GHENT, KENNETH S. Sums of values of a polynomial multiplied by . . .
GOLDSTINE, HERMAN H. The minima of functionals with associated side conditions . . . 418
HALL, D. W., and SCHWEIGERT, G. E. Non-n-alternating transformations . . . 623
HIGDON, R. ARCHIE, and HOLL, D. L. Stresses in moderately thick . . .
HILL, J. D. On perfect methods of summability
HILLE, EINAR. The inversion problem of Möbius
HILLE, EINAR, SZEGÖ, G., and TAMARKIN, J. D. On some generalizations . . . 729
HOLL, D. L., and HIGDON, R. ARCHIE. Stresses in moderately thick . . .
JACKSON, DUNHAM. Orthogonal polynomials on a plane curve
JACOBSON, N. A note on non-associative algebras
JOHN, FRITZ. Polar correspondence with respect to a convex region
KALES, MORRIS L. Tauberian theorems related to Borel and Abel summability
LANGER, RUDOLPH E. The expansion theory of ordinary differential systems . . .
LATIMER, CLAIBORNE G. The classes of integral sets in a quaternion . . .
LUBIN, C. I. Transformation of differential equations in the neighborhood . . .
McCOY, N. H., and MONTGOMERY, DEANE. A representation of generalized . . .
MAC LANE, SAUNDERS. A structural characterization of planar combinatorial graphs
MARTIN, MONROE H. The ergodic function of Birkhoff
MARTIN, W. T., and WIENER, N. Taylor's series of entire functions of . . . 213
MERRIMAN, G. M., and WALSH, J. L. Note on the simultaneous orthogonality of harmonic polynomials on several curves . . . 279
MONTGOMERY, DEANE, and McCOY, N. H. A representation of generalized . . .
ORE, OYSTEIN. Structures and group theory. I
PARKER, W. V. The characteristic roots of a matrix
PIERCE, JESSE. Solutions of systems of differential equations in terms of infinite series of definite integrals
PITT, H. R. Theorems on Fourier series and power series
PRICE, G. BALEY. On the extreme points of convex sets
QUADE, E. S. Trigonometric approximation in the mean
RADÓ, TIBOR. Solution of a problem of F. Riesz on the harmonic majorants . . .
RANDELS, W. C. Summability of conjugate derived series
RICHARDSON, M. Betti numbers of 3-fold symmetric products; a correction
SCHILLING, OTTO F. G. The structure of certain rational infinite . . .
SCHWEIGERT, G. E., and HALL, D. W. Non-n-alternating transformations . . . 623
SHEFFER, I. M. Concerning Appell sets and associated linear functional . . . 593
SPENCER, VIVIAN EBERLE. Asymptotic expressions for the zeros of generalized Laguerre polynomials and Weber functions . . . 667
STONE, M. H., and TAMARKIN, J. D. Elementary proofs of some known theorems of the theory of complex Euclidean spaces . . . 294
SZEGÖ, G., HILLE, EINAR, and TAMARKIN, J. D. On some generalizations . . . 729
TAMARKIN, J. D., and STONE, M. H. Elementary proofs of some known theorems of the theory of complex Euclidean spaces . . . 294
TAMARKIN, J. D., HILLE, EINAR, and SZEGÖ, G. On some generalizations . . . 729
TITT, E. W. (n-1)-dimensional characteristic strips of a first order . . . 740
VANDIVER, H. S. On Bernoulli's numbers and Fermat's last theorem . . . 569
WALSH, J. L., and MERRIMAN, G. M. Note on the simultaneous orthogonality of harmonic polynomials on several curves . . . 279
WARD, MORGAN. Residuation in structures over which a multiplication . . . 627
WEBSTER, M. S. On the zeros of Jacobi polynomials, with applications . . . 426
WEYL, HERMANN. Commutator algebra of a finite group of collineations . . . 200
WHITNEY, HASSLER. The maps of an n-complex into an n-sphere . . . 51
WHITNEY, HASSLER. On the maps of an n-sphere into another n-sphere . . . 46
WHITNEY, HASSLER. On matrices of integers and combinatorial topology . . . 35
WHYBURN, G. T. Interior transformations on compact sets . . . 370
WIENER, N., and MARTIN, W. T. Taylor's series of entire functions of . . . 213
WILLIAMSON, JOHN. Quasi-unitary matrices . . . 715








SUMS OF SQUARES OF POLYNOMIALS 
By LEONARD CARLITZ


1. Introduction. In this note we determine the number of representations of 0 as the sum of an arbitrary number of squares of polynomials in a single indeterminate with coefficients in a fixed Galois field $GF(p^n)$, $p > 2$. More accurately, if $\alpha_1, \cdots, \alpha_t$ are $t$ non-zero elements of $GF(p^n)$, $\alpha_1 + \cdots + \alpha_t = 0$, we determine the number of solutions of

(1.1)  $0 = \alpha_1 Y_1^2 + \cdots + \alpha_t Y_t^2$

in primary¹ polynomials $Y_i$ each of degree $k$, an assigned positive integer. We denote the number of solutions of (1.1) by

$N_t(0) = N_t^k(0).$
The more general equation

(1.2)  $\alpha G = \alpha_1 Y_1^2 + \cdots + \alpha_t Y_t^2,$

where $\alpha G \neq 0$, $G$ of degree $\le 2k$, has been treated in two papers, one on the case $t$ even, the other on the case $t$ odd.² In the latter paper a formula for $N_{2s}(0)$ appeared incidentally. We shall derive this formula anew by the simpler and direct method used in the paper on $t$ even.

To evaluate $N_{2s+1}(0)$, we make use of a known formula for $N_{2s}(G)$, the number of solutions of (1.2) for $t = 2s$. Applying this formula, we first evaluate the sums

$\sum \dfrac{N_{2s}(G)}{|G|^w}, \qquad \sum \dfrac{N_{2s}(G^2)}{|G|^w}$

extended over all primary $G$; the latter sum leads at once to the determination of $N_{2s+1}(0)$.
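The counting problem stated above can be explored numerically for very small parameters. The following is a minimal sketch, assuming the prime field $GF(p)$ (that is, $n = 1$) and odd $p$; the function names are hypothetical and the method is plain enumeration, not the method of this note.

```python
from itertools import product


def poly_mul(a, b, p):
    """Multiply two polynomials (coefficient lists, lowest degree first) mod p."""
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        for j, y in enumerate(b):
            out[i + j] = (out[i + j] + x * y) % p
    return out


def monic_polys(p, k):
    """Yield every primary (monic) polynomial of degree k over GF(p)."""
    for lower in product(range(p), repeat=k):
        yield list(lower) + [1]


def count_zero_representations(p, alphas, k):
    """Brute-force count of solutions of (1.1) in primary Y_i of degree k.

    Assumption: p is an odd prime and the coefficients live in GF(p) only.
    """
    if sum(alphas) % p != 0 or any(a % p == 0 for a in alphas):
        raise ValueError("alphas must be non-zero mod p and sum to zero mod p")
    squares = [poly_mul(y, y, p) for y in monic_polys(p, k)]
    count = 0
    for choice in product(squares, repeat=len(alphas)):
        # The tuple is a solution when every coefficient of the sum vanishes mod p.
        if all(sum(a * sq[d] for a, sq in zip(alphas, choice)) % p == 0
               for d in range(2 * k + 1)):
            count += 1
    return count
```

The cost grows like $p^{kt}$, so this is only useful as a sanity check on tiny cases such as $p = 3$, $t \le 4$, $k \le 2$.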


2. Determination of $N_{2s}(0)$. In equation (1.2), let $t = 2s$, $\alpha = \alpha_1 + \cdots + \alpha_{2s} \neq 0$, so that $G$ is of degree $2k$. Assume further

(2.1)  $\alpha_{2i-1} + \alpha_{2i} \neq 0 \qquad (i = 1, \cdots, s).$


Received October 27, 1936. 

¹ A polynomial is primary if the coefficient of the highest power of the indeterminate is the unit element of the Galois field. The capitals A, B, E, G, M, U, V, Y will denote primary polynomials.

² The even case in Transactions of the American Mathematical Society, vol. 35 (1933), pp. 397-410; the odd case in this Journal, vol. 1 (1935), pp. 298-315. These papers will be cited as I and II, respectively.


Then by Theorem 4 of paper I, the number of solutions of (1.2) is determined by

(2.2)  $N_{2s}(G) = \left(1 - \chi p^{n(1-s)}\right) \sum_{m>k} \chi^m |M|^{s-1} + \sum_{m=k} \chi^m |M|^{s-1}.$

Here $\chi = +1$ or $-1$ according as

(2.3)  $(-1)^s \alpha_1 \alpha_2 \cdots \alpha_{2s}$

is a square or a non-square in $GF(p^n)$; $m$ is the degree of $M$, $|M| = p^{nm}$; the summations are extended over all $M$ dividing $G$ of degree $m > k$ and $m = k$, respectively. If now we denote the right member of (2.2) by $\Lambda_{s-1}(G, \chi)$, we shall prove the following formula:

(2.4)  $\sum_{\deg G = 2k} \Lambda_s(G, \chi_1)\Lambda_t(G, \chi_2) = \left(1 - \dfrac{\chi}{p^{n(s+t+1)}}\right) \sum_{u=0}^{k-1} \chi^u p^{n(2k-u)(s+t+1)} p^{nu} + \chi^k p^{nk(s+t+2)},$

where $\chi = \chi_1\chi_2$, and the summation in the left member is over all primary $G$ of degree $2k$.

The proof of this formula is similar to the proof of Theorem 2 of paper I, and therefore only a sketch of it will be given. By means of (2.2), the left member of (2.4) may be put in the form

$\left(1 - \dfrac{\chi_1}{p^{ns}}\right)\left(1 - \dfrac{\chi_2}{p^{nt}}\right) \sum_{u>k,\ v>k} + \left(1 - \dfrac{\chi_1}{p^{ns}}\right) \sum_{u>k,\ v=k} + \left(1 - \dfrac{\chi_2}{p^{nt}}\right) \sum_{u=k,\ v>k} + \sum_{u=v=k},$

where each summation is over all $A, B, U, V$ of degree $a, b, u, v$, respectively, such that

$AU = BV, \qquad a + u = b + v = 2k,$

and in addition satisfies the conditions under each $\sum$. Call the sums $\Sigma_1, \Sigma_2, \Sigma_3, \Sigma_4$, respectively. Then

$\Sigma_1 = \sum_{AU=BV;\ u,v>k} \chi_1^u \chi_2^v |U|^s |V|^t = p^{2nk(s+t)} \sum_{AU=BV;\ a,b<k} \chi_1^a \chi_2^b |A|^{-s} |B|^{-t} = p^{2nk(s+t)} \sum_{\deg M<k} \chi^m |M|^{-s-t} S_M,$

where

(2.5)  $S_M = \sum_{(A,B)=1;\ a,b<k-m} \chi_1^a \chi_2^b |A|^{-s} |B|^{-t} \sum_{AU=BV;\ a+u=b+v=2k-m} 1.$













Here the outer sum is over all $A, B$ of degree $a, b < k - m$ such that $(A, B) = 1$, while the inner sum is over all $U, V$ of degree $2k - m - a$, $2k - m - b$, respectively, for which $AU = BV$. Since $(A, B) = 1$, it follows that $A \mid V$ and $B \mid U$. For the moment put $V = AE$, $U = BE$, so that $\deg E = 2k - m - a - b$. Thus for fixed $A, B$ there are $|E|$ choices of $U, V$ satisfying the conditions mentioned. In other words, the inner sum in (2.5) $= p^{n(2k-m-a-b)}$, and therefore

(2.6)  $S_M = \sum_{a,b<k-m} \chi_1^a \chi_2^b\, p^{n(2k-m-a(s+1)-b(t+1))}\, \psi(a, b),$

where³ $\psi(a, b)$ denotes the number of pairs $A, B$ of degree $a, b$, respectively, such that $(A, B) = 1$. The sum in (2.6) is easily evaluated but this need not be repeated here. It is sufficient to notice that comparing this point in the present proof with the corresponding point⁴ in the proof of Theorem 2 of paper I, we are at once able to conclude that the left member of (2.4)

$= p^{2nk(s+t+1)} \sum_{\deg M=k} \chi^m |M|^{-s-t-1} + \left(1 - \dfrac{\chi}{p^{n(s+t+1)}}\right) p^{2nk(s+t+1)} \sum_{\deg M<k} \chi^m |M|^{-s-t-1}$

$= \chi^k p^{nk(s+t+2)} + \left(1 - \dfrac{\chi}{p^{n(s+t+1)}}\right) \sum_{m=0}^{k-1} \chi^m p^{n(2k-m)(s+t+1)} p^{nm}.$


We have therefore proved (2.4).

It is now easy to evaluate $N_{2s}(0)$. Since the conditions (2.1) are assumed to hold, we have in particular $\alpha = \alpha_1 + \cdots + \alpha_{2s-2} \neq 0$. Then clearly all solutions of

$0 = \alpha_1 Y_1^2 + \cdots + \alpha_{2s} Y_{2s}^2 \qquad (\deg Y_i = k)$

may be obtained by pairing the solutions of

$\alpha G = \alpha_1 Y_1^2 + \cdots + \alpha_{2s-2} Y_{2s-2}^2 \qquad (\deg Y_i = k)$

with those of

$-\alpha G = \alpha_{2s-1} Y_{2s-1}^2 + \alpha_{2s} Y_{2s}^2 \qquad (\deg Y_i = k)$

in all possible ways, and allowing $G$ to range over all polynomials of degree $2k$. We define $\chi_1$ as $+1$ or $-1$ according as $(-1)^{s-1}\alpha_1 \cdots \alpha_{2s-2}$ is or is not a square in $GF(p^n)$; $\chi_2$ is defined in the same way with respect to $-\alpha_{2s-1}\alpha_{2s}$. Then, by the remark just made, it follows that

$N_{2s}(0) = \sum_{\deg G=2k} N_{2s-2}(G) N_2(G),$

and by (2.2), the right member

$= \sum_{\deg G=2k} \Lambda_{s-2}(G, \chi_1) \Lambda_0(G, \chi_2).$


³ See I, p. 399.
⁴ I, p. 405 bottom; cf. also proof of I, Theorem 1.


We may now apply (2.4), and we have at once

(2.7)  $N_{2s}(0) = \chi^k p^{nks} + \left(1 - \dfrac{\chi}{p^{n(s-1)}}\right) \sum_{u=0}^{k-1} \chi^u p^{n(2k-u)(s-1)} p^{nu}.$

Thus we have the following

THEOREM A. If $\alpha_1, \cdots, \alpha_{2s}$ are $2s$ non-zero elements of $GF(p^n)$, such that $\alpha_1 + \cdots + \alpha_{2s} = 0$, while

$\alpha_{2i-1} + \alpha_{2i} \neq 0 \qquad (i = 1, \cdots, s),$

then the number of solutions of $\alpha_1 Y_1^2 + \cdots + \alpha_{2s} Y_{2s}^2 = 0$ in primary $Y_i$ of degree $k$ is furnished by (2.7) together with (2.3).
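Theorem A can be evaluated mechanically. The sketch below assumes the prime field $GF(p)$ with $p$ odd ($n = 1$), reads (2.7) as $N_{2s}(0) = \chi^k p^{ks} + (1 - \chi p^{-(s-1)}) \sum_{u<k} \chi^u p^{(2k-u)(s-1)+u}$, and decides the quadratic character in (2.3) by Euler's criterion. The function names are hypothetical.

```python
from fractions import Fraction


def chi(p, alphas):
    """+1 or -1 as (-1)^s * alpha_1 ... alpha_2s is or is not a square in GF(p)."""
    s = len(alphas) // 2
    value = (-1) ** s
    for a in alphas:
        value = (value * a) % p
    # Euler's criterion: non-zero x is a square mod an odd prime p
    # exactly when x^((p-1)/2) is 1 mod p.
    return 1 if pow(value % p, (p - 1) // 2, p) == 1 else -1


def n_even_zero(p, alphas, k):
    """Evaluate the closed form (2.7) over GF(p) (assumption: n = 1)."""
    if len(alphas) % 2 or sum(alphas) % p != 0:
        raise ValueError("need an even number of coefficients summing to zero mod p")
    pairs = zip(alphas[0::2], alphas[1::2])
    if any((a + b) % p == 0 for a, b in pairs):
        raise ValueError("condition (2.1) fails for this numbering")
    s = len(alphas) // 2
    x = chi(p, alphas)
    total = Fraction(x ** k * p ** (k * s))
    factor = 1 - Fraction(x, p ** (s - 1))
    for u in range(k):
        total += factor * x ** u * p ** ((2 * k - u) * (s - 1) + u)
    return total
```

For $p = 3$, $k = 1$ the two sample inputs in the checks can be confirmed by hand: with $\alpha = (1,1,2,2)$ the conditions reduce to $c_1 + c_2 = c_3 + c_4$ and $c_1^2 + c_2^2 = c_3^2 + c_4^2$ in $GF(3)$, which have 15 solutions.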


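3. The sum $\sum N_{2s}(G)|G|^{-w}$. In (2.2) $G$ is assumed to be of even degree. However, if we rewrite the equation in the following form

(3.1)  $N_{2s}(G) = \left(1 - \chi p^{n(1-s)}\right) \sum_{G=MU,\ m>u} \chi^m |M|^{s-1} + \sum_{G=MU,\ m=u} \chi^m |M|^{s-1},$

then $N_{2s}(G)$ is defined for all $G$ (for $G$ of odd degree the second sum on the right is vacuous). We may now consider the sum $\sum N_{2s}(G)|G|^{-w}$ taken over all primary $G$. Thus substituting from (3.1), we have

(3.2)  $\sum_G \dfrac{N_{2s}(G)}{|G|^w} = \sum_{m=u} \dfrac{\chi^m |M|^{s-1}}{|MU|^w} + \left(1 - \chi p^{n(1-s)}\right) \sum_{m>u} \dfrac{\chi^m |M|^{s-1}}{|MU|^w},$

where the sums on the right are over all $M, U$ of degree $m, u$, respectively, satisfying the conditions indicated. From this it is clear that the right member of (3.2) may be reduced to

$\sum_{m=0}^{\infty} \chi^m p^{nm(s+1-2w)} + \left(1 - \chi p^{n(1-s)}\right) \sum_{u=0}^{\infty} p^{nu(1-w)} \sum_{m=u+1}^{\infty} \chi^m p^{nm(s-w)}$

$= \dfrac{1}{1 - \chi p^{n(s+1-2w)}} + \left(1 - \chi p^{n(1-s)}\right) \sum_{u=0}^{\infty} \chi^u p^{nu(s+1-2w)} \dfrac{\chi p^{n(s-w)}}{1 - \chi p^{n(s-w)}}$

$= \dfrac{1}{1 - \chi p^{n(s+1-2w)}} + \dfrac{\left(1 - \chi p^{n(1-s)}\right) \chi p^{n(s-w)}}{\left(1 - \chi p^{n(s+1-2w)}\right)\left(1 - \chi p^{n(s-w)}\right)},$

from which it follows that

(3.3)  $\sum_G \dfrac{N_{2s}(G)}{|G|^w} = \dfrac{1 - p^{n(1-w)}}{\left(1 - \chi p^{n(s+1-2w)}\right)\left(1 - \chi p^{n(s-w)}\right)}.$

If we split the right member into partial fractions, (3.3) becomes

(3.4)  $\sum_G \dfrac{N_{2s}(G)}{|G|^w} = \dfrac{1}{1 - \chi p^{n(s-w)}} - \dfrac{p^{n(1-w)}}{1 - \chi p^{n(s+1-2w)}}.$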
















From this it is easy to evaluate $\sum N_{2s}(G)$, summed over all $G$ of fixed degree. For even degree we have the following simple formula:

(3.5)  $\sum_{\deg G=2k} N_{2s}(G) = p^{2nks},$

while for odd degree,

(3.6)  $\sum_{\deg G=2k+1} N_{2s}(G) = \chi p^{ns(2k+1)} - \chi^k p^{n(ks+k+1)}.$

The sum formula for even degree (3.5) may be derived in a slightly more direct manner. We consider the sum $\sum N_{2s}(G)|G|^{-w}$ taken over $G$ of even degree only, so that $N_{2s}(G)$ has its original meaning. Then as above,

$\sum_{\deg G\ \mathrm{even}} \dfrac{N_{2s}(G)}{|G|^w} = \sum_{m=u} \dfrac{\chi^m |M|^{s-1}}{|MU|^w} + \left(1 - \chi p^{n(1-s)}\right) \sum_{m>u,\ m-u\ \mathrm{even}} \dfrac{\chi^m |M|^{s-1}}{|MU|^w}$

$= \sum_{m=0}^{\infty} \chi^m p^{nm(s+1-2w)} + \left(1 - \chi p^{n(1-s)}\right) \sum_{u=0}^{\infty} p^{nu(1-w)} \sum_{m-u=2,4,\cdots} \chi^m p^{nm(s-w)}$

$= \dfrac{1}{1 - \chi p^{n(s+1-2w)}} + \left(1 - \chi p^{n(1-s)}\right) \sum_{u=0}^{\infty} \chi^u p^{nu(s+1-2w)} \dfrac{p^{2n(s-w)}}{1 - p^{2n(s-w)}}$

$= \dfrac{1}{1 - \chi p^{n(s+1-2w)}} + \dfrac{\left(1 - \chi p^{n(1-s)}\right) p^{2n(s-w)}}{\left(1 - \chi p^{n(s+1-2w)}\right)\left(1 - p^{2n(s-w)}\right)},$

from which we have at once

(3.7)  $\sum_{\deg G\ \mathrm{even}} \dfrac{N_{2s}(G)}{|G|^w} = \dfrac{1}{1 - p^{2n(s-w)}};$

this agrees with (3.5).

Comparing (3.7) with (3.4), we have

$\sum_{\deg G\ \mathrm{odd}} \dfrac{N_{2s}(G)}{|G|^w} = \dfrac{\chi p^{n(s-w)}}{1 - p^{2n(s-w)}} - \dfrac{p^{n(1-w)}}{1 - \chi p^{n(s+1-2w)}},$

which is equivalent to (3.6).


4. The sum $\sum N_{2s}(G^2)|G|^{-w}$. The evaluation of this sum is somewhat more elaborate than that of the sum (3.2). If we put $G^2 = M_0 U_0$, and write $(M_0, U_0) = D$, then clearly $M_0/D$ and $U_0/D$ are squares; we may therefore suppose that

$G^2 = DM^2 \cdot DU^2, \qquad G = DMU, \qquad (M, U) = 1.$

Then substituting from (2.2), we now have

(4.1)  $\sum_G \dfrac{N_{2s}(G^2)}{|G|^w} = \left(1 - \chi p^{n(1-s)}\right) \sum_{m>u} \dfrac{\chi^d |DM^2|^{s-1}}{|DMU|^w} + \sum_{m=u} \dfrac{\chi^d |DM^2|^{s-1}}{|DMU|^w},$










where the sum on the left is over all $G$, while the sums on the right are over all $D, M, U$ of respective degree $d, m, u$ satisfying the condition under each $\sum$, and in addition $(M, U) = 1$. We may evidently put the right member of (4.1) in the form

(4.2)  $\sum_D \chi^d |D|^{s-1-w} \cdot T = \dfrac{T}{1 - \chi p^{n(s-w)}},$

where

(4.3)  $T = \sum_{(M,U)=1,\ m=u} |M|^{2s-2-w} |U|^{-w} + \left(1 - \chi p^{n(1-s)}\right) \sum_{(M,U)=1,\ m>u} |M|^{2s-2-w} |U|^{-w}.$

As for the first sum in (4.3), for $\psi(a, b)$ as defined in (2.6), it is clear that

$\sum_{(M,U)=1,\ m=u} |M|^{2s-2-w} |U|^{-w} = \sum_{m=0}^{\infty} p^{nm(2s-2-2w)} \psi(m, m) = 1 + \left(1 - p^{-n}\right) \sum_{m=1}^{\infty} p^{2nm(s-w)}$

(4.4)  $= \dfrac{1 - p^{n(2s-1-2w)}}{1 - p^{2n(s-w)}}.$

The calculation of the second sum in (4.3) is somewhat longer. In the first place

$\sum_{(M,U)=1,\ m>u} |M|^{2s-2-w} |U|^{-w} = \sum_{m>u} p^{nm(2s-2-w)} p^{-nuw} \psi(m, u).$

The terms in which $u = 0$ contribute

(4.5)  $\sum_{m=1}^{\infty} p^{nm(2s-2-w)} p^{nm} = \dfrac{p^{n(2s-1-w)}}{1 - p^{n(2s-1-w)}}.$

For the remaining terms, we have

$\sum_{m>u>0} p^{nm(2s-1-w)} p^{nu(1-w)} \left(1 - p^{-n}\right) = \left(1 - p^{-n}\right) \sum_{u=1}^{\infty} p^{nu(1-w)} \sum_{m=u+1}^{\infty} p^{nm(2s-1-w)}$

(4.6)  $= \dfrac{\left(1 - p^{-n}\right) p^{n(4s-1-3w)}}{\left(1 - p^{2n(s-w)}\right)\left(1 - p^{n(2s-1-w)}\right)},$

by an easy calculation. Combining (4.5) and (4.6), we see that the second sum in (4.3)

$= \dfrac{p^{n(2s-1-w)} - p^{n(4s-2-3w)}}{\left(1 - p^{2n(s-w)}\right)\left(1 - p^{n(2s-1-w)}\right)}.$

Substituting this and (4.4) in (4.3), we may verify that

$T = \dfrac{\left(1 - \chi p^{n(s-w)}\right)\left(1 - p^{n(2s-1-2w)}\right)}{\left(1 - p^{2n(s-w)}\right)\left(1 - p^{n(2s-1-w)}\right)}.$


























Therefore, returning to (4.2), we see finally that

(4.7)  $\sum_G \dfrac{N_{2s}(G^2)}{|G|^w} = \dfrac{1 - p^{n(2s-1-2w)}}{\left(1 - p^{2n(s-w)}\right)\left(1 - p^{n(2s-1-w)}\right)}.$











5. Determination of $N_{2s+1}(0)$. Making use of the formula just proved, we shall now evaluate $N_{2s+1}(0)$. Let $\alpha_1, \cdots, \alpha_{2s+1}$ be $2s + 1$ non-zero elements of $GF(p^n)$ such that $\alpha_1 + \cdots + \alpha_{2s+1} = 0$. We may assume them so numbered that the conditions (2.1) are satisfied,⁵ and therefore $N_{2s}(G)$ applies and is determined by (2.2). Now the number of solutions of

$0 = \alpha_1 Y_1^2 + \cdots + \alpha_{2s+1} Y_{2s+1}^2 \qquad (\deg Y_i = k)$

is clearly the sum of the number of solutions of

$-\alpha_{2s+1} G^2 = \alpha_1 Y_1^2 + \cdots + \alpha_{2s} Y_{2s}^2 \qquad (\deg Y_i = k),$

extended over all $G$ of degree $k$. In other words,

(5.1)  $N_{2s+1}(0) = \sum_{\deg G=k} N_{2s}(G^2).$

But the sum in the right member is precisely the coefficient of $p^{-nkw}$ in the expansion of

(5.2)  $\sum_G \dfrac{N_{2s}(G^2)}{|G|^w} = \sum_{k=0}^{\infty} p^{-nkw} \sum_{\deg G=k} N_{2s}(G^2).$

Comparing (5.2) with (4.7), it is clear that the quantity in question is the coefficient of $p^{-nkw}$ in

$\left(1 - p^{n(2s-1-2w)}\right) \sum_{i,j=0}^{\infty} p^{2nis-2niw} p^{n(2s-1)j-njw} = \left(1 - p^{n(2s-1)-2nw}\right) \sum_{k=0}^{\infty} p^{-nkw} \sum_{2i \le k} p^{2nis+n(2s-1)(k-2i)}.$

Therefore, finally

(5.3)  $N_{2s+1}(0) = p^{n(2s-1)k} + \left(1 - p^{-n}\right) \sum_{2 \le 2i \le k} p^{n(2s-1)(k-2i)+2nis}.$

For $k < 2$, this reduces to the first term: $N_{2s+1}(0) = p^{n(2s-1)k}$. According to (5.3), $N_{2s+1}(0)$ depends only on $s$ and $k$; in particular it is independent of the numbering of the $\alpha_i$ required at the beginning of this section. We may now state

THEOREM B. If $\alpha_1, \cdots, \alpha_{2s+1}$ are $2s + 1$ non-zero elements of $GF(p^n)$, such that $\alpha_1 + \cdots + \alpha_{2s+1} = 0$, then the number of solutions of $\alpha_1 Y_1^2 + \cdots + \alpha_{2s+1} Y_{2s+1}^2 = 0$ in primary $Y_i$ of degree $k$ is furnished by (5.3).
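As a quick numerical illustration of Theorem B, the closed form can be evaluated with exact rational arithmetic. This is a minimal sketch: it reads (5.3) as $p^{n(2s-1)k} + (1 - p^{-n})\sum_{1 \le i \le k/2} p^{n(2s-1)(k-2i)+2nis}$, takes the field size $q = p^n$ as a single argument, and uses a hypothetical function name.

```python
from fractions import Fraction


def n_odd_zero(q, s, k):
    """Evaluate (5.3) for 2s + 1 squares of primary polynomials of degree k.

    q stands for p^n, the number of elements of the Galois field (assumed odd).
    """
    base = Fraction(q)
    total = base ** ((2 * s - 1) * k)
    # The correction terms run over 1 <= i <= k // 2 and vanish for k < 2.
    for i in range(1, k // 2 + 1):
        total += (1 - 1 / base) * base ** ((2 * s - 1) * (k - 2 * i) + 2 * i * s)
    return total
```

For three squares ($s = 1$) over $GF(3)$ this gives 3 when $k = 1$ and 15 when $k = 2$; both values can be confirmed by listing the solutions of $Y_1^2 + Y_2^2 + Y_3^2 = 0$ by hand.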


DUKE UNIVERSITY.


⁵ See II, §3.








THE DEGREES OF THE IRREDUCIBLE COMPONENTS OF SIMPLY 
TRANSITIVE PERMUTATION GROUPS 


By J. SUTHERLAND FRAME


1. In a study of certain hyperorthogonal groups,¹ there arose the problem of splitting into its $r$ irreducible components a simply transitive permutation group $G^*$ of degree $n$ and order $g$, which, when written in matrix form, gave an isomorphic representation of the given abstract group as a group $G$ of linear transformations. In any simply transitive permutation group, the subgroup leaving one symbol invariant will permute the remaining symbols in $\lambda = r - 1$ sets of transitivity of $k_1, k_2, \cdots, k_\lambda$ symbols respectively. Let the distinct irreducible components of the group $G$ have the degrees $n_0 = 1, n_1, \cdots, n_{\lambda'}$, and note that $r = \lambda + 1$ is the sum of the squares of the multiplicities with which these occur in the reduction of $G$.² When the components are all distinct, and $\lambda' = \lambda$, there appears to be a simple relation between the product $K = 1 \cdot k_1 k_2 \cdots k_\lambda$ and the product $N = 1 \cdot n_1 n_2 \cdots n_\lambda$.

CONJECTURED THEOREM I. $n^{\lambda-1}K/N$ is an integer when the components of $G$ are distinct, and this is a perfect square $R^2$ when the numbers $k_i$ are distinct.

We shall prove the theorem for all groups for which $\lambda \le 3$ (here $\lambda'$ must equal $\lambda$), and for an infinite family of groups including all values of $\lambda$. When $\lambda = 1$, the group $G^*$ is doubly transitive and $n_1 = k_1 = N = K = n - 1$, so the result is trivial. When $\lambda = 2$, our theorem gives us a diophantine equation, $nk_1k_2/n_1n_2 = R^2$, which, with $n_1 + n_2 = n - 1$, enables us to solve for the unknowns $n_1$ and $n_2$.
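The case $\lambda = 2$ lends itself to a direct search. The following is a minimal sketch, assuming $k_1 \neq k_2$ so that the perfect-square form of the theorem applies; the function name is hypothetical. It lists every split $n_1 + n_2 = n - 1$ for which $nk_1k_2/n_1n_2$ is a perfect square.

```python
from math import isqrt


def rank3_degree_candidates(n, k1, k2):
    """Return all (n1, n2, R) with n1 <= n2, n1 + n2 = n - 1 and n*k1*k2 = R^2 * n1 * n2."""
    if 1 + k1 + k2 != n:
        raise ValueError("the sets of transitivity must account for all n symbols")
    numerator = n * k1 * k2
    found = []
    for n1 in range(1, (n - 1) // 2 + 1):
        n2 = n - 1 - n1
        if numerator % (n1 * n2) == 0:
            quotient = numerator // (n1 * n2)
            root = isqrt(quotient)
            if root * root == quotient:
                found.append((n1, n2, root))
    return found
```

For the symmetric group of degree 5 acting on the 10 pairs of symbols, where $k_1 = 6$ and $k_2 = 3$, the search leaves the single candidate $(n_1, n_2) = (4, 5)$ with $R = 3$. In general the search may return several candidates, and further information is then needed to choose among them.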

To illustrate the application of the theorem, before passing to the details of the proof, we take as an example the case of the hyperorthogonal groups, where the problem of this paper was suggested.¹ We have here a permutation group of degree $Q_mQ_{m-1}/Q_2$ (where $Q_m = q^m - (-1)^m$, $q = p^s$, $p$ prime) which is known to have 3 irreducible components. We know also that $k_1 = q^{2m-3}$, $k_2 = q^2Q_{m-2}Q_{m-3}/Q_2$. Hence, $n_1n_2 = q^{2m-1}Q_mQ_{m-1}Q_{m-2}Q_{m-3}/Q_2^2R^2$, where $R$ is an integer, and $n_1 + n_2 = n - 1 = (Q_mQ_{m-1} - Q_2)/Q_2$. The degree of $n_1$ or $n_2$ as a polynomial in $q$ is $2m - 3$, that of the other being less; so the degree of $R$ in $q$ is at least $m - 2$. Since $n_1n_2$ is divisible by an odd power of $q$, and $n_1 + n_2$ is divisible by $q^2$, it follows that $n_1$, say, is divisible by $q^2$ but not $q^3$, and $n_2$ by $q^3$ or some higher odd power. $R$, not being divisible by $q^{m-2}$, must contain a factor whose square divides $Q_mQ_{m-1}Q_{m-2}Q_{m-3}/Q_2^2$. The only such factor is $Q_1$, or possibly $Q_3$ for certain values of $m$. The latter possibility does not work, and we find $R = q^{m-3}Q_1$, $n_1 = q^2Q_mQ_{m-3}/Q_2Q_1$, $n_2 = q^3Q_{m-1}Q_{m-2}/Q_2Q_1$. The degrees of the irreducible components are hereby uniquely determined from our formula.

Received September 17, 1936.
¹ J. S. Frame, Unitäre Matrizen in Galoisfeldern, Commentarii Mathematici Helvetici, vol. 7 (1935), pp. 97, 98.
J. S. Frame, The simple group of order 25920, this Journal, vol. 2 (1936), p. 477.
² W. Burnside, On the complete reduction of any transitive permutation group, Proc. Lond. Math. Soc., ser. 2, vol. 3 (1905), p. 239.

In attacking the proof of the theorem, we shall first study in §2 the ring of matrices permutable with the group $G$, and then the secular equation, in terms of whose roots $\rho_i$ the constants $n_i$ may be expressed. In §3 we shall give the proof of the theorem for the cases $\lambda = 2$, $\lambda = 3$. In §4 we shall prove it for a certain infinite family of groups including all values of $\lambda$ by solving explicitly for the constants $n_i$. A lemma on binomial coefficients will be proved in §5.


a) . x Y . . 
2. To each permutation oa of the group G*, there corresponds in G a matrix 


of degree $n$, $(e_{\alpha\beta})$, where $\alpha, \beta = 1, 2, \cdots, n$, which may be thought of as transforming the space of the variables $x_1 \cdots x_n$. Let the subgroup $G_1$ of $G$ leave $x_1$ invariant and permute the remaining variables in $\lambda = r - 1$ sets of transitivity $T_t$, of $k_t$ variables respectively, $t = 1, \cdots, \lambda$. Then, setting $k_0 = 1$, we have

(2.1)  $\sum_{t=0}^{\lambda} k_t = n.$

Now if $x_t$ be any variable from the set $T_t$, and we transform the subscripts of the product $\bar{x}_1 x_t$ by each permutation $S$ of $G^*$, so that $S$ transforms $x_t$ into $Sx_t$, and $\bar{x}_1$ into $S\bar{x}_1 = \overline{Sx_1}$, then the sum $(nk_t/g) \sum_{S \text{ in } G^*} (S\bar{x}_1)(Sx_t)$ is an invariant hermitian form $H_t$. Its non-vanishing coefficients are all 1, since the subgroup leaving $x_1$ and $x_t$ both invariant is of index $nk_t$ under $G$. Its matrix of coefficients $V_t$ is one of the $\lambda + 1$ basis elements of the ring of matrices $V$ which are permutable with each of the permutations of $G$.³ The conjugate imaginary form $\bar{H}_t = H_{t'}$ is also invariant, and so the corresponding matrix $V_{t'}$, the transposed of $V_t$, is in the ring. In particular, $H_0$ is the unit hermitian form, and $V_0 = I$ the identity matrix. Furthermore, $W = \sum_{t=0}^{\lambda} V_t$ is a matrix consisting entirely of 1's.
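Let $V = \sum_{t=0}^{\lambda} a_t V_t$ be an arbitrary matrix of the ring, and let products be given by the formulas

(2.2)  $V_i V_j = \sum_{s=0}^{\lambda} c_{ijs} V_s,$

(2.3)  $V V_j = \sum_{s=0}^{\lambda} C_{js} V_s, \quad \text{where } C_{js} = \sum_{t=0}^{\lambda} a_t c_{tjs}.$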
s=0 t=0 


³ I. Schur, Zur Theorie der einfach transitiven Permutationsgruppen, Berlin, Sitzungsberichte (1933), pp. 598-623. Schur has developed several of the properties of this matrix ring, applying it particularly to the study of permutation groups having as a subgroup a regular group of the same degree. We assume no such subgroup here; in fact, none exists for the simple groups in which we are particularly interested.








Since $V_i W = k_i W$, we have

(2.4)  $\sum_{j=0}^{\lambda} c_{ijs} = k_i.$

In order to display the reducibility of $G$, we must change the variables in such a way that the invariant hermitian forms $H_t$ become diagonal forms $\bar{S}'H_tS$ on $n_t$ variables respectively, $t = 0, 1, \cdots, \lambda$. The matrices $V_t$ will then also be brought into diagonal form $S^{-1}V_tS$, and can be written as linear combinations of new basis elements $M_i$, each of which is merely a multiplication on a set of $n_i$ of the new variables.

(2.5)  $MM_i = \rho_i M_i, \qquad M = S^{-1}VS, \qquad \rho_i = \sum_{t=0}^{\lambda} \rho_{it} a_t.$

Since, when we choose the parameters $a_t$ so that $M = M_j$, we have $M_jM_i = \rho_iM_i = 0$, but $M_jM_j = \rho_jM_j \neq 0$, no two multipliers $\rho_i$ and $\rho_j$ corresponding to different $M_i$ and $M_j$ can be identical in the $a_t$. Each multiplier $\rho_i$ is a root of multiplicity $n_i$ of $\det(V - \rho I) = 0$. More simply, it is one of the $\lambda + 1$ distinct roots of the secular equation

(2.6)  $D(\rho) = \det(C_{ij} - \rho\delta_{ij}) = (\rho_0 - \rho)(\rho_1 - \rho) \cdots (\rho_\lambda - \rho) = 0.$


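By adding all the other rows to any particular row of the determinant $D(\rho)$, we readily verify by (2.3) and (2.4) that

(2.7)  $\rho_0 = \sum_{t=0}^{\lambda} k_t a_t = \sum_{j=0}^{\lambda} C_{js}$

is a root. In particular, setting $a_t = 1$, we find that $\rho_0 = n$ is only a simple root of $\det(W - \rho I) = 0$, the other roots being all 0. Hence $\rho_0$ is the root of multiplicity $n_0 = 1$, and each of the other roots is a function only of the differences $a_t - a_0$. If we denote the trace of the matrix $V^h$ by $ns_h$, then

(2.8)  $ns_h = \sum_{i=0}^{\lambda} n_i \rho_i^h,$

and we may solve for the constants $n_i$. We let $\Delta$ be the Vandermonde determinant of $\rho_0, \rho_1, \cdots, \rho_\lambda$, and $\Delta_t$ be the Vandermonde determinant of $\rho_0, \rho_1, \cdots, \rho_{t-1}, \rho_{t+1}, \cdots, \rho_\lambda$, noting that $(-1)^{t+1}D'(\rho_t) = \Delta/\Delta_t$, and obtain

(2.9)  $\dfrac{n_t}{n}\Delta = \begin{vmatrix} 1 & \cdots & 1 & s_0 & 1 & \cdots & 1 \\ \rho_0 & \cdots & \rho_{t-1} & s_1 & \rho_{t+1} & \cdots & \rho_\lambda \\ \rho_0^2 & \cdots & \rho_{t-1}^2 & s_2 & \rho_{t+1}^2 & \cdots & \rho_\lambda^2 \\ \vdots & & \vdots & \vdots & \vdots & & \vdots \\ \rho_0^\lambda & \cdots & \rho_{t-1}^\lambda & s_\lambda & \rho_{t+1}^\lambda & \cdots & \rho_\lambda^\lambda \end{vmatrix} = F_t \Delta_t,$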















where $F_t$ is the polynomial obtained from $(-1)^{t+1}D'(\rho_t)$ by replacing each power $\rho_t^h$ of $\rho_t$ by the corresponding $s_h$. Since $n_0 = 1$, we have

(2.10)  $F_0 = \Delta/n\Delta_0 = -D'(\rho_0)/n, \qquad F_t = n_t\Delta/n\Delta_t = (-1)^{t+1}D'(\rho_t)\,n_t/n.$

It follows that

(2.11)  $N/n^{\lambda-1} = n^2 \prod_{t=0}^{\lambda} (n_t/n) = n^2 \prod_{t=0}^{\lambda} F_t \Big/ \prod_{t=0}^{\lambda} (-1)^{t+1} D'(\rho_t).$

The proof of Theorem I depends now only on the proof of

LEMMA I. $\prod_{t=0}^{\lambda} F_t = KF_0^2F^2$, where $K = k_1k_2 \cdots k_\lambda$, and $F$ is a factor of $\Delta_0$ such that $R = \Delta_0/F$ is an integer.


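3. We shall now prove the theorem for the cases $\lambda = 2$, $\lambda = 3$. To facilitate the algebraic work we introduce the notation:

(3.1)  $\theta_i = \rho_0 - \rho_i; \quad \phi_0 = 1, \quad \phi_1 = \sum \theta_i, \quad \phi_2 = \sum \theta_i\theta_j, \quad \cdots, \quad \phi_\lambda = \theta_1\theta_2 \cdots \theta_\lambda.$

(3.2)  $\sigma_0 = 1 - 1/n, \quad \sigma_1 = s_1 - \rho_0, \quad \sigma_h = \sum_j (-1)^j \binom{h}{j} \rho_0^j s_{h-j}, \quad h > 0.$

Since

(3.3)  $F_0 = \begin{vmatrix} 1 & 1 & \cdots & 1 \\ \sigma_1 & -\theta_1 & \cdots & -\theta_\lambda \\ \vdots & \vdots & & \vdots \\ \sigma_\lambda & (-\theta_1)^\lambda & \cdots & (-\theta_\lambda)^\lambda \end{vmatrix} \Bigg/ \begin{vmatrix} 1 & \cdots & 1 \\ -\theta_1 & \cdots & -\theta_\lambda \\ \vdots & & \vdots \\ (-\theta_1)^{\lambda-1} & \cdots & (-\theta_\lambda)^{\lambda-1} \end{vmatrix} = (-1)^\lambda(\phi_\lambda + \sigma_1\phi_{\lambda-1} + \cdots + \sigma_\lambda\phi_0)$

and

(3.4)  $F_0 = (-1)^\lambda \phi_\lambda/n,$

we have

(3.5)  $\sum_{h=0}^{\lambda} \sigma_h \phi_{\lambda-h} = 0.$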







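The formula for $F_t$, $t > 0$, becomes

$F_t = \begin{vmatrix} 1 & 1 & \cdots & 1 & 1 & 1 & \cdots & 1 \\ 0 & -\theta_1 & \cdots & -\theta_{t-1} & \sigma_1 & -\theta_{t+1} & \cdots & -\theta_\lambda \\ \vdots & \vdots & & \vdots & \vdots & \vdots & & \vdots \\ 0 & (-\theta_1)^\lambda & \cdots & (-\theta_{t-1})^\lambda & \sigma_\lambda & (-\theta_{t+1})^\lambda & \cdots & (-\theta_\lambda)^\lambda \end{vmatrix} \Bigg/ \begin{vmatrix} 1 & 1 & \cdots & 1 & 1 & \cdots & 1 \\ 0 & -\theta_1 & \cdots & -\theta_{t-1} & -\theta_{t+1} & \cdots & -\theta_\lambda \\ \vdots & \vdots & & \vdots & \vdots & & \vdots \\ 0 & (-\theta_1)^{\lambda-1} & \cdots & (-\theta_{t-1})^{\lambda-1} & (-\theta_{t+1})^{\lambda-1} & \cdots & (-\theta_\lambda)^{\lambda-1} \end{vmatrix}$

$= (-1)^{\lambda+t}\left[\sigma_1\phi_{\lambda-1}^{(t)} + \sigma_2\phi_{\lambda-2}^{(t)} + \cdots + \sigma_\lambda\right],$

where $\phi_h^{(t)}$ is obtained from $\phi_h$ by setting $\theta_t = 0$, and satisfies the equation $\phi_h^{(t)} = \phi_h - \theta_t\phi_{h-1}^{(t)}$. In view of the identity (3.5), we have

$F_t = (-1)^{\lambda+t}\left[-\sigma_0\phi_\lambda - \sigma_1(\phi_{\lambda-1} - \phi_{\lambda-1}^{(t)}) - \cdots - \sigma_{\lambda-1}(\phi_1 - \phi_1^{(t)})\right],$

and hence

(3.6)  $F_t = (-1)^{\lambda+t+1}\theta_t\left[\sigma_0\phi_{\lambda-1}^{(t)} + \sigma_1\phi_{\lambda-2}^{(t)} + \cdots + \sigma_{\lambda-1}\phi_0^{(t)}\right].$

Now consider the case $\lambda = 2$. We have

(3.2a)  $\sigma_0 = 1 - 1/n, \quad \sigma_1 = -(a_1k_1 + a_2k_2), \quad \sigma_2 = \sigma_1^2 + a_1^2k_1 + a_2^2k_2,$

(3.4a)  $F_0 = \theta_1\theta_2/n, \quad F_1 = \theta_1(\sigma_0\theta_2 + \sigma_1), \quad F_2 = -\theta_2(\sigma_0\theta_1 + \sigma_1),$

$F_0F_1F_2 = -F_0\theta_1\theta_2\left(\sigma_0^2\theta_1\theta_2 + \sigma_0\sigma_1(\theta_1 + \theta_2) + \sigma_1^2\right)$

$= -F_0^2 n(-\sigma_0\sigma_2 + \sigma_1^2) = F_0^2\left[(n - 1)\sigma_2 - n\sigma_1^2\right]$

$= F_0^2\left[(k_1 + k_2)(a_1^2k_1 + a_2^2k_2) - (a_1k_1 + a_2k_2)^2\right],$

(3.7a)  $F_0F_1F_2 = F_0^2(k_1k_2)(a_1 - a_2)^2 = KF_0^2F^2,$

where

(3.8a)  $F = (a_1 - a_2).$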










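To prove the lemma in this case, we show that $F$ divides $\Delta_0$.

(3.9)  $D(\rho) = \begin{vmatrix} a_0 - \rho & a_1 & a_2 \\ a_1k_1 & C_{11} - \rho & C_{12} \\ a_2k_2 & C_{21} & C_{22} - \rho \end{vmatrix} = \begin{vmatrix} a_0 - \rho & a_1 & a_2 \\ a_1k_1 & C_{11} - \rho & C_{12} \\ \rho_0 - \rho & \rho_0 - \rho & \rho_0 - \rho \end{vmatrix} = (\rho_0 - \rho)\begin{vmatrix} a_0 - a_2 - \rho & a_1 - a_2 \\ a_1k_1 - C_{12} & C_{11} - C_{12} - \rho \end{vmatrix} = (\rho_0 - \rho)(\rho_1 - \rho)(\rho_2 - \rho).$

$\Delta^2/n^2F_0^2 = \Delta_0^2 = (\rho_1 - \rho_2)^2 = (\rho_1 + \rho_2)^2 - 4\rho_1\rho_2$

$= (a_0 - a_2 + C_{11} - C_{12})^2 - 4(a_0 - a_2)(C_{11} - C_{12}) + 4(a_1 - a_2)(a_1k_1 - C_{12})$

$= (-a_2 - a_1c_{111} - a_2c_{211} + a_1c_{112} + a_2c_{212})^2 + 4(a_1 - a_2)(a_1k_1 - a_1c_{112} - a_2c_{212})$

$= (a_1 - a_2)^2(c_{112} - c_{111})^2 + 4(a_1 - a_2)^2c_{212}.$

(3.10)  $\Delta_0^2 = (a_1 - a_2)^2\left[(c_{112} - c_{111})^2 + 4c_{212}\right] = F^2R^2.$

This last bracket will be a perfect square

(3.11)  $R^2 = (c_{112} - c_{111})^2 + 4c_{212}$

if $k_1 \neq k_2$, since $\rho_1$ and $\rho_2$ will then be rational. Lemma I follows from (3.7a), (3.8a), (3.10), and (3.11), so the theorem is proved in this case.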

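The algebraic work, even for the case $\lambda = 3$, is quite complicated. Using the same notation, we have

(3.2b)  $\sigma_0 = 1 - 1/n, \quad \sigma_1 = -(a_1k_1 + a_2k_2 + a_3k_3), \quad \sigma_2 = \sigma_1^2 + a_1^2k_1 + a_2^2k_2 + a_3^2k_3, \quad \sigma_0\phi_3 + \sigma_1\phi_2 + \sigma_2\phi_1 + \sigma_3 = 0.$

(3.4b), (3.6b)  $F_0 = -\theta_1\theta_2\theta_3/n, \quad F_1 = -\theta_1\left[\sigma_0\theta_2\theta_3 + \sigma_1(\theta_2 + \theta_3) + \sigma_2\right], \text{ etc.}$

(3.7b)  $F_1F_2F_3/F_0 = -n\big[\sigma_0^3\phi_3^2 + 2\sigma_0^2\sigma_1\phi_2\phi_3 + \sigma_0^2\sigma_2\phi_1\phi_3 + \sigma_0\sigma_1^2(\phi_2^2 + \phi_1\phi_3) + \sigma_0\sigma_1\sigma_2(3\phi_3 + \phi_1\phi_2) + \sigma_0\sigma_2^2\phi_2 + \sigma_1^3(\phi_1\phi_2 - \phi_3) + \sigma_1^2\sigma_2(\phi_2 + \phi_1^2) + 2\sigma_1\sigma_2^2\phi_1 + \sigma_2^3\big].$

The left hand side may be simplified, and can be shown to equal $KF^2$, where

(3.8b)  $F = (a_2 - a_3)(a_2 - a_1)(c_{223} - c_{221}) + (a_3 - a_1)(a_3 - a_2)(c_{331} - c_{332}) + (a_1 - a_2)(a_1 - a_3)(c_{112} - c_{113}) - (a_1 - a_2)(a_2 - a_3)(a_3 - a_1).$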


By (2.11), we have $N/n^{\lambda-1} = KF^2/\Delta_0^2$, or $Kn^{\lambda-1}/N = \Delta_0^2/F^2$. If the numbers $k_i$ are distinct, the roots $\rho_1 \cdots \rho_\lambda$ must be rational, and so must $\Delta_0$, the product of their differences. $\Delta_0/F$ is therefore a rational fraction, and is independent of the parameters $a_i$. Assume it reduced to lowest terms. For $a_2 = a_3 = 0$, we see that the denominator must be a factor of $c_{112} - c_{113}$; similarly, it must be a factor of $c_{223} - c_{221}$, and of $c_{331} - c_{332}$, and hence a common factor of these three. But since we can choose $a_1, a_2, a_3$ so that $F$ has no factor in common with these three, the denominator must be 1, and $\Delta_0/F = R$ is a rational integer.


4. We shall now consider an infinite family of simply transitive groups for which we can compute the constants $n_i$, and shall show that our formula in Theorem I holds also for larger values of $\lambda$ than $\lambda = 3$. We define these groups as follows. Under the symmetric group of degree $\nu = \lambda + \mu$, the $n = \binom{\nu}{\lambda}$ combinations of the $\nu$ symbols taken $\lambda$ at a time will be permuted among themselves by a transitive group $G^*$. We may denote these combinations by variables $x_{i_1 i_2 \cdots i_\lambda}$, defined to be independent of the order of the subscripts. The subgroup $G_1$ leaving fixed the variable $x_{12\cdots\lambda}$ permutes transitively among themselves those variables which have exactly $\lambda - t$ of the subscripts $1, 2, \cdots, \lambda$, and $t$ of the remaining $\mu$ subscripts. Hence we have $r = \lambda + 1$ sets of transitivity $T_t$, $t = 0, 1, 2, \cdots, \lambda$, which contain respectively $k_t = \binom{\lambda}{t}\binom{\mu}{t}$ variables.

The structure coefficient $c_{abt}$ already defined in (2.2) for the ring of the matrices $V = \sum_{i=0}^{\lambda} a_iV_i$ is the coefficient of $a_aa_b$ in the inner product of the row of $V$ corresponding to the variable $x_{1,2,\cdots,\lambda}$, and the column corresponding to the variable $x_{t+1,t+2,\cdots,t+\lambda}$. It is given explicitly by the formula


(4.1) we ee eae OR ee ee | 


We verify that 


son = £6 G2 Oost) = G20) =H 
(4.1b) Co = (‘) (" 7 ‘) = ba. 


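Let the characteristic equation $\det\left(\sum_a a_ac_{abt} - \rho\delta_{bt}\right) = 0$ have the roots

(4.2)  $\rho_i = \sum_{t=0}^{\lambda} \rho_{it}a_t.$

Then by solving for simple cases, and generalizing the result, we surmise that

(4.3)  $\rho_{it} = \sum_j (-1)^j \binom{i}{j}\binom{\lambda-i}{t-j}\binom{\mu-i}{t-j}; \qquad \rho_{0t} = k_t; \qquad \rho_{i0} = 1.$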




















COMPONENTS OF SIMPLY TRANSITIVE PERMUTATION GROUPS 15 


To prove that this formula is correct, it is sufficient to show that

(4.4)  $\sum_{t=0}^{\lambda} c_{abt}\,\rho_{it} = \rho_{ia}\,\rho_{ib},$

since if we multiply by the corresponding $\rho_{it}$ each column of $\det\bigl(\sum_a a_a c_{abt} - \rho\,\delta_{bt}\bigr)$
and add it to the first, this first column then has in its $(b + 1)$-th row the
quantity

$\sum_{a,t} a_a c_{abt}\,\rho_{it} - \rho\,\rho_{ib} = \sum_a a_a \rho_{ia}\rho_{ib} - \rho\,\rho_{ib} = (\rho_i - \rho)\rho_{ib}.$

Hence the determinant vanishes for $\rho = \rho_i$. Equation (4.4) follows from the
rather complicated identity


tment FC Moe MT Nag d= e- JM OEE) 
(Fa) = FM NO=NE= JE) = N=.) 


for all $\lambda$, $\mu$, $i$, $a$, $b$; the summation being taken over all values of the arguments
for which the binomial coefficients are different from zero.
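Relation (4.4) can be tested numerically for small λ and μ. The sketch below is an illustration under stated assumptions, not the author's computation: the structure coefficients are obtained by brute-force counting, and for the roots it uses the Eberlein-polynomial expression for the eigenvalues of this family, which is standard in the later literature and is assumed here; it may differ in form from the author's (4.3). The function names are hypothetical, and μ ≥ λ is assumed so that every set of transitivity is non-empty.

```python
from itertools import combinations
from math import comb


def eberlein(lam, mu, i, t):
    """Assumed closed form for rho_{it} (Eberlein polynomial), mu >= lam."""
    return sum((-1) ** s * comb(i, s) * comb(lam - i, t - s) * comb(mu - i, t - s)
               for s in range(t + 1))


def structure_constants(lam, mu):
    """c[(a, b, t)] = number of subsets z with d(x, z) = a and d(z, y) = b,
    for a fixed pair x, y with d(x, y) = t, where d(p, q) = |p - q|."""
    nu = lam + mu
    subsets = [frozenset(c) for c in combinations(range(nu), lam)]
    x = subsets[0]

    def dist(p, q):
        return len(p - q)

    c = {}
    for t in range(lam + 1):
        y = next((s for s in subsets if dist(x, s) == t), None)
        if y is None:
            continue
        for z in subsets:
            key = (dist(x, z), dist(z, y), t)
            c[key] = c.get(key, 0) + 1
    return c


def check_44(lam, mu):
    """True when sum_t c_{abt} rho_{it} == rho_{ia} rho_{ib} for all i, a, b."""
    c = structure_constants(lam, mu)
    r = range(lam + 1)
    return all(
        sum(c.get((a, b, t), 0) * eberlein(lam, mu, i, t) for t in r)
        == eberlein(lam, mu, i, a) * eberlein(lam, mu, i, b)
        for i in r for a in r for b in r
    )


if __name__ == "__main__":
    print(check_44(2, 3), check_44(3, 3))
```

With this assumed form one also recovers the two special values quoted with (4.3), namely ρ_{0t} = k_t and ρ_{i0} = 1.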
In order not to interrupt the continuity, we postpone the proof of Lemma II
until §5, and proceed to determine the constants $n_i$. The equation (2.8), for
$h = 1$, is $\sum_{i=0}^{\lambda} n_i \rho_i = n a_0$, and actually involves $\lambda + 1$ equations

(4.5)  $\sum_{i=0}^{\lambda} n_i\, \rho_{it} = n\,\delta_{0t}, \qquad t = 0, 1, \cdots, \lambda,$

which can be solved uniquely for the constants $n_i$. The determinant $P =
\det(\rho_{it})$ has the value


_ A+1 ‘ vy—2j 
(4.6) P = (—1) at,-%) 
and 


(4.7)  $n_i = \binom{\nu}{i} - \binom{\nu}{i-1} = \dfrac{(\nu + 1 - 2i)\,\nu!}{(\nu + 1 - i)!\; i!}.$
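The degrees (4.7) and the statement of Theorem I for this family can be checked on small cases. This is a sketch under assumptions: it takes the quantity in (4.8) to be $n^{\lambda-1}K/N$ with $K = \prod k_t$ and $N = \prod n_i$ (that is, exponent $r - 2$ with $r = \lambda + 1$), and it assumes $\mu \ge \lambda$ so that every $n_i$ is positive. The function name `frame_quotient` is hypothetical.

```python
from fractions import Fraction
from math import comb, factorial, isqrt, prod


def frame_quotient(lam, mu):
    """Return (degrees n_i, n^(lam-1) * K / N) for the family of section 4."""
    nu = lam + mu
    n = comb(nu, lam)
    K = prod(comb(lam, t) * comb(mu, t) for t in range(lam + 1))
    degrees = [comb(nu, i) - (comb(nu, i - 1) if i else 0)
               for i in range(lam + 1)]
    N = prod(degrees)
    return degrees, Fraction(n ** (lam - 1) * K, N)


def closed_form_degree(nu, i):
    """Right-hand member of (4.7)."""
    return Fraction((nu + 1 - 2 * i) * factorial(nu),
                    factorial(nu + 1 - i) * factorial(i))


if __name__ == "__main__":
    for lam, mu in [(1, 4), (2, 2), (2, 3), (3, 3), (3, 5)]:
        degrees, q = frame_quotient(lam, mu)
        is_square = q.denominator == 1 and isqrt(q.numerator) ** 2 == q.numerator
        print(lam, mu, degrees, q, is_square)
```

The degrees sum to $n$, as they must, since the sum telescopes to $\binom{\nu}{\lambda}$.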


We may now verify Theorem I for these groups. We have 


r aN 
P= [Im = TT (41 - ol +1 - Otay 
ny » 
n\K = [I nk = T] fotta—oliw@—-vx- OY 


n\K/N = TT {@+1-0VYA-d)!t@-A-D!W41—- 2} 










(4.8) n’"K/N ait {v—O)VA—OLt(—-D!~—-A-—O! (V+ 1 — W®W} = PR, 
gy 
where 
(A/2) 
(4.9) R = [II 1/(¢ — on | IT {(@ — 2s)!/(@ — 1 — 2d + 2s)!}. 
t=1 s=1 


The quantity R is easily shown to be an integer, and the theorem is proved for 


this family of groups. 


5. We close with a proof of Lemma II, which in itself involves interesting 
relations between binomial coefficients. Use is made of the following identities, 


which may be readily verified. 


(5.0) E(-1"(54 ys - (5-2). 
(5.8) it ire Se (“5 D(a). 
(5.7) p> F 4 ,) Pes )> F + D>): 


Proof of Lemma II. 


t t A-—t p—t ut —t\/u—2 
(5.1) p> fe Se 8 | ee i ome 56-3 
utov\f/u+tv\/A—u—v\/A-72 
“E-TOC Ne) 
(oa ere re) (ty), where v=t— 4, 
s ai 6 utv+s—r—q —v—s+27+4+q v 
= BN ee Nese 


wn ale bua» Moiese aD 
ee 


by repeated application of (5.y) and (5.8). Now if we rearrange the factorials 
in these binomial coefficients in a different way, and set 


_ (rx -—i\(z v u—t\/y v 
f(05 9,442, 0) = ( r VEC. 24 y | 4] aa 

























we obtain for (5.3) the expression 


uf t—p-—q+r i—u 
mf) Et eee | ee 


( a ({) se Ps % % ¥); 


q 
oii ptq—rt+stu—z $+ v—wZ 
= 2D at ee Oe 
Y 


q 2 
(‘) _- ‘) (") s0: P, % Z,Y), 
p+q 0 
= Peay’) fa csiuiiee ae es 


(0.22) (Danes 


(5.7) = dX (- we ( VO \iotatety—a-b; Ps Uy Xs Y)s 


ENON Manta) 
Bs ere 

6) =E( (NICIC-IC IE ) 

ai Eo) )Eco(JO-) Em, 


which proves the lemma. 


Brown UNIVERSITY. 





STRESSES IN MODERATELY THICK RECTANGULAR PLATES 


R. Archie Higdon and D. L. Holl


1. Introduction 





In 1922 G. D. Birkhoff¹ suggested a method for solving plate problems which
involves the representation of the displacements by power series. C. A. Garabedian²
and H. W. Sibert³ used this idea in developing methods for solving problems
in moderately thick circular plates. Garabedian⁴ has also published some
results for uniformly loaded rectangular plates.

The authors give a solution for the displacements in an elastic isotropic
moderately thick rectangular plate under the action of any given load which
can be expressed as a polynomial in x, y continuous over the entire plate and
with prescribed boundary conditions at the edges. The method, similar to
that used by Sibert⁵ for circular plates, is based on the assumption that the
components of displacement can be developed in positive integral powers of z.
In this type of problem, the displacements must satisfy (a) the stress equations
of equilibrium throughout the plate, (b) the surface traction conditions on the
upper and lower faces, (c) the boundary conditions at the edges.


2. General theory 


a. Form of the displacements. The displacements, u, v, w, are given by

(1)  $u = \sum_{n=0}^{\infty} U_n \dfrac{z^n}{n!}, \qquad v = \sum_{n=0}^{\infty} V_n \dfrac{z^n}{n!}, \qquad w = \sum_{n=0}^{\infty} W_n \dfrac{z^n}{n!},$

where $U_n$, $V_n$, and $W_n$ are continuous and continuously differentiable functions
of x, y. The equations of equilibrium are (A. E. H. Love, p. 134)


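(2)  $(\lambda + \mu)\left(\dfrac{\partial}{\partial x}, \dfrac{\partial}{\partial y}, \dfrac{\partial}{\partial z}\right)\Delta + \mu\nabla^2(u, v, w) = 0.$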


Received July 15, 1936; presented to the American Mathematical Society, April 11, 1936.
1 G. D. Birkhoff, Circular plates of variable thickness, Phil. Mag., vol. 43 (1922), pp. 953-962.
2 C. A. Garabedian, Circular plates of constant or variable thickness, Trans. Amer. Math. Soc., vol. 25 (1923), pp. 343-398.
3 H. W. Sibert, Moderately thick circular plates with plane faces, Trans. Amer. Math. Soc., vol. 33 (1931), pp. 329-369.
4 C. A. Garabedian, Comptes Rendus, Paris, vols. 178 (1924), 180 (1925), 181 (1925), 186 (1928), 195 (1932).
5 Loc. cit.
6 In this paper all references to Love are to the fourth edition of his Mathematical Theory of Elasticity, 1927.




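When equations (1) are substituted in (2) and the coefficients of like powers of
z are equated, there results⁷

(3.1)  $U_n = -\dfrac{1}{1 - 2\sigma}\left[(1 - 2\sigma)\nabla^2 U_{n-2} + \dfrac{\partial^2 U_{n-2}}{\partial x^2} + \dfrac{\partial^2 V_{n-2}}{\partial x\,\partial y} + \dfrac{\partial W_{n-1}}{\partial x}\right],$

(3.3)  $W_n = -\dfrac{1}{2(1 - \sigma)}\left[(1 - 2\sigma)\nabla^2 W_{n-2} + \dfrac{\partial U_{n-1}}{\partial x} + \dfrac{\partial V_{n-1}}{\partial y}\right].$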


By successive applications of the recurrence relations (3), $U_n$, $V_n$, and $W_n$
are expressed in terms of $U_0$, $V_0$, $W_0$, $U_1$, $V_1$, and $W_1$. The results are


(4.1) Ux, = (—ptv| y "Uo + nM, 


(4.3) Wasi = (—1)*0™ ja — 26 —k)W, -— (3 + me) | 


(4.4) Una = (—1)'v** | Ui+k I, 


(4.6) Wa = (-1)'v"" t —2 +4 20)Wo + (a + ~)I, 
where 


(1 — 26), = 2Ue , Yes wy, 


oy 
(1 — o)W, = 24 4 OM: _ ey, (k = 0, 1, 2,3, ---). 
Ox oy 


When formulas (4) are substituted in (1) there results 


~— aw] 2” 
w= (19 [ru +4 Ms) om 


2k—2 r aW, Zz 
+t 1)‘v [ruses i a 


(5.1) 


2k+1 


2k 


— = k yg 2 au, av; Zz 
w= 2 (- 1)"V [2+ 26) + (2 + 2) om 
ko2k : aUo , aVo "si 
+ ps (—1)'V [a — 20 —k)W, — (2 + we) GED! 


These expressions for the displacements satisfy formally the stress equations 
of equilibrium throughout the body. 


7 Here and henceforward $\nabla^2 = \dfrac{\partial^2}{\partial x^2} + \dfrac{\partial^2}{\partial y^2}$.

8 (3.2) is omitted because it can be obtained from (3.1) by interchanging x and y, U and
V, etc. Throughout this paper an equation will be omitted when it can be obtained from
the preceding equation in this manner.













b. Surface traction equations. A right-handed coördinate system with the
faces of the plate $z = \pm h$, $x = 0$, $x = a$, $y = 0$, and $y = b$ will be used. The
x, y, and z components of the surface tractions will be designated by $L_1$, $J_1$,
and $P_1$ respectively on the upper face and by $L_2$, $J_2$, and $P_2$ respectively on the
lower face. Using the notation of Love (p. 77), the surface traction conditions
on the upper and lower faces may be written as

(6)  $(X_z)_{z=h} = L_1, \qquad (X_z)_{z=-h} = L_2,$
     $(Y_z)_{z=h} = J_1, \qquad (Y_z)_{z=-h} = J_2,$
     $(Z_z)_{z=h} = P_1, \qquad (Z_z)_{z=-h} = P_2.$
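The stresses in terms of the displacements are (Love, p. 101)

(7)  $Z_z = \dfrac{2\mu}{1 - 2\sigma}\left[(1 - \sigma)\dfrac{\partial w}{\partial z} + \sigma\left(\dfrac{\partial u}{\partial x} + \dfrac{\partial v}{\partial y}\right)\right],$

     $X_z = \mu\left(\dfrac{\partial u}{\partial z} + \dfrac{\partial w}{\partial x}\right), \qquad Y_z = \mu\left(\dfrac{\partial v}{\partial z} + \dfrac{\partial w}{\partial y}\right).$

Substitute equations (5) in (7) and the results in (6). Then take the sum and
difference of the two resulting values of $Z_z$, $X_z$, and $Y_z$. The final equations
are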


oo 


2k-2 k vu; av: 
2 (-v'v [wu + A. (SS +e) 














k=0 
(8) o 
_ k—-1l+¢e (™: ") | h _h — Lz 
—o (2k)! _ ? 
= k ew 2k k—1+¢ 2, cyte (a 1 n* 
410) 2 (-) * pat ve — 21 —«) \a (2k + 1)! 
_P—P 
ph’ 
. k-+1 go 2k ae aU Ly 
- 2 (-pv [vit + RAD (EU ee 
+ es) _lL—-L, 
1—2c¢ dc \(2k+1)! Bh ’ 











= k+1 gtk kas (20 ve) k—l+e al h* _Pi+Pr 
a3) 2-9 E eee taJtaics “jan” 

Equations (8), (9) and (10) involve only $W_0$, $U_1$ and $V_1$, and equations (11),
(12) and (13) involve only $U_0$, $V_0$ and $W_1$. These two simultaneous systems
of equations can be solved by an indirect process due to Sibert.⁹ This process


9 Loc. cit.

























requires that $U_0$, $U_1$, $V_0$, $V_1$, $W_0$ and $W_1$ be expressed as infinite sequences
of terms of ascending order of magnitude. Let s represent a first degree function
of x, y. Order of magnitude may then be defined as follows: If r and t
are two functions of s which contain the same number of terms, t is defined to
be of the n-th order of magnitude as compared to r if each term of t is proportional
to $(h/s)^n$ times the corresponding term in r. It then follows that $h^{2n}\nabla^{2n} r$
is of the 2n-th order of magnitude as compared to r. It is necessary to assume
that $U_0$, $V_0$, $W_0$, $U_1$, $V_1$, and $W_1$ are expressions in x, y which involve h in
such a manner that their terms can be grouped and arranged in ascending order
of magnitude.

Since equations (8) to (13) inclusive have been arranged so that only even
powers of h occur in their left members, it is only necessary to provide for even
orders of magnitude. Therefore

(14.1)  $U_0 = \sum_{n=0}^{\infty} U_{2n,0}, \qquad U_1 = \sum_{n=0}^{\infty} U_{2n,1};$

(14.3)  $W_0 = \sum_{n=0}^{\infty} W_{2n,0}, \qquad W_1 = \sum_{n=0}^{\infty} W_{2n,1};$

where $W_{2n,0}$, $U_{2n,0}$, $V_{2n,0}$, $W_{2n,1}$, $U_{2n,1}$, and $V_{2n,1}$ are of the 2n-th order of
magnitude as compared to $W_{00}$, $U_{00}$, $V_{00}$, $W_{01}$, $U_{01}$, and $V_{01}$ respectively. It is
assumed that $W_{00}, \cdots, V_{01}$, being the terms of lowest order of magnitude,
do not vanish identically unless $W_0, \cdots, V_1$ respectively are identically zero.

For simplicity the problem will now be restricted to the case of a normal 
surface load only. The solution for the case of a shearing load is very similar 
to this case. By superposing these solutions, the results for more complicated 
problems can be obtained. 


c. General solutions for $W_0$, $U_1$, and $V_1$. In order to solve equations (8),
(9), and (10) simultaneously it is necessary to write each one as an infinite system
of equations by equating terms of the same order of magnitude. The right
members of equations (8) and (9) are now zero, but the order of magnitude of
the right member of (10) must be determined. Assume it to be of the same
order of magnitude as $\nabla^2 W_{00}$. Let $P_1 - P_2 = -p_1$, where $p_1$ is a function of
x, y. Then the equations of lowest order of magnitude in (8), (9), and (10)
may be written


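(8.0)  $U_{01} + \dfrac{\partial W_{00}}{\partial x} = 0,$

(10.0)  $\nabla^2 W_{00} + \dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y} = \dfrac{p_1}{2\mu h}.$

But $\dfrac{\partial (8.0)}{\partial x} + \dfrac{\partial (9.0)}{\partial y}$ gives $\nabla^2 W_{00} + \dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y} = 0$. This result is inconsistent
with (10.0). Therefore $p_1/(2\mu h)$ must be of the fourth or higher order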
















of magnitude as compared to $W_{00}$. It will be assumed now and proved later
that $p_1/(2\mu h)$ is of the fourth order of magnitude as compared with $W_{00}$. Equations
(8), (9), and (10) may now be written as infinite systems as follows:


(8.0) Un + = = 0, 





, ” 2n.0 = LA 9 k U n—2. 
Ura + =~ + 2 (-1)*™ ‘| vv, 2,1 + 7(2 — 


k=1 








(8.n) 2 m 
o "on—2k.1 k — I + og aw. 2n—2k,0 h- 
-3 v = 0 = 1,2,3, -<-). 
+ axdy . l-¢ (2 ax ‘| (2k)! (n 1 2,3, +++) 
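Since $\dfrac{\partial (8.0)}{\partial x} + \dfrac{\partial (9.0)}{\partial y} \equiv (10.0)$, it is necessary to form another system of
equations by subtracting $\dfrac{\partial (8.n)}{\partial x} + \dfrac{\partial (9.n)}{\partial y}$ from (10.n). The resulting system
is¹⁰

(15.0)  $0 = 0,$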
2 an ava) 27 | 2p—r 
; vy} (2— — + —]-oV = — 
(15.1) [ (20 >? oV Wem D’ 
- BV on OV en—2k,1 
1 k+ lv k © cs (% 12k, 1 2 v—2k “) 
2 — al + °) Ox + ay 
(15.n) 6kh2* 
Sur 7 is ej 
— (k — 1 a a)V W mato] (2k + 1)! = 0 (n = 2, 3, 4, ee -). 


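Systems (8), (9), and (15) can now be solved simultaneously for $W_{2n,0}$, $U_{2n,1}$
and $V_{2n,1}$. Equations (15.1) and the Laplacian of $\dfrac{\partial (8.0)}{\partial x} + \dfrac{\partial (9.0)}{\partial y}$ become

$\nabla^2\left[(2 - \sigma)\left(\dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y}\right) - \sigma\nabla^2 W_{00}\right] = \dfrac{2 b_0\, p_1}{D},$

$\nabla^2\left[\left(\dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y}\right) + \nabla^2 W_{00}\right] = \dfrac{2 d_0\, p_1}{D},$

respectively, where $b_0 = 1$ and $d_0 = 0$. The simultaneous solution is

$\nabla^4 W_{00} = \dfrac{p_1}{D}\left[(2 - \sigma)d_0 - b_0\right], \qquad \nabla^2\left(\dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y}\right) = \dfrac{p_1}{D}\left[\sigma d_0 + b_0\right].$


10 $D = \dfrac{4\mu h^3}{3(1 - \sigma)} = \dfrac{2Eh^3}{3(1 - \sigma^2)}$.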

























. rf 
Equations (15.2) and the Laplacian of (8.1) 4 (9.1) become 
ox 02 
2 aU 2 OV, Senn 2h bi VT" pr 
ri. iS .... staat. ee bo On |b el 
v | (2 o)( = + re ) oV W »| D : 
o| (AU OV_ cae 2h* dV" py, 
Vv - - “Wa | = —x—, 
(2 + Va) 4 Ws D 
respectively, where b; = Sad a J _ The simultaneous solution is 
Ss} J? = 5 « 1 a1 — o) ‘ f b, ss ‘ 
— »(aUn . aVs tA. 
vin 20 > pr pl(2 _ a)d; _ bil, ( — —_ Ve) => 5 V plod, ob by]. 


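Continuing in this manner, by solving (15.n+1) with the Laplacian of
$\dfrac{\partial (8.n)}{\partial x} + \dfrac{\partial (9.n)}{\partial y}$, one obtains the following general solutions:

(16)  $\nabla^4 W_{2n,0} = \dfrac{h^{2n}}{D}\nabla^{2n} p_1\left[(2 - \sigma)d_n - b_n\right],$

(17)  $\nabla^2\left(\dfrac{\partial U_{2n,1}}{\partial x} + \dfrac{\partial V_{2n,1}}{\partial y}\right) = \dfrac{h^{2n}}{D}\nabla^{2n} p_1\left[\sigma d_n + b_n\right],$

where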


i 6(7 + 2) {(@ + 2)bn—1-i = (7 + Ha —_ o) dys} 
(27 + 5)! 


n—l 
(18) b. = D (—1) 
(i + 1)bn—1-i - (1 — o)dy—1-i 


GN (n = 1,2,3, +++). 


(19) dy = D(-1) 


Sibert has given upper bounds of the sequences b,, and d,, as 


1 1 ve 1 

| bn | tah | dn | — - = 2,3,---). 

}Onl S35 In! S 3749 — 9) (n ) 
Since (16.0) is $\nabla^4 W_{00} = -p_1/D$, it is the differential equation defining the vertical
displacement in thin plate theory (Love, p. 488), and $W_{00}$ is the vertical displacement
of the corresponding thin plate. Consequently the vertical displacement
$W_0$ of the middle surface of a moderately thick plate is made up of the
corresponding thin plate displacement $W_{00}$ plus corrections $W_{20}$, $W_{40}$, etc.

By combining equations (16), (17), and (8), $U_{2n,1}$ is obtained in terms of
$W_{2n,0}$. Similarly (16), (17), and (9) yield $V_{2n,1}$. The results are
















al 00 








(20.0) Un + — =0, 

: OWe».0 ” OWon2. 

on a - — v° = — 
a Van + Ox a ee ( ax * 
20.0 

— fon (2 = i — On 
+ . (> ) | 2 + — - =] = 0 (n - l, 2, 3, ve), 
Ox l-—« 





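If $W_{2n,0}$ is eliminated from equations (20) and (21) the relation
$\dfrac{\partial U_{2n,1}}{\partial y} = \dfrac{\partial V_{2n,1}}{\partial x}$ is obtained. This relation combined with (17) gives

(22)  $\nabla^4 U_{2n,1} = \dfrac{h^{2n}}{D}\left[\sigma d_n + b_n\right]\nabla^{2n}\left(\dfrac{\partial p_1}{\partial x}\right).$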


It is necessary to complete the proof that $p_1/(2\mu h)$ is of the fourth order of
magnitude as compared to $W_{00}$. It has already been shown to be either of the
fourth or of a higher order of magnitude. Assume that its order of magnitude
as compared to $W_{00}$ is greater than the fourth. Then equations (15.1) and the
Laplacian of $\dfrac{\partial (8.0)}{\partial x} + \dfrac{\partial (9.0)}{\partial y}$ become

$\nabla^2\left[(2 - \sigma)\left(\dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y}\right) - \sigma\nabla^2 W_{00}\right] = 0,$

$\nabla^2\left[\left(\dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y}\right) + \nabla^2 W_{00}\right] = 0,$

respectively. The simultaneous solution is

$\nabla^4 W_{00} = \nabla^2\left(\dfrac{\partial U_{01}}{\partial x} + \dfrac{\partial V_{01}}{\partial y}\right) = 0.$

Since the trivial solution $\nabla^4 W_{2n,0} = 0$ does not depend upon the load, $p_1$ must
occur on the right side of some one of equations (16). But since (16.0) is the
equation of lowest order of magnitude in the system (16), its right member
cannot be zero unless the right members of all equations of the system are zero.
From this contradiction it follows that the term $p_1/(2\mu h)$ must be of the fourth
order of magnitude as compared to $W_{00}$.

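Finally, equations (16), (17), (22), and (23) substituted in (14) give the following
general solutions for $W_0$, $U_1$, and $V_1$:

(24)  $\nabla^4 W_0 = \sum_{n=0}^{\infty} \dfrac{h^{2n}}{D}\nabla^{2n} p_1\left[(2 - \sigma)d_n - b_n\right],$

(25)  $\nabla^4 U_1 = \sum_{n=0}^{\infty} \dfrac{h^{2n}}{D}\nabla^{2n}\left(\dfrac{\partial p_1}{\partial x}\right)\left[\sigma d_n + b_n\right],$

(27)  $\nabla^2\left(\dfrac{\partial U_1}{\partial x} + \dfrac{\partial V_1}{\partial y}\right) = \sum_{n=0}^{\infty} \dfrac{h^{2n}}{D}\nabla^{2n} p_1\left[\sigma d_n + b_n\right].$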










In 1899 J. H. Michell¹¹ published the differential equation defining $W_0$ in
the form


4yyr Sh «+ F 2 5) | l+o (#) 
vm~ - S59 (e+ SL-*F°"(S). 


It is easily shown that this solution becomes identical with equation (24) when
the stress $Z_z$ is expressed in terms of the load.








d. General solutions for $W_1$, $U_0$, and $V_0$. Let $P_1 + P_2 = -p_2$, where $p_2$
is a function of x, y. It can be proved that the right member of (13) is of the
same order of magnitude as $W_{01}$. Then equations (11), (12), and (13) can be
written as infinite systems of equations. These systems can be solved in essentially
the same manner as that used to obtain the simultaneous solution of systems
(8), (9) and (15). The general solutions for $U_0$, $V_0$ and $W_1$ are








hed 2n 
(28) vm = -> 7 Vv"? [(1 — olen + oael, 
4 = he" 2n+2 Ope 
(29) V Uo => u hal (22) [oc, aa (1 _ a)ap), 
aU, , aV — hh on 
where a = 0, @ = 1, 
n—l . ° 
vr (i + anu: — (i + 1)en—1-+ | 
= m= 2 (-1) (2 + 3)! | 
= 4D | (n= 1,2, --»). 
7 _ 1) An—-1-i — Up-1-i 
” m= 2 (-1 (2i + 2)! 
Sibert has given as upper bounds for these sequences 
lanl <5, lenl < gon: nw 1, S*--) 


The displacements u, v, w are given by relations (5) when the six functions
$U_0$, $V_0$, $W_0$, $U_1$, $V_1$, and $W_1$ are known. Therefore one can say that the differential
equations (24) to (31) inclusive define the displacements. Furthermore,
the displacements defined by these differential equations satisfy the equilibrium
equations and the surface traction conditions for any normal load which can
be expressed as a polynomial in x, y continuous over the entire plate. It remains
to solve these differential equations subject to particular sets of edge
conditions.


11 J. H. Michell, On the direct determination of stress in an elastic solid with application 
to the theory of plates, Proc. Lond. Math. Soc., vol. 31 (1899), pp. 100-124. 










3. Normal load, $p_1(x, y) = p_2(x, y) = P\left(\lambda + \dfrac{\alpha x}{a} + \dfrac{\beta y}{b}\right)$


a. Preliminary relations and remarks. The displacements will now be found
for a normal load $P\left(\lambda + \dfrac{\alpha x}{a} + \dfrac{\beta y}{b}\right)$ on the top surface of the plate where a and b
are the horizontal dimensions of the plate, $\lambda$, $\alpha$ and $\beta$ are arbitrary constants,
and P is a uniform load per unit of area. In this case equations (24)-(31)
become


‘ —P ax . By 
34 “WwW. = A+ —4+> 
wy D ( + a + b/’ 
; Pa 
35) VU; =- 
- = 
, 9 al 1 oF) 5( ar By 
(37 v(- = etl + — = 
ot) (7 + ay D + a + b/}’ 
(38) vw; = 0, 
(39) vil, = 90, 
ofl) , aVo 
41 vv = 0. 
(41) (= * mv) 
The load is a linear function of x, y and it is of the fourth order of magnitude
as compared with $W_{00}$. Hence all terms whose orders of magnitude are $\ge 6$
vanish. Therefore all infinite systems and sequences of the previous section
become finite. The following results are readily deduced.





(42) Ura Se _. e v (%) - “ = a v (*), 

wo mena (ie) eG 68) 
HEE (Sant Sa cay “(Fear — SE" ae 

(47) w= Wo+ Wiz + a = VW o+ 3 to v'We +t = v' Wo ii 


The problems of plane and generalized plane stress are very easily solved
by use of equations (34) to (47) inclusive. For these problems P = 0, and
since $W_{01}$, $\partial U_0/\partial x$ and $\partial V_0/\partial y$ are of the same order of magnitude as P, they
may be chosen equal to zero. With these simplifications, the values of the stresses
obtained from equations (45), (46) and (47) agree with those given by Love
(pp. 467-471).



















The problem of this investigation is restricted to moderately thick plates
because the tractions applied to the edges are represented by their force- and
couple-resultants taken along a vertical element of an edge. The boundary
conditions at an edge will be defined in terms of the components of these resultants;
namely, T, S, N, G, and H (Love, p. 455). At a pinned edge T, G, and
$W_0$ vanish. At a free edge T, G, $N - \partial H/\partial s$ vanish, s denoting the direction
along the edge. At a clamped edge $U_0$, $V_0$, $W_0$, and $\partial W_0/\partial n$ vanish, n denoting
the direction of the normal to the edge line. These components must be expressed
in terms of the displacements before they can be used here. The results
are

4uh [avy aV, oP at ) 

T,= He 2 ee aad shed 
, jie | ao + 0 (at +5 


—h’ { P un) a (we: ave | 
6 \a aad (u ax? \ ax + 5) ? 


3 2a" 2a" 27 
(50) G; = —4uh | a 7 a Wo + a _h?y'W, — 8+e wee) | 





~ 30 —oe)L or °° ay * 100 —o) i0 ay? 
0H. —4yh’ | aw, aw 
N: —- — = =| (2 - —— + —— 
. ax 3(1 — a) ( °) ax? dy ay® 


aus 8 + aw (8 — 30)h° 4 (aw 

Oo 222 0 —_ on 4 0 
+ Sea (Ss) + een "(ey 
Further, each equation which results from equating quantities (48) to (52) to
zero must be written as a system of equations by equating terms of like orders of
magnitude. The equation of lowest order of magnitude in each case will be
referred to as (48.0), (49.0), ..., (52.0); the next order will be referred to as
(48.1), (49.1), ..., (52.1), etc.

The method of solution used here does not give $W_0$ directly. First $W_{00}$ is
obtained, then $W_{20}$, etc., and finally $W_0$ is obtained from equation (14). It has
already been pointed out that $W_{00}$ is the vertical displacement for the corresponding
thin plate under the same normal surface load. By observing that conditions
(50.0), (51.0) and (52.0) are precisely the corresponding edge conditions
from thin plate theory, one may employ the $W_{00}$ solution of the thin plate problem
in case it is available. The thin plate solution for the particular load of this
problem has not been recorded and it will now be obtained for three sets of edge
conditions.


b. Solution for $W_{00}$. The following Fourier expansions, which are valid in
the interval 0 < x < a, will be needed.
2 4r sin 6x 2 > ; sin bx 


r = —— <a 
(53) see-D-— bp “sme oe 





1 
. a,2 3 _ tae nti Sinér 2 — .sin 6x 
(54) Gq @2-*)= » (-1) —g- @ = 23 ra 


a n=1 
















P 2 sin @xr 
55) X = = ; 0 Yesnstie: 
(55 +> a ita e ? 
” ha‘ (/x* 2r° +2) $e. .Ga a, 
56 a 2 
(56) 24 (2 et a a p> ' 
4 5 3 a oo 
_ aa {x 2x 7x 2 . sin ao 
=) oxy (2 — 3a 7 iz) = aie 
where

$\theta = \dfrac{n\pi}{a}, \qquad i = \lambda[1 - (-1)^n], \qquad j = \alpha(-1)^{n+1}, \qquad m = \beta[1 - (-1)^n].$
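The role of the coefficients i and j can be illustrated numerically. The sketch below assumes the standard sine-series expansion $\lambda + \alpha x/a = \frac{2}{a}\sum_{n\ge 1} (i + j)\,\sin\theta x/\theta$ on 0 < x < a, which is consistent with the definitions above but is not copied from the (garbled) displays (53) to (57); the function name `partial_sum` and the sample values are hypothetical.

```python
import math


def partial_sum(x, a, lam, alpha, terms=20000):
    """Partial sum of (2/a) * sum_n (i + j) * sin(theta x) / theta,
    with theta = n pi / a, i = lam [1 - (-1)^n], j = alpha (-1)^(n+1)."""
    total = 0.0
    for n in range(1, terms + 1):
        theta = n * math.pi / a
        i = lam * (1 - (-1) ** n)
        j = alpha * (-1) ** (n + 1)
        total += (i + j) * math.sin(theta * x) / theta
    return 2.0 / a * total


if __name__ == "__main__":
    a, lam, alpha, x = 1.0, 1.0, 0.5, 0.3
    # The series converges slowly (coefficients decay like 1/n), so many
    # terms are needed even at an interior point.
    print(partial_sum(x, a, lam, alpha), lam + alpha * x / a)
```

The coefficient m plays the same role as i for the term in β, which is why the combinations i + j and i + j + m appear in the edge equations at y = 0 and y = b.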


In the problems to be considered, the edges x = 0, a will always be pinned.
A solution satisfying edge conditions $W_{00} = 0$ and (50.0) at x = 0, a and satisfying
the differential equation (16.0) or (34.0) is given by


ee — Pa‘ br) (2 - = *) (5 - 2° 7x =) | 
vo" Sb | (x + b Nat — at t+ a) + “\ sas — 3a + 8a 


“D2 +> = [A, shéy + B, chéy + C, 6y shéy + D,6y ch dy], 
cf 


(58) 


where the constants $A_n$, $B_n$, $C_n$ and $D_n$ are to be determined by the conditions
at y = 0, b.
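The polynomial part of (58) can be checked with exact arithmetic. The sketch below assumes, from the legible fragments of (56), (57) and (58), that the two polynomials in ξ = x/a are ξ⁴ − 2ξ³ + ξ and ξ⁵/5 − 2ξ³/3 + 7ξ/15; it verifies that each vanishes together with its second derivative at ξ = 0 and ξ = 1, and that the fourth derivatives are 24 and 24ξ, which is what a load of the form λ + αx/a requires after division by 24. The helper names are hypothetical.

```python
from fractions import Fraction as F


def derivative(coeffs, order=1):
    """Differentiate a polynomial given as [c0, c1, c2, ...] `order` times."""
    for _ in range(order):
        coeffs = [k * c for k, c in enumerate(coeffs)][1:]
    return coeffs


def evaluate(coeffs, xi):
    """Evaluate the polynomial at xi."""
    return sum(c * xi ** k for k, c in enumerate(coeffs))


# xi - 2 xi^3 + xi^4  (multiplies lambda + beta*y/b)
quartic = [F(0), F(1), F(0), F(-2), F(1)]
# 7 xi/15 - 2 xi^3/3 + xi^5/5  (multiplies alpha)
quintic = [F(0), F(7, 15), F(0), F(-2, 3), F(0), F(1, 5)]

if __name__ == "__main__":
    for name, poly in [("quartic", quartic), ("quintic", quintic)]:
        ends = [evaluate(poly, F(e)) for e in (0, 1)]
        bends = [evaluate(derivative(poly, 2), F(e)) for e in (0, 1)]
        print(name, ends, bends, derivative(poly, 4))
```

Vanishing of the function and of its second x-derivative at both ends is used here as the pinned-edge requirement for the polynomial part; the full condition (50.0) contains further terms that are not reconstructed in this sketch.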


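Case I. The edges y = 0, b are pinned. The boundary conditions are $W_{00} = 0$
and (51.0). When (58) is substituted in these edge conditions and use is made
of the appropriate Fourier expansions, the following system of equations is
obtained.

$B_n = -2(i + j),$

$A_n \operatorname{sh}\phi + B_n \operatorname{ch}\phi + C_n \phi \operatorname{sh}\phi + D_n \phi \operatorname{ch}\phi = -2(i + j + m),$

$B_n(1 - \sigma) + 2C_n = 2\sigma(i + j),$

$A_n(1 - \sigma)\operatorname{sh}\phi + B_n(1 - \sigma)\operatorname{ch}\phi + C_n[\phi(1 - \sigma)\operatorname{sh}\phi + 2\operatorname{ch}\phi] + D_n[\phi(1 - \sigma)\operatorname{ch}\phi + 2\operatorname{sh}\phi] = 2\sigma(i + j + m),$

where $\phi = \theta b = n\pi b/a$. The simultaneous solution of this system of equations
is

$A_n = -\dfrac{(i + j)(1 - \operatorname{ch}\phi)(2\operatorname{sh}\phi - \phi) + m(2\operatorname{sh}\phi + \phi\operatorname{ch}\phi)}{\operatorname{sh}^2\phi},$

$B_n = -2(i + j),$

$C_n = (i + j),$

$D_n = \dfrac{(i + j)(1 - \operatorname{ch}\phi) + m}{\operatorname{sh}\phi}.$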
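The closed forms for Case I can be confirmed by substituting them back into the four edge equations. The sketch below does this numerically for arbitrary sample values; the function names and the sample values are hypothetical, and the leading minus sign of $A_n$ is the one that makes the second equation hold.

```python
import math


def case_one_constants(i, j, m, phi):
    """Closed forms of A_n, B_n, C_n, D_n for pinned edges y = 0, b."""
    sh, ch = math.sinh(phi), math.cosh(phi)
    A = -((i + j) * (1 - ch) * (2 * sh - phi) + m * (2 * sh + phi * ch)) / sh ** 2
    B = -2 * (i + j)
    C = i + j
    D = ((i + j) * (1 - ch) + m) / sh
    return A, B, C, D


def residuals(i, j, m, phi, sigma):
    """Left member minus right member of each of the four edge equations."""
    A, B, C, D = case_one_constants(i, j, m, phi)
    sh, ch = math.sinh(phi), math.cosh(phi)
    r1 = B + 2 * (i + j)
    r2 = A * sh + B * ch + C * phi * sh + D * phi * ch + 2 * (i + j + m)
    r3 = B * (1 - sigma) + 2 * C - 2 * sigma * (i + j)
    r4 = (A * (1 - sigma) * sh + B * (1 - sigma) * ch
          + C * (phi * (1 - sigma) * sh + 2 * ch)
          + D * (phi * (1 - sigma) * ch + 2 * sh)
          - 2 * sigma * (i + j + m))
    return r1, r2, r3, r4


if __name__ == "__main__":
    print(residuals(i=2.0, j=0.5, m=-0.7, phi=1.3, sigma=0.3))
```

All four residuals are zero up to rounding error for any Poisson ratio σ, which reflects the fact that σ drops out of the solution in the pinned case.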













These values substituted in (58) constitute the solution for $W_{00}$ for y = 0, b
pinned. When $\lambda = 1$, this solution reduces to that published by S. Iguchi.¹²
Case II. The edges y = 0, b are free. The boundary conditions are (51.0) 
and (52.0). The substitution of equations (53), (54) and (58) in these edge 
conditions yields 
B,(1 — o) + 2C, = 2o(i + j), 
A,(1 — cysh¢ + B,(1 — a) cho + C,[2ch¢ + (1 — o)dsh¢] 
+ D,[2sh¢ + (1 — o)pch¢] = 2o(t + 7 + m), 
A,o(1l — o) — D,o(1 + «) = —2m(2 — o), 
A,o(1 — «) cho + B,.d(1 — o) she + C.g[(1 — o)¢chod — (1 + «) shg] 
+ D,e[(1 — o)@she — (1 + ¢) ch] = —2m(2 — oa). 


The simultaneous solution of this system of equations is 


2o(1 + o)(i + j)(1 — chg) 


























a 
4 2m|oe(1 + o)g(R — Scho) — (2 — o)(R — S)(2sh¢ + S)] 
S(R? — S?) 

— 2o(i + JIC. + «) she —S)_ 4m 4m|opS sh @ + (2 — o)(1 — ch o@)(R — S)] 
(1 — «)(R — S) S(R? — 8?) 

C. = 2o(i + j) sh¢ 2m [oS sh @ + (2 — «)(1 — ch g@)(R — S)] 
"""R=8 o(k? — S*) 

—_— 2o(i + j)(1 — ch ¢) 4 2m[(2 — o)(R — 8) sh¢ + o¢(R — Sch¢)] 
wii R-S o(k? — S?*) , 


where R = (3 + ¢) sh¢@ and S = (1 — a)d. These values substituted in (58) 
constitute the solution for Wo for y = 0,b free. If \ = 1 anda = 8 = 0, this 
solution reduces to that published by D. L. Holl.” 

Case III. The edges y = 0, b are clamped. The boundary conditions are 
Wo = @Wo/dy = 0. Equation (58) and necessary Fourier expansions com- 
bined with these edge conditions yield 
B,, = —2(i + dD, 

A, sho + B,ch¢ + C,dsh¢ + Didch¢d = —2(¢ +7 + m), 
Aid + De = —2m, 


A,¢dch¢ + Badshe + C.¢@ch¢ + sh¢) + D.d@sh¢e + cho) = —2m. 


2§. Iguchi, Eine Lésung fiir die Berechnung der biegsamen rechteckigen Platten, Julius 
Springer, Berlin, 1933. 

13D. L. Holl, The deflection of an isotropic rectangular plate under the action of continuous 
and concentrated loads when supported at two opposite edges, lowa State College Journal of 
Science, vol. 9 (1935), pp. 597-607. 













The solution of this system is 


2(¢ + j)(1 — ch@) — 2mg(1 — ch¢) 


A,=-— 


o+sh¢ e— she” 
B, = —2(i + j), 
C, = 2(i + j) sh¢ 4 2m[¢ sh + (1 — ch¢) ( + sho)] 
@+sh¢ o(¢? — sh? ¢) 
D. = 2(¢ + j)(1 — ch) 2m(sh° o— ¢ cho) 


¢+sh¢ o(¢? — sh?) 


These values substituted in equation (58) constitute the solution for $W_{00}$ for
y = 0, b clamped. If $\lambda = 1$ and $\alpha = \beta = 0$, this solution reduces to that given
by Holl.¹⁴


c. Solution for $W_{20}$. With $W_{00}$ known it is now possible to solve for $W_{20}$.
Since $W_{20}$ is a bi-harmonic function of the second order of magnitude as compared
with $W_{00}$, a solution of the following form is assumed.


Wo = = An [04 x — ar) + ; 5 (a —« 2) | 


2D 
(59) Ph? 
_ coil > ee - IF. ch éy + G, shéy + H, 6y choy + I,6y sh 6y]. 
Pinned edge condition (50.1) at z = 0, a requires that A= — (8 — 30) Fa, Ga, 
10(1 — o) 


H,,, and J, will now be determined by the conditions at y = 0, b for all three 
cases. 

Case I. The edges y = 0, b are pinned. The edge conditions are Wa = 0 
and (51.1). The results are 


~.. aie 
™ =, i+) 

~— (8 — 30) — [@ + JU — che) + ml] 
a ee sh@ 

H, = 1, =0 


When these values are substituted in equation (59), the solution for Wo, for 
y = 0, b pinned, results. It is easily shown that W2) may be written in the form 


_& _ ae) 


"Ora V'Wo. 


(60) Wo = 


1* Loc. cit. 








= 










Case Il. The edges y = 0, b are free. The edge conditions are (51.1) and 
(52.1). The results are 








. —(8 —(8 + cs 1 8m(1 — ch¢) 
* 1001 — y Os 7 5 Bn + “5S(R +8)’ 
— —(8 + ¢) 2 4m(2 sh ¢@ > + S) 
~~ 10(1 — =) Dat gist 5S(R +8) ’ 
] 4m sho 
H, =-D,—: na 
5 5¢(R + 8) 
) - : C,, — ont = oo 
5 5¢(R + 8) 


where A,, B,, C, and D, are the constants determined for Wo for Case II. 
We may be written in the form 





- _ —(8+a)h’ 2), 2h* a Ww 4Ph° sin 6x 
We = SoG sy VWo- 5 oe + sa eed Ye 


o(k + S) 


(61) E ee ee “_ 4+ —2™ _ +_9(1 — ch¢) choy — (2sh¢ + S) shoy 
+ (1 — o)éy sh¢ ch by + (1 — o)dy(1 — cho) shy} |. 


Case III. The edges y = 0, b are clamped. The edge conditions are Wx = 
IW 





= 0. The results are 


oy 
; 8 — 30 8 — 3e 
F,, S awe ny Fn Sen ~ dias 
20(1 — a) “ ° 20(1 — a) 
8 — 3e 8 — 30 
sa ae es 
20(1 — oa) . 20(1 — oa) 


By use of these values solution (59) may be written as 
—(8 — 3a)h* Woo 

1001—o) a2? © 

When \ = 1 anda = 6 = 0, all three of these solutions for Wo reduce to those 
given by Garabedian.” 


(62) Wo = 


d. Solution for $W_{2n,0}$ ($n \ge 2$). Now it is possible to solve for $W_{40}$. Since
$W_{40}$ is a harmonic function of the fourth order of magnitude as compared with
$W_{00}$, a solution of the following form is assumed.


. AP oe) Ph‘ sin a 
(63) Wo= AP (a A+ A+ + > 2 [J, chéy + K, sh Oy). 


15 Footnote 4, vol. 195. 










The boundary conditions at x = 0, a are Wy = 0 and (56.2) which reduces to 
2y17 

tal = 0. These conditions require that A = 0. J, and K, will now be 

determined by the conditions at y = 0, b for each of the three cases. 

Case I. The boundary conditions at y = 0, b are $W_{40} = 0$ and (57.2). These
conditions require that $J_n = K_n = 0$. Therefore $W_{40} = 0$. By inspection of
equations (16) it readily follows that $W_{2n,0} = 0$ ($n \ge 2$).

Case II. The free edge conditions are (57.2) and (58.2). They require that 


_ 2(8 + a) _ 28 + 0) 
a" “~“-a 


With these values solution (63) may be written 


,  =—(8 + o)h'[ o, 8 — 30 au). 
(64) Wa = 5 Otel oy — Sa 3 Po a+2 +¥ 








By a consideration of orders of magnitude or by (16) it is evident that Wan.» = 
0 (n > 2). 


Case III. The clamped edge conditions are W 40 Ww 


dy 
ditions are satisfied when J, = K, = 0. Hence Wy = 0. As before, it follows 
that Wono = 0 (n = 2). 

All of these solutions for Wy reduce to those given by Garabedian” for the 
case \ = landa = B= 0. 


= (0. These con- 





e. Solution for [) and V). It remains to determine Uy and V» to complete 


the problem. It has been pointed out that Wm and “ + oe are of the same 


order of magnitude as the load which is of the fourth order of magnitude as 
compared with Wo. Therefore 


wee ema thet ey 


+ p> == [F, ch by + G, sh @y + H,,6y ch by + I,6y sh Oy). 


However equation (41) requires that this expression be a harmonic function. 
Therefore H, = I, = 0.. With this background one may consider the following 
forms of solutions. 

cos ane 


[A, ch dy + B, sh 6y], 





Uso = Co + crt + coy + cg + ary + cy" +2 


Voo = ko + kuz + key + kaa? + kary + hsy’ weit ch éy + D, sh 6y]. 


16 See footnote 15. 

























It is obvious from the definitions of a pinned edge and a free edge as given
in this paper that T = 0 is the only restriction on $U_0$ and $V_0$ for Cases I and II.
Since this condition is not sufficient to determine all the constants in the above
general forms, the solutions obtained here must be recognized as satisfactory
solutions and not necessarily the unique solutions. Garabedian¹⁷ avoided this
difficulty in his solution for a uniform load by assuming that $U_0$ and $V_0$ were
closed linear functions of x, y. However, this assumption does not permit a
satisfactory solution in the pinned-clamped case as it requires that $T_1$ be different
from zero at the pinned edges x = 0, a. Love (p. 462) adds the condition
S = 0, but this condition does not yield a reasonable solution. Furthermore
S. Woinowsky-Krieger¹⁸ has shown that S is different from zero in general.

Equation (48.0) is the only condition on $U_{00}$ and $V_{00}$ at x = 0, a. It gives


oAP oBP P 
ot ee a a + ok = Tr, 2s + oki = Fo. 
Therefore choose 
OQ=a=a=k=k =k = 0; 
i. a Oo. ee a caP 
1 = ke = oe Cs = 2hs = orp} 2c3 ke = ao 


where the quantity E is Young's modulus. Equation (49.0) at y = 0, b yields
D, = oA, and C,, If the plate be fixed in space by making Vo = 0 
at y = Oand Ug nepothy x = 0, then B, = A, = 0. Consequently D, = C, = 0 
and the final solution may be written 


(65) Uw = = Q+2 $+ 6), 
(66) Va (+s +2 er). 


Since $U_{20}$ and $V_{20}$ are of the second order of magnitude as compared to $U_{00}$ and
$V_{00}$, they must either be zero or constants. But $V_{20} = 0$ at y = 0 and $U_{20} = 0$
at x = 0 to fix the plate in space. Therefore they are zero everywhere. Consequently
$U_{2n,0} = V_{2n,0} = 0$ ($n \ge 1$). This means that (65) and (66) give the
complete horizontal displacement of the middle surface. Although not a unique
solution it is a rational one in the sense that it yields a displacement at the
middle surface which is one-half of what the displacement would be if the plate
were resting on a complete foundation. If $\lambda = 1$ and $\alpha = \beta = 0$, these solutions
reduce to those published by Garabedian¹⁹ for a uniform load.


17 See footnote 15. 

18 S. Woinowsky-Krieger, Der Spannungszustand in dicken elastischen Platten, Ingenieur- 
Archiv, vol. 4 (1933), pp. 203-226, 305-331. 

19 See footnote 15. 













The values of $U_0$ and $V_0$ for the pinned-clamped case are most easily obtained
by imposing the conditions $U_0 = V_0 = 0$ at y = 0, b first and then
requiring that $T_1 = 0$ at x = 0, a. The final results are


oP a rypB 
et an ¢ ee ps Lat 
2B [ar -2 saline = + b 


(¢+j7)(1 —ch¢d) +m choy}, 
sho 


Uo = 


(67) 








+ > Pees + j) choy + ° 


; oP ary + - sin dx {2(i +j)+m,. ) 
(68) Vo = ely y+— + oF 2 ae \- she @ sh wv} 
It readily follows that Uono = Varo = 0 (nm 2 1). 

Woinowsky-Krieger's²⁰ solution for an infinite plate strip pinned at the edges
x = 0, a is readily obtained from any one of the three cases solved here. It is
only necessary to set Vo = 0 and redetermine Uo to satisfy the conditions at the
edges x = 0, a. Then in the limit as b → ∞ the solutions given here reduce to
those Woinowsky-Krieger gives for the infinite plate strip.





Iowa State COLLEGE. 


20 Loc. cit. 













ON MATRICES OF INTEGERS AND COMBINATORIAL TOPOLOGY 
By HASSLER WHITNEY


1. Introduction. Our object is to give an elementary account of some alge- 
braic theorems, with some immediate applications in combinatorial topology, 
in particular, in the theory of homology and cohomology¹ groups. The theorems
are to a certain extent known, if in somewhat different forms.

The main tool in the algebraic part is the theory of group pairs, and in par- 
ticular the question of when one group “resolves” or “completely resolves” 
another. The main theorems are on the existence of extensions of a homo- 
morphism, and the existence of solutions of linear equations, with a matrix of 
integers and elements of an abelian group as unknowns. In each theorem, two 
types of conditions are employed, one using mod m properties, the other using 
group pairs. We shall use only discrete groups (except in Theorem 1). 

The applications to topology are concerned with the relations between the 
ordinary homology theory, and the newer “‘dual” theory. An illustration of 
the convenience of the newer theory is given in Appendix I. For other illus- 
trations, we mention the duality theorems (Kolmogoroff, Alexander, Cech, etc.), 
and properties of maps (see a following paper). 


I. Group pairs and homomorphisms 


2. Group-pairs. All groups will be abelian. 0 will denote the identity in 
any group. Let 


$mG$ = all elements $mg$, $g$ in $G$ ($m$ an integer),

${}_mG$ = all $g$ in $G$ such that $mg = 0$.

Then $0G$ contains 0 alone. $G - G'$ is the difference (factor) group of G over
the subgroup G'.
Let G, H, Z be three groups. If to each g in G and h in H there corresponds a 
z = gh in Z, and both distributive laws are satisfied, we say G and H form a group 


Received December 1, 1936; presented to the American Mathematical Society Decem- 
ber, 1936. 

1 The relation of ‘‘coboundary”’ is becoming of increasing importance. Other terms 
have been used: dual boundary, upper boundary, inverse boundary, boundary in the dual 
subdivision, derived (of a function). The present term (accepted by E. Cech) offers 
definite advantages. In differential geometry, the (exact alternating) covariant tensors 
and the contravariant differentials play the same réle as cocycles and (contra)cycles in the 
combinatorial theory; in fact, they may be obtained directly by a passage to the limit. 
Moreover, the prefix co is very convenient to handle. 




pair with respect to Z. If gh = 0, we say g and h are orthogonal. For any
subgroup H' of H, set

$(G, H')$ = nullifier² of $H'$ in $G$ = all elements $g$ in $G$ orthogonal to every $h$ in $H'$.

Define $(H, G')$ similarly.

If $(G, H') = 0$, we say H' resolves G.³ If $(G, {}_mH') = mG$, we say H' m-resolves
G.⁴ Clearly "H' 0-resolves G" and "H' resolves G" are equivalent statements.
If H' m-resolves G for all integers $m \geq 0$, we say H' resolves G completely. Note
that

(2.1) if $H'$ resolves $G$, then $(G, mH') = {}_mG$ (all $m$).

(For m = 1, this is the definition.) For if $mg \neq 0$, choose h in H' so that
$(mg)h \neq 0$; then $g(mh) \neq 0$.

Let G and Z be groups. A Z-character of G is a homomorphism of G into
(part of) Z.⁵ Let $\mathrm{Ch}_Z(G)$ be the group of Z-characters of G. Given g in G
and $\phi$ in $\mathrm{Ch}_Z(G)$, set $g \cdot \phi = \phi(g)$; then these groups form a pair. Clearly any
group resolves any of its character groups. The converse is false.

G is infinitely divisible if $mG = G$ for each $m \neq 0$.


3. Examples. Let $I_0$ be the group of integers, and $I_\mu = I_0 - \mu I_0$ the group
of integers mod $\mu$. Using ordinary multiplication into $I_{(\mu,\nu)}$, it is seen that
$I_\mu$ resolves $I_\nu$ if and only if $\mu$ is a multiple of $\nu$. $I_0$ m-resolves no group G unless
m = 0 or 1 or $mG = G$. The general question of when $I_\mu$ m-resolves $I_\nu$ is rather
complicated. However, $I_\mu$ completely resolves $I_\mu$ if $\mu > 0$ and ordinary
multiplication mod $\mu$ is used. This is a consequence of Theorem 1 below.⁶ If $R_1$
is the group of the real (or the rational) numbers mod 1, then $I_0$ and $R_1$
completely resolve each other, as is easily seen; this also follows from Theorem 1.

$R_1$ is infinitely divisible; no group with a finite number of generators (other than 0) is.
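
The statements about finite cyclic groups in this section can be checked by brute force for small orders. The following Python sketch is an illustration only and is not part of the original paper. The function names `nullifier`, `resolves` and `completely_resolves_itself` are hypothetical; elements of the group of integers mod mu are represented by `range(mu)`, and the pairing of the integers mod mu with the integers mod nu is assumed to be multiplication reduced mod gcd(mu, nu). Only m in `range(mu)` is tested, on the assumption (easily verified) that both sides of the m-resolution condition depend only on m mod mu.

```python
from math import gcd

def nullifier(size, others, modulus):
    """Elements b of range(size) with a*b == 0 (mod modulus) for every a in others."""
    return {b for b in range(size) if all((a * b) % modulus == 0 for a in others)}

def resolves(mu, nu):
    """True if the integers mod mu resolve the integers mod nu:
    the nullifier of the whole first group in the second is {0}."""
    return nullifier(nu, range(mu), gcd(mu, nu)) == {0}

def completely_resolves_itself(mu):
    """True if (G, _mG) == mG for every m, where G is the integers mod mu
    paired with itself by multiplication mod mu."""
    for m in range(mu):  # both sides depend only on m mod mu
        m_torsion = [h for h in range(mu) if (m * h) % mu == 0]  # elements killed by m
        m_multiples = {(m * g) % mu for g in range(mu)}          # multiples of m
        if nullifier(mu, m_torsion, mu) != m_multiples:
            return False
    return True

# Compare with the claims in the text for all orders from 1 to 12.
print(all(resolves(mu, nu) == (mu % nu == 0)
          for mu in range(1, 13) for nu in range(1, 13)))
print(all(completely_resolves_itself(mu) for mu in range(1, 13)))
```

Both checks are exhaustive only over the small range shown; they illustrate the claims rather than prove them.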

THEOREM 1. A discrete or compact (or locally bicompact⁷) group completely
resolves and is completely resolved by its $R_1$-character group.


2 The term annihilator has been used, but this word seems unnecessarily barbarous.

3 If H' resolves G, and $g_1 \neq g_2$, then $(g_1 - g_2)h \neq 0$ for some h in H', and $g_1h \neq g_2h$.
Thus distinct elements of G may be shown to be distinct by multiplying by elements of H'.

4 Thus, by multiplying by elements of H', we can tell whether a given element of G is
divisible by m or not. Any H' 1-resolves G.

5 It is often important to introduce a topology into groups and character groups; we
shall not do this here.

6 Direct proof: Note that

$mI_\mu$ = all numbers in $I_\mu$ $\equiv 0$ mod $(m, \mu)$,

${}_mI_\mu$ = all numbers in $I_\mu$ $\equiv 0$ mod $\mu/(m, \mu)$.

Suppose $ab \equiv 0$ mod $\mu$ for all b in ${}_mI_\mu$. Taking $b = \mu/(m, \mu)$ shows that $a \equiv 0$ mod $(m, \mu)$,
and a is in $mI_\mu$.

7 See L. Pontrjagin, Annals of Math., vol. 35 (1934), pp. 361-388; also van Kampen,
Annals of Math., vol. 36 (1935), pp. 448-463.










That G and $H = \mathrm{Ch}_{R_1}(G)$ resolve each other is shown by Pontrjagin, loc. cit.,
Theorem 5 and Remark 4. Now set $H' = mH$. As H resolves G, $(G, mH) = {}_mG$,
by (2.1). By Pontrjagin, Theorem 4, $(H, (G, mH)) = mH$; hence $(H, {}_mG) = mH$,
and G m-resolves H. As m is arbitrary, G completely resolves H. Similarly
H completely resolves G.

Note that a group may resolve and be resolved by its character group, and
not resolve it completely. For an example, take $G = H = Z = I_0$.


4. Extensions of homomorphisms. Suppose part of a group is mapped into 
another group. Can the map be extended through the first group so that it is 
a homomorphism? We shall give two answers to the question. 

THEOREM 2.⁸ Let A be a free group with a finite number of generators, let A'
be a subset of A, and let f map the elements of A' into the group G. The definition
of f may be extended over A so that it is a homomorphism if and only if (a) holds:

(a) For any elements $a_1, \dots, a_t$ of A' and integers $m \geq 0$, $\alpha_1, \dots, \alpha_t$,

(4.1) $\sum \alpha_i a_i$ in $mA$ implies $\sum \alpha_i f(a_i)$ in $mG$.

Suppose that A and H, and G and H, form group pairs, and let H resolve G
completely. If the following condition holds, the extension of f over A is possible:
(b) For any elements $a_1, \dots, a_t$ of A', integers $\alpha_1, \dots, \alpha_t$, and element h of H,

(4.2) $(\sum \alpha_i a_i)h = 0$ implies $(\sum \alpha_i f(a_i))h = 0$.

The necessity of (a) is clear: if $\sum \alpha_i a_i = ma$, then

$\sum \alpha_i f(a_i) = f(\sum \alpha_i a_i) = f(ma) = m f(a)$.

We turn to the sufficiency. Let A* be the subgroup of A generated by the
elements of A'. For each $a = \sum \alpha_i a_i$ in A*, set

$f(a) = \sum \alpha_i f(a_i)$.

To prove that f is uniquely determined in A*, suppose

$\sum \alpha_i a_i = \sum \beta_i a_i, \qquad \sum (\beta_i - \alpha_i) a_i = 0$.


8 The theorem will be strengthened in Appendix II. It is easily seen that the conditions
may be weakened as follows: (a) and (b) may be replaced by the corresponding pairs of
hypotheses

(a₁) $\sum \alpha_i a_i = 0$ implies $\sum \alpha_i f(a_i) = 0$,

(a₂) $a$ in $mA$ implies $f(a)$ in $mG$;

(b₁) $\sum \alpha_i a_i = 0$ implies $(\sum \alpha_i f(a_i))h = 0$ (all h),

(b₂) $ah = 0$ (any fixed h, a in A') implies $f(a)h = 0$.

Further, the values m = 0, 1 may be omitted in (a). We note from the proof that (b)
implies (a).

The theorems in Alexandroff-Hopf, Topologie, I, pp. 591-3, are consequences of the
strengthened theorem (using footnote 6). Our proof has naturally much in common with
theirs.




(We may carry out both sums over the same terms.) Suppose (a) holds. Then
as 0 is in $0A$,

$\sum (\beta_i - \alpha_i) f(a_i) = \sum \beta_i f(a_i) - \sum \alpha_i f(a_i)$

is in $0G$, hence it is 0, and the last two terms are equal. Now suppose (b) holds.
Then for any h in H,

$[\sum (\beta_i - \alpha_i) a_i]h = 0$;

hence

$[\sum \beta_i f(a_i) - \sum \alpha_i f(a_i)]h = 0$,

and as H resolves G, the term in brackets equals 0.

Obviously f is a homomorphism in A*. Moreover, as each linear combination
of elements of A* is a linear combination of elements of A', (a) or (b) is
easily seen to hold in A* if it holds in A'.

Let $b_1, \dots, b_r, b_{r+1}, \dots, b_s$ form a base for A, and let $m_1, \dots, m_r$ be positive
integers such that $a_1 = m_1b_1, \dots, a_r = m_rb_r$ form a base for A*.⁹ Suppose
(a) holds. As $a_i$ is in $m_iA$, $f(a_i)$ is in $m_iG$, and we may choose $g_i$ so that $m_ig_i = f(a_i)$
$(i = 1, \dots, r)$. If (b) holds, $a_ih = b_i(m_ih) = 0$ for all h in ${}_{m_i}H$; hence
$f(a_i)h = 0$ for all h in ${}_{m_i}H$, and as H resolves G completely, $f(a_i)$ is in $m_iG$.
Again $g_i$ exists. Set

$f(b_i) = g_i$ $(i = 1, \dots, r)$, and $f(b_i) = 0$ $(i = r+1, \dots, s)$.

The resulting homomorphism of A into G obviously agrees with the one already
found in A*.

Remarks. If A' is a subgroup of A with division (i.e., $ma$ in A', $m \neq 0$,
implies a in A'), and f is a homomorphism of A' into G, we may always extend
f over A. For we may choose a base $b_1, \dots, b_r, b_{r+1}, \dots, b_s$ for A such that
$b_1, \dots, b_r$ form a base for A', and set $f(b_i) = 0$ $(i > r)$. Further, any
homomorphism of any subgroup A' of any group A into G may be extended over A,
if G is infinitely divisible; see Pontrjagin, loc. cit., proof of Lemma 1.


II. Linear equations with integer coefficients 


5. Cycles, etc., of a matrix. Let $\eta$ be a matrix of integers:

(5.1) $\eta = \|\eta^i_j\| = \begin{Vmatrix} \eta^1_1 & \cdots & \eta^1_n \\ \vdots & & \vdots \\ \eta^p_1 & \cdots & \eta^p_n \end{Vmatrix}$.

Given a group X, let $X^s = X + \cdots + X$, a direct sum with s terms. The
elements of $X^s$ may be written $(x_1, \dots, x_s)$ or $(x^1, \dots, x^s)$. We define the
boundary and coboundary (with respect to $\eta$) of elements of $X^n$ and $X^p$ by

(5.2) $\partial(x^1, \dots, x^n) = (\sum_j \eta^1_j x^j, \dots, \sum_j \eta^p_j x^j)$ in $X^p$,

(5.3) $\delta(x_1, \dots, x_p) = (\sum_i \eta^i_1 x_i, \dots, \sum_i \eta^i_n x_i)$ in $X^n$.


9 See Alexandroff-Hopf, loc. cit., p. 568, no. 24.




Any element (5.2) we call an X-boundary; these form a group $B^X(\eta)$. Define
similarly X-coboundaries and $B_X(\eta)$.

Any element of $X^n$ [$X^p$] whose boundary [coboundary] vanishes we shall call
an X-cycle [X-cocycle]. We form with these the groups $C^X(\eta)$ and $C_X(\eta)$.

Given a group pair G, H, we form a group pair $G^s$, $H^s$ by

(5.4) $(g^1, \dots, g^s) \cdot (h_1, \dots, h_s) = \sum g^i h_i$.

THEOREM 3. If H resolves G, or G resolves H, then

(5.5) $C^G(\eta) = (G^n, B_H(\eta))$, or $C_H(\eta) = (H^p, B^G(\eta))$.

We shall prove the first; the other is obtained by transposing $\eta$. For any
G-cycle $(g^1, \dots, g^n)$ and H-coboundary $\delta(h_1, \dots, h_p)$,

(5.6) $(g^1, \dots, g^n) \cdot \delta(h_1, \dots, h_p) = \sum_j g^j \sum_i \eta^i_j h_i = \partial(g^1, \dots, g^n) \cdot (h_1, \dots, h_p) = 0$.

Now suppose $(g^1, \dots, g^n)$ satisfies this relation for all $(h_1, \dots, h_p)$. Taking
all the $h_i = 0$ but the j-th gives $(\sum_i \eta^j_i g^i)h = 0$ (all h); as H resolves G, $\sum_i \eta^j_i g^i = 0$
(all j), and $(g^1, \dots, g^n)$ is a G-cycle.
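
The identity behind (5.6), that the boundary and coboundary operators are adjoint with respect to the product (5.4), can be checked numerically. The Python sketch below is an illustration and not part of the original paper. It assumes integer coefficients for both groups (paired by ordinary multiplication), stores the matrix as a list of p rows of length n, and takes the boundary to send an n-tuple to a p-tuple through the rows of the matrix; the function names and the sample matrix are hypothetical.

```python
from itertools import product

def boundary(eta, x):
    """Boundary of an n-tuple x: entry i is sum_j eta[i][j] * x[j] (a p-tuple)."""
    return [sum(row[j] * x[j] for j in range(len(x))) for row in eta]

def coboundary(eta, y):
    """Coboundary of a p-tuple y: entry j is sum_i eta[i][j] * y[i] (an n-tuple)."""
    return [sum(eta[i][j] * y[i] for i in range(len(y))) for j in range(len(eta[0]))]

def dot(u, v):
    """The product (5.4): sum of coordinatewise products."""
    return sum(a * b for a, b in zip(u, v))

def is_cycle(eta, x):
    return all(c == 0 for c in boundary(eta, x))

def orthogonal_to_coboundaries(eta, x):
    """Test x against the coboundaries of the unit p-tuples, which generate all coboundaries."""
    p = len(eta)
    units = [[1 if k == i else 0 for k in range(p)] for i in range(p)]
    return all(dot(x, coboundary(eta, e)) == 0 for e in units)

eta = [[1, -1, 0],
       [0, 1, -1]]  # hypothetical sample: p = 2 rows, n = 3 columns

for x in product(range(-2, 3), repeat=3):
    for y in product(range(-2, 3), repeat=2):
        assert dot(boundary(eta, x), y) == dot(x, coboundary(eta, y))
    assert is_cycle(eta, x) == orthogonal_to_coboundaries(eta, x)

print(is_cycle(eta, [1, 1, 1]))
```

With integer coefficients the second assertion is the first half of Theorem 3 in miniature: a tuple is a cycle exactly when it is orthogonal to every coboundary.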


6. Linear equations in integers. We shall now prove

THEOREM 4.¹⁰ Let $\eta$ and G be given; if we use ($\beta$), assume H resolves G
completely. $(g^1, \dots, g^p)$ is a G-boundary, i.e., the equations

(6.1) $\sum_{j=1}^{n} \eta^i_j g_j = g^i \qquad (i = 1, \dots, p)$

are solvable for $g_1, \dots, g_n$, if and only if one of the following is true:

($\alpha$) For any integers $m \geq 0$, $\alpha_1, \dots, \alpha_p$:

(6.2) $\sum_{i=1}^{p} \alpha_i \eta^i_j \equiv 0$ mod $m$ (all j) implies $\sum_{i=1}^{p} \alpha_i g^i$ is in $mG$;

in other words, every $I_0$-cocycle mod m¹¹ (or, every $I_m$-cocycle) is orthogonal mod m
to $(g^1, \dots, g^p)$.

($\beta$) For any $(h_1, \dots, h_p)$ in $H^p$,

(6.3) $\sum_{i=1}^{p} \eta^i_j h_i = 0$ (all j) implies $\sum_{i=1}^{p} g^i h_i = 0$;

in other words, every H-cocycle is orthogonal to $(g^1, \dots, g^p)$.

Example. Let $G = I_\mu$, $\mu$ a prime; then $\eta$ may be considered as a matrix of
integers mod $\mu$. Using footnote 7, both ($\alpha$) and ($\beta$) (with $H = Z = I_\mu$) reduce
to testing ($\alpha$) for the single integer $m = \mu$.


10 A proof of the first half, ($\alpha$), of this theorem, using a standard theorem on linear
equations (see Veblen, Analysis Situs, Second edition, Appendix 2) has been furnished to
me by H. T. Engstrom.

11 x is a cocycle mod m if its coboundary is divisible by m; it is orthogonal mod m to y
if xy is divisible by m. We might state the condition as follows: $(g^1, \dots, g^p)$ is in the
"complete nullifier" of $C_{I_0}(\eta)$.













The necessity of either condition is trivial; we shall prove the sufficiency.
Set $A = I_0^n$ = all $(\alpha_1, \dots, \alpha_n)$ ($\alpha$'s integers). Let A' be the rows $\eta^1, \dots, \eta^p$
of $\eta$, and set

(6.4) $f(\eta^i) = f(\eta^i_1, \dots, \eta^i_n) = g^i \qquad (i = 1, \dots, p)$.

Suppose f has been extended over A so that it is a homomorphism of A into G.
Then set

(6.5) $g_1 = f(1, 0, \dots), \quad g_2 = f(0, 1, \dots), \quad \dots$.

Then

$\sum_{j=1}^{n} \eta^i_j g_j = \sum_{j=1}^{n} \eta^i_j f(0, \dots, 1, \dots, 0) = f(\eta^i_1, \dots, \eta^i_n) = g^i$.

Thus we need merely show that (a) or (b) of Theorem 2 is satisfied.

Suppose first ($\alpha$) holds, and $\sum \alpha_i \eta^i$ is in $mA$. Then $\sum \alpha_i \eta^i_j \equiv 0$ mod m (all j),
hence $\sum \alpha_i g^i = \sum \alpha_i f(\eta^i)$ is in $mG$, and (a) holds.

Suppose next ($\beta$) holds. Let A and H form a group pair by setting
$(\alpha_1, \dots, \alpha_n)h = (\alpha_1h, \dots, \alpha_nh)$. Suppose

$(\sum \alpha_i \eta^i)h = \sum \alpha_i(\eta^i_1h, \dots, \eta^i_nh) = 0$.

Then

$\sum_i \alpha_i \eta^i_j h = \sum_i \eta^i_j h_i = 0$ (all j),

where $h_i = \alpha_ih$. Hence, by ($\beta$),

$(\sum \alpha_i f(\eta^i))h = \sum f(\eta^i) h_i = \sum g^i h_i = 0$,

and (b) holds. This completes the proof.

Remark. ($\beta$) of this theorem and a similar theorem with $\eta$ transposed say
that

(6.6) $B^G(\eta) = (G^p, C_H(\eta))$ if H resolves G completely,

(6.7) $B_H(\eta) = (H^n, C^G(\eta))$ if G resolves H completely.
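
For a finite cyclic coefficient group the first criterion of Theorem 4 can be compared with a direct search for solutions. The Python sketch below is an illustration and not part of the original paper. It assumes the coefficient group is the integers mod `mu`, represents the matrix as a list of p rows of length n, and uses the hypothetical names `solvable` and `condition_alpha`. Restricting m to the divisors of `mu` and each multiplier to `range(m)` is a reduction made for this sketch (for this group, the set of multiples of m depends only on gcd(m, mu), and the multipliers matter only mod m); it is not stated in the paper.

```python
from itertools import product

def solvable(eta, g, mu):
    """Brute force: is there x with sum_j eta[i][j] * x[j] == g[i] (mod mu) for every row i?"""
    p, n = len(eta), len(eta[0])
    return any(
        all(sum(eta[i][j] * x[j] for j in range(n)) % mu == g[i] % mu for i in range(p))
        for x in product(range(mu), repeat=n)
    )

def condition_alpha(eta, g, mu):
    """Criterion (6.2) for the integers mod mu, tested for every divisor m of mu
    and every multiplier tuple alpha with entries in range(m)."""
    p, n = len(eta), len(eta[0])
    for m in (d for d in range(1, mu + 1) if mu % d == 0):
        multiples_of_m = {(m * y) % mu for y in range(mu)}
        for alpha in product(range(m), repeat=p):
            columns_vanish = all(
                sum(alpha[i] * eta[i][j] for i in range(p)) % m == 0 for j in range(n)
            )
            if columns_vanish and sum(alpha[i] * g[i] for i in range(p)) % mu not in multiples_of_m:
                return False
    return True

# 2x = 1 has no solution mod 4, and the criterion fails for m = 2, alpha = (1,).
print(solvable([[2]], [1], 4), condition_alpha([[2]], [1], 4))
# 2x = 2 is solvable mod 4 (x = 1), and the criterion holds.
print(solvable([[2]], [2], 4), condition_alpha([[2]], [2], 4))
```

The brute-force search is exponential in n and is meant only for very small systems.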


III. Applications to topology 


7. Cycles, boundaries, cocycles, and coboundaries. Let K be a (finite) complex,
with cells $\sigma^r_i$ $(i = 1, \dots, \alpha^r)$ of dimension r, and incidence matrices
${}^r\partial = \|{}^r\partial^j_i\|$. The boundary and coboundary of $\sigma^r_i$ are

(7.1) $\partial\sigma^r_i = \sum_j {}^r\partial^j_i\, \sigma^{r-1}_j, \qquad \delta\sigma^r_i = \sum_j {}^{r+1}\partial^i_j\, \sigma^{r+1}_j$.

In terms of the coefficients of chains, boundaries and coboundaries are given by
(5.2) and (5.3) with $\eta^i_j$ replaced by ${}^r\partial^i_j$ and ${}^{r+1}\partial^i_j$.




An r-X-chain is a linear form $\sum x_i \sigma^r_i$ ($x_i$ in X); we may consider it as an
element of $X^{\alpha^r}$. Define the groups of

r-X-cycles: ${}^rC^X = C^X({}^r\partial)$,
r-X-boundaries: ${}^rB^X = B^X({}^{r+1}\partial)$,
r-X-cocycles: ${}^rC_X = C_X({}^{r+1}\partial)$,
r-X-coboundaries: ${}^rB_X = B_X({}^r\partial)$.

Then the r-X-homology groups and r-X-cohomology groups are the difference
groups

(7.2) ${}^rH^X = {}^rC^X - {}^rB^X, \qquad {}^rH_X = {}^rC_X - {}^rB_X$.

If G and H form a group pair, then r-G-chains and r-H-chains may be multiplied,
using (5.4). By (5.6),

(7.3) $\partial A \cdot B = A \cdot \delta B$

for any (r + 1)-chain A and r-chain B.
Equations (5.5), (6.6) and (6.7) give

(7.4) ${}^rC^G = (G^{\alpha^r}, {}^rB_H)$ if H resolves G,
(7.5) ${}^rC_H = (H^{\alpha^r}, {}^rB^G)$ if G resolves H,
(7.6) ${}^rB^G = (G^{\alpha^r}, {}^rC_H)$ if H resolves G completely,
(7.7) ${}^rB_H = (H^{\alpha^r}, {}^rC^G)$ if G resolves H completely.

Expressed in words, we have (using (7.3))

THEOREM 5. With suitable G and H, an r-chain (with coefficients in G or H)
is an r-G-cycle, or an r-G-boundary, or an r-H-cocycle, or an r-H-coboundary, if
and only if it is orthogonal to every r-H-coboundary, or r-H-cocycle, or r-G-boundary,
or r-G-cycle.

THEOREM 6. An r-G-chain is a cycle, or a cocycle, if and only if it is orthogonal
to every $I_0$-coboundary, or $I_0$-boundary. It is a boundary, or a coboundary, if
and only if it is orthogonal mod m to every $I_m$-cocycle, or $I_m$-cycle, for all m ($\neq 1$).

The proof of the first half is like that of Theorem 3; the second half follows
from Theorem 4, ($\alpha$).


8. Homology and cohomology groups. We shall find a case in which the 
homology groups are determined by the cohomology groups or vice versa. 

THEOREM 7. For any G and H, the groups ${}^rH^G$, ${}^rH_H$ form a pair, if G and H
form a pair, as follows. Given elements $\xi$ of ${}^rH^G$ and $\zeta$ of ${}^rH_H$, choose a cycle C of $\xi$
and a cocycle D of $\zeta$, and set $\xi\zeta = CD$.

Theorem 5 shows that the definition depends on $\xi$ and $\zeta$ alone.

THEOREM 8. Let Z be infinitely divisible. If $H = \mathrm{Ch}_Z(G)$ and G resolves H
completely, or if $G = \mathrm{Ch}_Z(H)$ and H resolves G completely, then

(8.1) ${}^rH_H = \mathrm{Ch}_Z({}^rH^G)$, or ${}^rH^G = \mathrm{Ch}_Z({}^rH_H)$.










We shall prove the first equation; the proof of the second is the same. Each
element $\zeta$ of ${}^rH_H$ determines a Z-character $\phi_\zeta$ of ${}^rH^G$ by the last theorem.
Suppose $\zeta \neq \zeta'$. Take cocycles D in $\zeta$ and D' in $\zeta'$; then D' − D is not a
coboundary. By (7.7) there is a cycle C such that $C(D' - D) \neq 0$, $CD \neq CD'$. Let
$\xi$ be the homology class of C; then $\xi\zeta \neq \xi\zeta'$, and $\phi_\zeta \neq \phi_{\zeta'}$. Therefore ${}^rH_H$
determines in a (1-1) way a subgroup of the Z-characters of ${}^rH^G$.

To show that this is the whole group, take any Z-character $\phi$ of ${}^rH^G$. For
any cycle C with homology class $\xi$, set $\psi(C) = \phi(\xi)$; this is a Z-character of ${}^rC^G$.
By the remark following Theorem 2, we may extend it to a Z-character $\psi$ of
$G^{\alpha^r}$, the group of all r-G-chains. Set

$\psi_i(g) = \psi(0, \dots, g, \dots, 0)$ (g in the i-th place).

As $H = \mathrm{Ch}_Z(G)$, there is an $h_i$ such that $gh_i = \psi_i(g)$ (all g). Set

$D = (h_1, \dots, h_{\alpha^r}) = \sum h_i \sigma^r_i$;

then for any r-G-chain $C = (g^1, \dots, g^{\alpha^r})$,

$CD = \sum g^i h_i = \sum \psi_i(g^i) = \psi(g^1, \dots, g^{\alpha^r}) = \psi(C)$.

As $\psi$ maps all boundaries into 0, D is a cocycle, by (7.5); let $\zeta$ be its homology
class. Then for any $\xi$ in ${}^rH^G$, $\xi\zeta = \phi(\xi)$. This completes the proof.

COROLLARY. ${}^rH_{I_0}$ is isomorphic to the direct sum of the reduced r-th homology
group and the (r − 1)-th torsion group, all with integer coefficients.

This follows from the last theorem, Theorem 1, and Alexandroff-Hopf, Topologie
I, p. 234, equation (17'). A direct proof is not hard to give, using the
ordinary elementary divisor theory.

Remarks. The theorem does not hold for arbitrary Z. For, consider the
projective plane, using $G = R_1$, $Z = I_2$; then H = 0, and hence ${}^rH_H = 0$. But
${}^2H^G$ = torsion group of dimension 1 = $I_2$ (see Alexandroff-Hopf, loc. cit., p. 234),
and hence $\mathrm{Ch}_Z({}^2H^G) = \mathrm{Ch}_{I_2} I_2 = I_2$.

Given H, set $G = \mathrm{Ch}_{R_1}(H)$; then $H = \mathrm{Ch}_{R_1}(G)$, and G completely resolves H,
by Theorem 1. Hence ${}^rH_H = \mathrm{Ch}_{R_1}({}^rH^G)$. As ${}^rH^G$ is a topological invariant,
${}^rH_H$ is a topological invariant of K.


APPENDIX I 


On closed complexes and pseudomanifolds 


For the general theory, see Alexandroff-Hopf, Topologie I, Ch. VII, §1. We
refer to this work as AH. K will always denote a homogeneous¹² n-complex,
and $\sigma_1, \sigma_2, \dots$ its n-cells. The chain $\sigma_i$ means the n-chain $\sum_j \delta_{ij}\sigma_j$ with
coefficient 1 for $\sigma_i$ and 0 for all other $\sigma_j$. Write $C \sim D$ if C is cohomologous to D.


12 That is, each k-cell (k < n) is on some n-cell.










9. Closed complexes. We say K is closed if no $\sigma_i$ is $\sim 0$ in K. That this
definition agrees with the ordinary one is shown by Theorem 9 below.

LEMMA. $\sigma_i$ is not $\sim 0$ if and only if it is contained in some cycle C mod m
for some $m \neq 1$.¹³

By Theorem 6, $\sigma_i$ is not $\sim 0$ if and only if for some $m \neq 1$ there is a cycle C
mod m such that $C \cdot \sigma_i \not\equiv 0$ mod m. Write $C = \alpha\sigma_i + \cdots$; then $C \cdot \sigma_i = \alpha$,
and the above holds if and only if $\alpha \not\equiv 0$ (mod m), i.e., if and only if C
contains $\sigma_i$.

THEOREM 9. K is closed if and only if each $\sigma_i$ is contained in a cycle mod m
for an $m \neq 1$.

This is a consequence of the lemma. Alexandroff-Hopf, p. 275, Satz Ia, shows
that our definition is equivalent to the ordinary one.


10. Irreducibly closed complexes. We say K is irreducibly closed if it is
closed but no proper subcomplex is.

THEOREM 10.¹⁴ If K is irreducibly closed, then

(a) the cohomology group ${}^nH_{I_0}$ is cyclic (of order $\neq 1$), and

(b) no $\sigma_i$ is $\sim 0$, but each $\sigma_j$ is $\sim$ some multiple of each $\sigma_i$.

To prove (b) we shall show first that $\sigma_j \sim 0$ in $K - \sigma_i$. If not, then by the
lemma $\sigma_j$ is contained in a cycle C mod m, $m \neq 1$, C in $K - \sigma_i$. Let K' be
the complex containing the cells of C; then each of its cells is contained in C,
and hence K' is closed, by Theorem 9; but K' is in $K - \sigma_i$, a contradiction.
Say $\sigma_j = \delta A$ in $K - \sigma_i$; then

$\delta A = \sigma_j - \rho_{ij}\sigma_i$ in K (some $\rho_{ij}$),

and $\sigma_j \sim \rho_{ij}\sigma_i$, as required.

To prove (a), take a $\sigma_i$, and let m be the smallest positive integer such that
$m\sigma_i \sim 0$; if there is none, set m = 0. Each $\mu\sigma_i$ is a cocycle (as dim(K) = n),
and determines an element of ${}^nH_{I_0}$. Further, given any element $\xi$ of ${}^nH_{I_0}$,
determined by the cocycle C,

$C = \sum_j \alpha_j \sigma_j \sim \sum_j \alpha_j(\rho_{ij}\sigma_i) = \mu\sigma_i$,

and $\xi$ is determined by $\mu\sigma_i$. Hence ${}^nH_{I_0}$ is cyclic, of order m.

Remark. There exist complexes in which some $\rho_{ij}$ is $\neq \pm 1$; see AH, p. 280,
Bemerkung.

THEOREM 11.¹⁵ For the homogeneous n-complex K to be irreducibly closed,
either of the two following conditions is (necessary and) sufficient.

($\alpha$) This is (b) of Theorem 10.

($\beta$) ${}^nH_{I_0}(K) \neq 0$, but ${}^nH_{I_0}(K') = 0$ for any proper subcomplex K' of K.


13 The condition $m \neq 1$ could be omitted; for a cycle mod 1 contains no cells, i.e., is
$\equiv 0$ mod 1.

14 Compare AH, p. 277, Satz IV and Satz V.

15 See AH, Theorems on p. 280.










Suppose ($\alpha$) holds; then K is closed. If K is not irreducibly closed, then
there is a proper (homogeneous) subcomplex K' which is closed. Take $\sigma_i$ in
K', $\sigma_j$ in K − K', and say

$\sigma_i - \alpha\sigma_j = \delta A$ in K.

Write

$A = A_1 + A_2$, $A_1$ in K', $A_2$ in K − K';

we shall show that $\sigma_i = \delta A_1$ in K'. As no $\sigma^{n-1}$ of $A_2$ is on a cell of K', $\delta A_2$ has
no cells in K'; hence the part of $\delta A$ in K' is the part of $\delta A_1$ in K', and this is $\sigma_i$.
But then $\sigma_i \sim 0$ in K', contradicting the assumption that K' is closed.

Suppose ($\beta$) holds. Let D be a cocycle not $\sim 0$. By Theorem 6, there is a
cycle C mod m for some $m \neq 1$ with $C \cdot D \not\equiv 0$; hence $C \not\equiv 0$. The cells
contained in C form a closed complex K', by Theorem 9. Now let K* be any
proper subcomplex of K. Then ${}^nH_{I_0}(K^*) = 0$; consequently each $\sigma_i$ of K* is
$\sim 0$ in K*, and K* is not closed. Therefore K' = K, and K is irreducibly
closed.


11. Pseudomanifolds. A pseudomanifold is a strongly connected¹⁶ homogeneous
complex in which each (n − 1)-cell is on either one or two n-cells; it is
closed if each (n − 1)-cell is on two n-cells.

THEOREM 12.¹⁷ If the pseudomanifold K is not closed, then ${}^nH_{I_0}$ vanishes; if it
is closed, then it is an irreducibly closed complex, and ${}^nH_{I_0}$ is cyclic of order 0 or 2
according as K is orientable or not.

Take any succession of distinct n-cells $\sigma^n_1, \sigma^n_2, \dots, \sigma^n_k$, $\sigma^n_i$ and $\sigma^n_{i+1}$ having the
common (n − 1)-face $\sigma^{n-1}_i$. We may orient the cells so that $\sigma^{n-1}_i$ is on $\sigma^n_i$
negatively and on $\sigma^n_{i+1}$ positively. Let A be the sum of these $\sigma^{n-1}_i$; then

$\delta A = \sigma^n_k - \sigma^n_1$.

It follows that any two n-cells of K, with the proper orientations, are $\sim$.

If K is not closed (as a pseudomanifold), there is an (n − 1)-cell $\sigma^{n-1}$ on just
one n-cell $\sigma^n$; then $\delta\sigma^{n-1} = \pm\sigma^n$, and $\sigma^n \sim 0$; hence each $\sigma^n_i \sim 0$, and ${}^nH_{I_0}$
vanishes. Suppose K is closed. Then each $\delta\sigma^{n-1}$ and hence each $\delta A^{n-1}$ is a
chain the sum of whose coefficients is even, and hence cannot be any $\sigma_i$; as each
$\sigma_i \sim \pm$ each $\sigma_j$, K is an irreducibly closed complex, by Theorem 11, ($\alpha$).

If K is not orientable, a cell $\sigma^n_0$ may be joined to itself by a succession of n-
and (n − 1)-cells so as to reverse the orientation. Let A be the sum of the
properly oriented (n − 1)-cells of the chain; then $\delta A = 2\sigma^n_0$, so that $2\sigma^n_0$ and
hence any $2\sigma^n_i$ is $\sim 0$; hence ${}^nH_{I_0}$ is of order 2. Otherwise, we may orient the
cells of K concurrently; then each $\delta\sigma^{n-1}$ and hence each $\delta A^{n-1}$ is a chain, the
sum of whose coefficients vanishes, and no $m\sigma_i$ is $\sim 0$ for $m \neq 0$. It follows
that ${}^nH_{I_0}$ is the infinite cyclic group.


16 That is, each two n-cells are joined by a succession of n- and (n − 1)-cells.
17 Compare AH, p. 281, Satz VIII.










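APPENDIX II


We shall show that Theorem 2 holds if A is any group with a finite number of
generators. Extend f over A* as before. Let $b^1, \dots, b^r$ form a base for A,
and $a^1, \dots, a^s$ a base for A*. Say

$a^i = \sum_{j=1}^{r} \eta^i_j b^j \qquad (i = 1, \dots, s)$.

Let $\mu_i$ be the order of $b^i$ ($\mu_i = 0$ if $b^i$ is of infinite order). Set

$\eta^{s+i}_j = \mu_i$ if $j = i$, $= 0$ if $j \neq i$ $\qquad (i, j = 1, \dots, r)$;

then

$\sum_{j=1}^{r} \eta^{s+i}_j b^j = \mu_i b^i = 0 \qquad (i = 1, \dots, r)$.

Set

$g^i = f(a^i)$ $(i = 1, \dots, s)$, $\qquad g^{s+i} = 0$ $(i = 1, \dots, r)$.

Suppose $\sum_{i=1}^{s+r} \alpha_i \eta^i_j = mk_j$ (all j). Then

$\sum_{i=1}^{s} \alpha_i a^i = \sum_{i=1}^{s} \sum_{j=1}^{r} \alpha_i \eta^i_j b^j = \sum_{j=1}^{r} \sum_{i=1}^{s+r} \alpha_i \eta^i_j b^j = m \sum_{j=1}^{r} k_j b^j$

is in $mA$, and hence, using (a) or (b) (see footnote 8),

$f(\sum_{i=1}^{s} \alpha_i a^i) = \sum_{i=1}^{s} \alpha_i f(a^i) = \sum_{i=1}^{s+r} \alpha_i g^i$

is in $mG$. Therefore, by Theorem 4, there are elements $g_1, \dots, g_r$ of G such
that

$\sum_{j=1}^{r} \eta^i_j g_j = g^i \qquad (i = 1, \dots, s + r)$.

Setting $f'(\sum \alpha_j b^j) = \sum \alpha_j g_j$ defines uniquely a homomorphism of A into G; for

$f'(\mu_j b^j) = \sum_{i=1}^{r} \eta^{s+j}_i g_i = g^{s+j} = 0 \qquad (j = 1, \dots, r)$.

Further,

$f'(a^i) = f'(\sum_j \eta^i_j b^j) = \sum_j \eta^i_j g_j = g^i = f(a^i) \qquad (i = 1, \dots, s)$,

so that f' is an extension of f.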


HARVARD UNIVERSITY.








ON THE MAPS OF AN n-SPHERE INTO ANOTHER n-SPHERE 
By HASSLER WHITNEY


1. Introduction. It is well known that to each map¹ f of an n-sphere $S^n$
into another one $S^n_0$ ($n \geq 1$ always) there corresponds a number $d_f$, the degree of
f, and $d_f = d_g$ if f and g are homotopic (see §2). H. Hopf² has proved the
converse theorem, that if $d_f = d_g$, then f and g are homotopic. The object of this
note is to give an elementary proof of the latter theorem. The methods will be
used and extended in later papers.

In an appendix we give somewhat briefly a proof of the theorem for the case
that $d_f = 0$. This is the only case needed in the following paper; the general
theorem then follows from that paper. The second proof is more intuitive
geometrically than the first, but complete details would make it perhaps more
lengthy.


2. On deformations. A deformation of one space S in another $S_0$ is a family
$\phi_t(p)$ ($0 \leq t \leq 1$, p in S) of maps of S into $S_0$, continuous in both variables
together. Given maps f and g of S into $S_0$, if there exists a deformation $\phi_t$
such that $\phi_0 = f$ and $\phi_1 = g$, we say f and g are homotopic. If f is homotopic to
g, where $g(p) = P_0$ (all p in S), we say f is homotopic to zero, and f may be shrunk
to the point $P_0$.

Suppose S and $S_0$ are complexes, $K_0$ is a simplicial subdivision of $S_0$, and
f maps S into $S_0$. Then, for a sufficiently fine simplicial subdivision K of S,
the following is true. To each vertex V of K we may choose a vertex g(V) of a
cell of $K_0$ which contains f(V), so that the vertices of any cell of K go into the
vertices of a cell of $K_0$. This determines uniquely a "simplicial map" g of K
into $K_0$, affine in each cell (see §5); moreover, f is homotopic to g.


3. The degree of a map. Let $S^n_0$ be the unit n-sphere in (n + 1)-space, let
$K^n_0$ be a simplicial triangulation of $S^n_0$, and let $\sigma^n_0$ be an n-cell of $K^n_0$. We choose
$K^n_0$ so that if $P_1$ is a point of $\sigma^n_0$ and $P_0$ is the antipodal point of $S^n_0$, each great
semicircle from $P_1$ to $P_0$ intersects the boundary $\partial\sigma^n_0$ of $\sigma^n_0$ in exactly one point.
By pushing along these semicircles, we define a deformation $\Omega_t$ of the identity
$\Omega_0(p) = p$ into a map $\Omega_1$, where $\Omega_1(p) = P_0$ for p in $S^n_0 - \sigma^n_0$.

Let $\sigma^k$ be a k-cell ($k \leq n$), in fixed correspondence with a k-simplex, and let


Received December 1, 1936; presented to the American Mathematical Society, October, 
1936. 
1 All maps will be assumed continuous. 
2 See Alexandroff-Hopf, Topologie, I, Berlin, 1935, pp. 501-505. See also the reference 
to Lefschetz in the following paper. 


f map $\sigma^k$ into $S^n_0$. We say f is standard if $f(p) \equiv P_0$, or, k = n and for some
affine map $\phi$ of $\sigma^k = \sigma^n$ into $\sigma^n_0$, $f(p) = \Omega_1(\phi(p))$. In any case, $f(p) = P_0$ in $\partial\sigma^k$.³
The map f of an n-complex $K^n$ into $S^n_0$ is standard if it is standard over each
k-cell ($k \leq n$).

We may orient $S^n_0$ by orienting $\sigma^n_0$. Let $K^n$ be a simplicial triangulation of
the oriented n-sphere $S^n$, and let f be a standard map of $K^n$ into $S^n_0$. Let $\sigma^n$
be an (oriented) n-cell of $K^n$. If $f(p) \equiv P_0$ in $\sigma^n$, we set $d_f(\sigma^n) = 0$. Otherwise,
there is a simplicial map $\phi$ of $\sigma^n$ into (the whole of) $\sigma^n_0$ such that
$f(p) = \Omega_1(\phi(p))$ in $\sigma^n$; we set $d_f(\sigma^n) = 1$ or $-1$ according as $\phi$ is positive or negative.
We define the degree of f by

(3.1) $d_f = \sum d_f(\sigma^n)$.


4. The theorem. In homology theory it is shown how to attach to each map
f of $S^n$ into $S^n_0$ (both spheres oriented) an integer $d_f$, the degree of the map.
Moreover, if f is homotopic to g, then $d_f = d_g$, and if $S^n$ and $S^n_0$ are triangulated
and f is standard, then $d_f$ is given by (3.1).

Suppose f and g map $S^n$ into $S^n_0$, and $d_f = d_g$. Then for a sufficiently fine
subdivision $K^n$ of $S^n$, both f and g can be deformed into simplicial maps and
hence into standard maps $\phi$ and $\psi$. As f and $\phi$, also g and $\psi$, are homotopic,
$d_\phi = d_\psi$. By Theorem 1 below, $\phi$ is homotopic to $\psi$; hence f is homotopic to g.
Therefore this theorem furnishes the converse of the statements above.

THEOREM 1. If $\phi$ and $\psi$ are standard maps of $S^n$ into $S^n_0$, using the same
subdivision $K^n$ of $S^n$, and $d_\phi = d_\psi$, then $\phi$ is homotopic to $\psi$.

From the proof below, the following corollary is apparent.

COROLLARY. If $\phi(V) = \psi(V) = P_0$ for a fixed vertex V of $K^n$, we can make V
remain at $P_0$ throughout the deformation.

In fact, if $n \geq 2$, all vertices of $K^n$ remain at $P_0$. If n = 1, we may choose
the chains of cells in §8 so that in no chain do we pass over V; then V is never
moved.

THEOREM 2. For any integer $\gamma$ there is a map f of $S^n$ into $S^n_0$ with $d_f = \gamma$.

To prove this, subdivide $S^n$ into $\alpha \geq |\gamma|$ n-cells. Let $\phi$ map $|\gamma|$ of these
cells simplicially into $\sigma^n_0$, positively or negatively according as $\gamma > 0$ or $\gamma < 0$
(if $\gamma \neq 0$), and set $f(p) = \Omega_1(\phi(p))$ in these cells and $f(p) = P_0$ elsewhere.
Clearly $d_f = \gamma$. Note that the degree of the identity map of $S^n_0$ into itself is 1.

The remainder of the paper is devoted to the proof of Theorem 1.


5. Coördinates $p_t$ in a cell. Any simplicial complex $K^n$ is homeomorphic
to a complex $\bar{K}^n$ in euclidean space whose cells are straight. Using $\bar{K}^n$, we
define straightness in $K^n$, the center of a cell (i.e., center of mass of its vertices),
etc. Hence an "affine map" of one cell into another has meaning. Let $\sigma$ be
a cell of $K^n$, and a the center of $\sigma$. For each point p of the boundary $\partial\sigma$ of $\sigma$
let $p_t$ be the point of the segment ap such that $ap_t/ap = t$.


3 There are (n + 1)! standard maps $\phi$ of $\sigma^n$ into $S^n_0$ with $\phi(p) \not\equiv P_0$.










6. Certain deformations of simplexes. We prove first a combinatorial lemma, 
needed in Lemma 2. 

LEMMA 1. Any even permutation of the letters $a_0a_1 \cdots a_n$ ($n \geq 2$) may be made
by means of a succession of cyclic permutations, each on three of the letters.

This is clear if n = 2; then any even permutation is cyclic. Suppose n > 2, 
and let B = ag, --+ Ga, be any even permutation. If a, # n, bring aq, to the 
right end by a cyclic permutation; bring a,.,_, next to a,,. Suppose ay + 0. 
We then perform the two cyclic permutations 


Gq ++ * Gay *** Ban; Fan —* Gag *** Bag-; *** Wa, —> Gag -** +++ Aa, Ga,-; - 


If n = 4 and ag, is not now in the second place, we perform two cyclic permuta- 
tions to bring it there, again interchanging a,,_, and a,,, ete. When a,,,---, 
@q,, are in their correct places, a,,_, is also; as B is even and the above permuta- 
tions are even, d,,_, and a,, are also in their correct positions. 
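
Lemma 1 can be illustrated computationally. The Python sketch below is not the procedure of the proof above; it is a different, left-to-right greedy procedure, given only as an illustration, and the function names are hypothetical. A permutation of 0, ..., n is stored as a list, and each move is a triple of positions (i, j, k) on which the entries are cycled. The sketch sorts an even permutation to the identity using only such 3-cycles and raises an error if the permutation is odd.

```python
from itertools import permutations

def sort_with_three_cycles(perm):
    """Return position triples (i, j, k) whose successive 3-cycles sort perm.
    Raises ValueError if perm is an odd permutation."""
    a = list(perm)
    n = len(a)
    moves = []
    for i in range(n - 2):
        if a[i] == i:
            continue
        j = a.index(i)  # j > i, since positions before i already hold their own values
        k = next(t for t in range(i + 1, n) if t != j)  # any third position after i
        # Cycle three entries: old a[j] goes to i, old a[k] to j, old a[i] to k.
        a[i], a[j], a[k] = a[j], a[k], a[i]
        moves.append((i, j, k))
    if a != sorted(a):
        # Only the last two entries can be wrong, and then perm was odd.
        raise ValueError("permutation is odd")
    return moves

def apply_moves(perm, moves):
    """Apply the recorded 3-cycles to a copy of perm."""
    a = list(perm)
    for i, j, k in moves:
        a[i], a[j], a[k] = a[j], a[k], a[i]
    return a

def is_even(perm):
    """Parity by counting inversions."""
    inversions = sum(1 for i in range(len(perm)) for j in range(i + 1, len(perm))
                     if perm[i] > perm[j])
    return inversions % 2 == 0

for perm in permutations(range(4)):
    if is_even(perm):
        assert apply_moves(perm, sort_with_three_cycles(perm)) == [0, 1, 2, 3]
```

Since every 3-cycle is an even permutation, parity is preserved at each step, which is why the procedure can only fail on an odd input.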

LEMMA 2. Let $\sigma^n = a_0 \cdots a_n$ be a simplex, and let $a_{\alpha_0} \cdots a_{\alpha_n}$ be an even
permutation of its vertices. Then there is a deformation $\phi_t$ of $\sigma^n$ in itself, such that
$\phi_0(p) \equiv p$, $\phi_1(a_i) = a_{\alpha_i}$, $\phi_1$ is affine, and $\phi_t$ for each t is a homeomorphism both in
$\sigma^n$ and in its boundary.

If n = 0 or 1, the lemma is trivial. Suppose that n = 2; say dap@a,da, = 
ayaxda,. Let ¢(a;) be the point p of aaj, (setting 2 + 1 = 0) for which 
a;p/aai,, = t. Let @& map the segment a;a;,, into the broken line ¢;(a,)ai1¢: 
(a;,,) so that, if the line were straightened, the map would be linear. For any 
point p, (see $5) interior to o°, set ¢(p.) = (@(p)).. As ¢(a) = a = center 
of mass of a”, ¢) is easily seen to be affine. 

Now suppose n > 2; consider first a cyclic permutation, changing say apa;d2 
intO a;d2d). Set ¢ = apasa2, o’ = az --- a,, and let [p, q, u] for p ino, gino’, 
0 < u S 1, be the point r of the segment pq for which pr/pq = u. Define ¢ ine 
as above. For any point [p, q, u] not in a, set [p, q, u] = [¢:(p), gq, ul. We 
show that ¢@ is a homeomorphism. Suppose ¢:[p, g, u] = ¢&[p’, q’, u’]; then 
[d:(p), g, u) = [d(p’), 9’, u’], which implies ¢,(p) = ¢(p’), q = q’, u = w; as 
@ is a homeomorphism in ¢, p = p’ also. Further, given [p, q, u] and t, we may 
find a p* for which ¢(p*) = p; then ¢&[p*, q, u] = [p, q, u]. The other proper- 
ties of ¢ are clear, and the lemma for this case is proved. Now take any permu- 
tation. We may obtain it by cyclic permutations as in Lemma 2; the cor- 
responding deformations together give the required deformation. 


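7. Two types of deformations of $S^n$ in $S_0^n$. Let $\phi'$ be a standard map of $S^n$
into $S_0^n$, and let $\sigma$ and $\sigma'$ be oriented $n$-cells of $K^n$ with the common $(n - 1)$-face
$\tau$:

$$\sigma = a_0 a_1 \cdots a_n, \qquad \sigma' = -a_0' a_1 \cdots a_n, \qquad \tau = a_1 \cdots a_n.$$

(a) Suppose $d_{\phi'}(\sigma) = 1$, $d_{\phi'}(\sigma') = 0$; we shall deform $\phi'$ into $\phi''$ so that $d_{\phi''}(\sigma)
= 0$, $d_{\phi''}(\sigma') = 1$, leaving $K^n - (\sigma + \sigma')$ fixed.
(b) Suppose $d_{\phi'}(\sigma) = 1$, $d_{\phi'}(\sigma') = -1$; we shall obtain $d_{\phi''}(\sigma) = d_{\phi''}(\sigma') = 0$.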

4 This is so if $0 < u < 1$, as we may assume.













ON MAPS OF AN n-SPHERE INTO ANOTHER n-SPHERE 49


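In each case $\phi''$ will be a standard map.

(a) Set $\sigma_1 = a_0 a_2 \cdots a_n$, $\sigma_1' = a_0' a_2 \cdots a_n$, or if $n = 1$, then $\sigma_1 = a_0$, $\sigma_1' = a_0'$.
Let $\theta_1$ and $\theta_2$ be the affine maps of $\sigma_1$ into $\tau$ and $\sigma_1'$ determined by sending $a_0$
into $a_1$ and $a_0'$ respectively. For each $p$ in $\sigma_1$, let $\alpha(p, u)$ run linearly along the
segments $p\,\theta_1(p)$ and $\theta_1(p)\,\theta_2(p)$ as $u$ runs from 0 to 1 and from 1 to 2. Set

(7.1) $\phi_t[\alpha(p, u)] = \begin{cases} \phi'[\alpha(p, u - t)] & (t \le u), \\ \phi'[\alpha(p, 0)] & (t > u), \end{cases}$

and $\phi_t(p) = \phi'(p)$ in $K^n - (\sigma + \sigma')$. As $\phi'(p) = P_0$ in $\partial\sigma + \sigma'$, this is clearly
a deformation of $\phi' = \phi_0$ into a map $\phi'' = \phi_1$. The map $\phi''$ in $\sigma'$ is obtained
from the map $\phi'$ in $\sigma$ by replacing $a_0, a_1, \cdots, a_n$ (which form $+\sigma$) by $a_1, a_0',
\cdots, a_n$ (which form $+\sigma'$); hence $d_{\phi''}(\sigma') = d_{\phi'}(\sigma)$. Also $d_{\phi''}(\sigma) = 0$ as $\phi''(p) =
P_0$ in $\sigma$, and (a) is proved.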

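(b) Let $\lambda$ and $\lambda'$ be the affine maps of $\sigma$ and $\sigma'$ into $\sigma_0^n$ such that $\phi'(p) =
\Omega_1(\lambda(p))$ in $\sigma$ and $= \Omega_1(\lambda'(p))$ in $\sigma'$. Say $\sigma_0^n = b_0 \cdots b_n$,

$$\lambda(a_i) = b_{k_i}, \quad \text{and} \quad \lambda'(a_0') = b_{l_0}, \quad \lambda'(a_i) = b_{l_i} \quad (i > 0).$$

As $d_{\phi'}(\sigma') = -d_{\phi'}(-\sigma')$, and hence

$$d_{\phi'}(\sigma) = d_{\phi'}(a_0 a_1 \cdots a_n) = -d_{\phi'}(\sigma') = d_{\phi'}(a_0' a_1 \cdots a_n),$$

$b_{l_0} \cdots b_{l_n}$ is an even permutation of $b_{k_0} \cdots b_{k_n}$. Applying Lemma 2, we find
a deformation $\lambda_t'$ of $\sigma'$ in $\sigma_0^n$ such that $\lambda_0' = \lambda'$, $\lambda_1'$ is affine, and

(7.2) $\lambda_1'(a_0') = \lambda(a_0), \quad \lambda_1'(a_i) = \lambda(a_i) \quad (i > 0).$

Set

(7.3) $\phi_t(p) = \begin{cases} \phi'(p) & (p \text{ in } K^n - \sigma'), \\ \Omega_1(\lambda_t'(p)) & (p \text{ in } \sigma'). \end{cases}$

Then as $\Omega_1(\lambda_t'(p)) = P_0$ in $\partial\sigma'$, $\phi_t$ is a deformation of $\phi'$ into a map $\phi^* = \phi_1$.

For each $p$ in $\tau$, let $\beta(p, u)$ be the point $q$ of the segment $a_0 p$ of $\sigma$ such that
$a_0 q / a_0 p = u$, and let $\beta'(p, u)$ be the corresponding point of the segment $a_0' p$ in $\sigma'$.
As $\lambda$ and $\lambda_1'$ are affine, (7.2) and (7.3) give

(7.4) $\phi^*[\beta(p, u)] = \phi^*[\beta'(p, u)] \quad (p \text{ in } \tau,\ 0 \le u \le 1).$

We deform $\phi^*$ into $\phi''$ by setting

(7.5) $\phi_t[\beta(p, u)] = \phi_t[\beta'(p, u)] = \phi^*[\beta(p, (1 - t)u)],$

and $\phi_t(p) = \phi^*(p)$ in $K^n - (\sigma + \sigma')$. This is clearly a deformation; (7.4)
shows that $\phi_0 = \phi^*$. As $\phi''(p) = \phi_1(p) = P_0$ in $\sigma + \sigma'$, $d_{\phi''}(\sigma) = d_{\phi''}(\sigma') = 0$.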


8. Proof of Theorem 1. Suppose there are cells of $K^n$ mapped positively
over $S_0^n$ by $\phi$, and also cells mapped negatively. Then we can find a chain
$\sigma_0, \cdots, \sigma_\nu$ of adjacent $n$-cells of $K^n$ such that

$$d_\phi(\sigma_0) = 1, \quad d_\phi(\sigma_1) = 0, \ \cdots, \ d_\phi(\sigma_{\nu-1}) = 0, \quad d_\phi(\sigma_\nu) = -1.$$













Using (a), §7, we deform $\phi$ in $\sigma_0 + \sigma_1$, then in $\sigma_1 + \sigma_2$, etc.; then, using (b), §7,
we deform the map in $\sigma_{\nu-1} + \sigma_\nu$. The new map $\phi'$ has $d_{\phi'}(\sigma_i) = 0$ ($i = 0,
\cdots, \nu$). Continue in this manner till no cells are mapped positively or none
are mapped negatively over $S_0^n$; for definiteness, say the latter holds. Do the
same for $\psi$. The new maps $\phi^*$ and $\psi^*$ each have exactly $d_\phi = d_\psi$ cells mapped
positively over $S_0^n$.


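Suppose $d_{\phi^*}(\sigma) \ne d_{\psi^*}(\sigma)$ for some $\sigma$. Then let $\sigma_0, \sigma_1, \cdots, \sigma_\nu$ be a chain of
adjacent $n$-cells such that

$$d_{\phi^*}(\sigma_0) = d_{\psi^*}(\sigma_\nu) = 1, \qquad d_{\phi^*}(\sigma_\nu) = d_{\psi^*}(\sigma_0) = 0,$$
$$d_{\phi^*}(\sigma_i) = d_{\psi^*}(\sigma_i) \qquad (0 < i < \nu).$$

Let $\sigma_0, \sigma_{k_1}, \cdots, \sigma_{k_s}$ be the cells of the chain for which $d_{\phi^*} = 1$. Using (a), §7,
we deform $\phi^*$ over $\sigma_{k_s} + \sigma_{k_s+1}$ etc. until we have $d_{\phi_1^*}(\sigma_{k_s}) = 0$, $d_{\phi_1^*}(\sigma_\nu) = 1$;
another succession of deformations makes $d_{\phi_2^*}(\sigma_{k_{s-1}}) = 0$, $d_{\phi_2^*}(\sigma_{k_s}) = 1$, etc.
Finally $d_{\phi^{**}}(\sigma_0) = 0$, $d_{\phi^{**}}(\sigma_{k_i}) = 1$ (all $i$), and $d_{\phi^{**}}(\sigma_\nu) = 1$. $d_{\phi^{**}}(\sigma)$ differs from
$d_{\psi^*}(\sigma)$ over fewer cells than $d_{\phi^*}(\sigma)$. Continuing in this manner, we deform $\phi^*$
into a map $\phi'$ with $d_{\phi'}(\sigma) = d_{\psi^*}(\sigma)$, all $\sigma$. $\phi'$ and $\psi^*$ are standard. Applying
Lemma 2, we deform $\phi'$ over each $n$-cell where necessary, to obtain $\psi^*$. (Compare
the first half of the proof of (b), §7.) This completes the proof.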


Appendix⁵


Let $f$ be a map of $S^n$ into $S_0^n$ with the degree 0. We first deform it into a
simplicial map and then into a standard map $\phi$ (see §§ 2, 3). To shrink $\phi$ to
a point is equivalent to extending $\phi$ through the interior $R$ of $S^n$ (see the following
paper, § 4). Let $\sigma_1, \cdots, \sigma_\nu$ and $\sigma_1', \cdots, \sigma_\nu'$ be the simplexes of $S^n$
mapped positively and negatively over $S_0^n$ respectively. Let $T_i$ be a tube joining
$\sigma_i$ to $\sigma_i'$ inside $R$. We may choose these so no two intersect, and also (to
prove the corollary) so no one cuts the radius of $R$ to the vertex $V$. Let $a_0 \cdots a_n$
and $a_0' \cdots a_n'$ be positive and negative orientations of $\sigma_i$ and $\sigma_i'$ respectively,
such that $\lambda(a_j) = b_j$ and $\lambda'(a_j') = b_j$ determine simplicial maps of $\sigma_i$ and $\sigma_i'$ into
$\sigma_0^n$, which in turn determine $\phi$ in $\sigma_i$ and $\sigma_i'$. Now carry $\sigma_i$ through $T_i$ to $\sigma_i'$,
turning it so that $a_j$ goes into $a_j'$; let $g_t(\sigma_i)$ be the position of $\sigma_i$ after the time $t$.
We do this so that $g_t(\sigma_i)$ does not intersect $g_{t'}(\sigma_i)$ if $t \ne t'$. (We are using a
deformation theorem on simplexes in euclidean space, similar to but simpler than
Lemma 2.) The definition of $\phi$ in $R$ is as follows. For $p$ not in any $g_t(\sigma_i)$, set
$\phi(p) = P_0$. For $p$ in $g_t(\sigma_i)$, choose $q$ in $\sigma_i$ so that $p = g_t(q)$, and set $\phi(p) = \phi(q)$.


HARVARD UNIVERSITY. 


5 Added in proof. 













THE MAPS OF AN n-COMPLEX INTO AN n-SPHERE 
By HASSLER WHITNEY


1. Introduction. The classes of maps of an n-complex into an n-sphere
were classified by H. Hopf¹ in 1932. Recently, W. Hurewicz² has extended
the theorem by replacing the n-sphere by much more general spaces. Freudenthal³
and Steenrod⁴ have noted that the theorem and proof are simplified
by using real numbers reduced mod 1 in place of integers as coefficients in the
chains considered. We shall give here a statement of the theorem which seems
the most natural; the proof is quite simple. As in the original proof by Hopf,
we shall base it on a more general extension theorem.

The fundamental tool of the paper is the relation of “coboundary”;⁵ it has
come into prominence in the last few years.

In later papers we shall classify the maps of a 3-complex into a 2-sphere and 
of an n-complex into projective n-space. 


I. Elementary facts 


2. Boundaries and coboundaries. Let $K$ be a complex, with oriented cells $\sigma_i^r$
(not necessarily simplicial) of dimension $r$, $r = 0, \cdots, n$. Let $\partial_{ij}^r = 1$, $-1$, or 0
according as $\sigma_j^{r-1}$ is positively, negatively, or not at all, on the boundary of $\sigma_i^r$.
An $r$-chain $C^r$ is a linear form $\sum \alpha_i \sigma_i^r$, the $\alpha_i$ being integers (or elements of an
abelian group). The boundary (or contraboundary) and coboundary of $C^r$ are
defined by

(2.1) $\partial\left(\sum \alpha_i \sigma_i^r\right) = \sum \alpha_i\, \partial_{ij}^r\, \sigma_j^{r-1}, \qquad \delta\left(\sum \alpha_i \sigma_i^r\right) = \sum \alpha_i\, \partial_{ji}^{r+1}\, \sigma_j^{r+1}.$

As in the ordinary theory, we say $C^r$ is a cocycle if its coboundary vanishes,
and $C^r$ is cohomologous to $D^r$, $C^r \sim D^r$, if $C^r - D^r$ is a coboundary. The relation
$\delta\delta C^r = 0$ (easily proved; equivalent to $\partial\partial C^r = 0$) says that every coboundary


Received December 1, 1936; presented to the International Topological Conference in 
Moscow, September 1935, and to the American Mathematical Society, October 1935. 

1 H. Hopf, Commentarii Mathematici Helvetici, vol. 5 (1932), pp. 39-54. See also
Alexandroff-Hopf, Topologie I, Ch. XIII. A recent proof has been given by S. Lefschetz,
Fund. Math., vol. 27 (1936), pp. 94-115. In Lemma 3 he gives a new proof of the theorem
of the preceding paper; the author does not understand how the final map is made
simplicial.

2 W. Hurewicz, Proc. Kon. Akad. Wet. Amsterdam, vols. 38-39 (1935-36); in particular,
vol. 39, pp. 117-126. The full paper will appear in the Annals of Math.

3 H. Freudenthal, Compositio Math., vol. 2 (1935), footnote 8.

4 Unpublished.

5 This is discussed briefly in §2. For further details, see our paper On matrices of
integers, pp. 35-45 of this volume of this Journal. We refer to this paper as I. The relation
of Theorems 2, 3 and 4 to the theorems as stated by Hopf is made apparent by the
theorems in I. The present paper is independent of I.


















is a cocycle. Hence we may define the difference group of the group of $r$-cocycles
over the group of $r$-coboundaries, forming the $r$-th cohomology group.⁶
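The index conventions of (2.1) can be checked on a small case. The sketch below is an illustration, not taken from the paper: it assumes a hypothetical complex consisting of one filled triangle, stores the incidence numbers as nested lists, and verifies that applying the boundary twice, or the coboundary twice, gives zero.

```python
# Hypothetical example complex: one filled triangle with vertices v0, v1, v2.
# D1[i][j] is the incidence of vertex j on the boundary of edge i,
# with edges ordered v0v1, v1v2, v0v2.
D1 = [[-1, 1, 0],
      [0, -1, 1],
      [-1, 0, 1]]
# D2[i][j] is the incidence of edge j on the boundary of the 2-cell v0v1v2.
D2 = [[1, 1, -1]]


def boundary(coeffs, D):
    """Boundary of sum_i coeffs[i] * sigma_i^r, where D is the incidence array of dimension r."""
    return [sum(coeffs[i] * D[i][j] for i in range(len(D))) for j in range(len(D[0]))]


def coboundary(coeffs, D_next):
    """Coboundary of sum_i coeffs[i] * sigma_i^r, where D_next is the incidence array of dimension r + 1."""
    return [sum(coeffs[i] * D_next[j][i] for i in range(len(coeffs)))
            for j in range(len(D_next))]


# Boundary of the 2-cell, then boundary again.
edge_chain = boundary([1], D2)
# Coboundary of an arbitrary 0-chain, then coboundary again.
one_cochain = coboundary([5, -2, 7], D1)
```

Here `boundary(edge_chain, D1)` and `coboundary(one_cochain, D2)` both come out as zero chains, which is the relation quoted at the end of §2 in this small case.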


3. Normal maps of cells into $S_0^n$. Let $S_0^n$ be the (oriented) unit $n$-sphere in
$(n + 1)$-space. Let $f$ map the (oriented) $n$-cell $\sigma^n$ into $S_0^n$. We say $f$ is normal
if $f(p) = P_0$, a fixed point of $S_0^n$, for $p$ in the boundary $\partial\sigma^n$ of $\sigma^n$. This is equivalent
to identifying the points of $\partial\sigma^n$ in $\sigma^n$, forming an $n$-sphere $S^n$, and mapping
this sphere into $S_0^n$. Hence we may define the degree⁷ $d_f(\sigma^n)$. If $f$ and $g$ are
normal in $\sigma^n$ and $d_f(\sigma^n) = d_g(\sigma^n)$, then we may deform $f$ into $g$, keeping $\partial\sigma^n$
at $P_0$, by II, corollary.

Any map $f$ of $\sigma^r$ into $S_0^n$, $r < n$, may be shrunk to $P_0$: we deform $f$ into a
simplicial map, and apply $\Omega_t$ (see II, §3). $P_0$ being assumed a vertex of $K_0$,
if $\partial\sigma^r$ is at $P_0$ it remains there during the deformation.

If $K$ is any complex, let $K^r$ be the subcomplex of $K$ containing all its cells of
dimension $\le r$. The map $f$ of $K$ into $S_0^n$ is normal if $f(p) = P_0$ for $p$ in $K^{n-1}$.
Suppose $\sigma^n$ or $S^n$ is subdivided into cells $\sigma_i^n$, and $f$ is a normal map of it into $S_0^n$.
Then the $d_f(\sigma_i^n)$ are defined, and

(3.1) $d_f(\sigma^n)$ or $d_f(S^n) = \sum d_f(\sigma_i^n).$

To show this, subdivide $\sigma^n$ or $S^n$ further, so that we can deform $f$ into a simplicial
map, and apply $\Omega_t$ (see II, §3). The above quantities are unchanged,
and (3.1) is now a consequence of II, (3.1).


4. On deformations. We shall need the following elementary results. Let
$K \times I$ be the product of $K$ and the unit interval $I$, consisting of all pairs $(p, t)$,
$p$ in $K$, $0 \le t \le 1$. The deformation $\phi_t(p)$ of $K$ in $S_0^n$ is equivalent to the map
$\Phi(p, t) = \phi_t(p)$ of $K \times I$ into $S_0^n$. Hence $\phi_0$ is homotopic to $\phi_1$ if and only if $\Phi$,
defined over $K \times 0 + K \times 1$, may be extended over $K \times I$.

Let $f$ map the boundary $\partial\sigma^r$ of $\sigma^r$ into $S_0^n$. Then $f$ is homotopic to zero (in $\partial\sigma^r$)
if and only if it may be extended through $\sigma^r$. For the deformation $f_t(p)$ ($p$ in
$\partial\sigma^r$) into $f_1(p) = P_0$ is equivalent to the map $f(p_{1-t})$ (see II, §5) $= f_t(p)$ of $\sigma^r$
into $S_0^n$.

Lemma 1. If $\phi = \phi_0$ maps $\sigma^r$ into $S_0^n$, and the deformation $\phi_t$ of $\phi$ is defined over
$\partial\sigma^r$, then its definition may be extended over $\sigma^r$.

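We define $\phi_t$ in $\sigma^r$ by

(4.1) $\phi_t(p_u) = \begin{cases} \phi_0(p_{(t+1)u}) & \left(0 \le u \le \dfrac{1}{t+1}\right), \\[4pt] \phi_{t+1-1/u}(p) & \left(\dfrac{1}{t+1} \le u \le 1\right). \end{cases}$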


6 This is the character group of the homology group with numbers mod 1 as coefficient
group.

7 See pp. 46-50 of this volume of this Journal; we refer to this paper as II.







Lemma 2. Any map $\phi$ of $K$ into $S_0^n$ may be deformed into a normal one; all cells
already at $P_0$ we may keep fixed.

We deform the map successively so that $K^0, K^1, \cdots, K^{n-1}$ are at $P_0$. Suppose
$K^{r-1}$ is at $P_0$ (if $0 < r < n$). As each $\partial\sigma^r$ is at $P_0$, we may deform each
$\sigma^r$ into $P_0$, keeping $\partial\sigma^r$ at $P_0$ (see §3). This deformation, defined over $K^r$, is
extended over all $(r + 1)$-cells, $(r + 2)$-cells, etc., by Lemma 1. It is now defined
over $K$, and $K^r$ is at $P_0$.


5. Parts of cocycles. Let $K'$ be a subcomplex of $K$. Any $r$-chain $C$ of $K$
may be written $C' + C''$, the coefficients of cells of $K - K'$ [of $K'$] being zero
in $C'$ [in $C''$]. We say $C'$ is part of $C$. Clearly the chain $C'$ in $K'$ is part of a
cocycle if and only if $\delta C'$ cobounds in $K - K'$, i.e., if and only if for some chain
$C''$ in $K - K'$, $\delta C' = \delta C''$. The $(r + 1)$-chains are chains of $K$.


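6. The product $K \times I$. We subdivide $K \times I$ (see §4) by means of all cells
$\sigma_i^r \times I$ ($\sigma_i^r$ in $K$). Orient the cells $\sigma_i^r \times 0$ and $\sigma_i^r \times 1$ like the $\sigma_i^r$, and orient each
$(r + 1)$-cell $\sigma_i^r \times I$ so that $\sigma_i^r \times 1$ is on its boundary positively. Then

(6.1) $\delta(\sigma_i^r \times 0) = -\sigma_i^r \times I + \cdots, \qquad \delta(\sigma_i^r \times 1) = \sigma_i^r \times I + \cdots,$

(6.2) $\delta(\sigma_i^r \times I) = -\sum_j \partial_{ji}^{r+1} (\sigma_j^{r+1} \times I).$

To prove (6.2), say $\delta(\sigma_i^r \times I) = A_{ji}^{r+1}(\sigma_j^{r+1} \times I) + \cdots$. Then

$$\delta\delta(\sigma_i^r \times 1) = \delta\Big[(\sigma_i^r \times I) + \sum_j \partial_{ji}^{r+1} (\sigma_j^{r+1} \times 1)\Big] = (A_{ji}^{r+1} + \partial_{ji}^{r+1})\, \sigma_j^{r+1} \times I + \cdots = 0,$$

and $A_{ji}^{r+1} = -\partial_{ji}^{r+1}$. The first equation in (6.1) is clear for $r = 0$; it is proved in
succession for $r = 1, 2, \cdots$ by considering the coefficient of $\sigma_i^r \times I$ in $\delta\delta(\sigma_j^{r-1} \times 0)$.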
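The sign conventions in (6.1) and (6.2) can be sanity-checked by machine. The following Python sketch is an illustration under stated assumptions, not the paper's method: $K$ is a hypothetical filled triangle, a cell of $K \times I$ is a triple `(r, i, tag)` with `tag` one of `"0"`, `"1"`, `"I"`, and the terms left as dots in (6.1) are assumed to be the cells of the coboundary of $\sigma_i^r$ in $K$, carried to the same end of the product.

```python
from collections import defaultdict

# Hypothetical example complex K: one filled triangle v0 v1 v2.
# D[r][i][j] is the incidence of sigma_j^(r-1) on the boundary of sigma_i^r.
D = {
    1: [[-1, 1, 0], [0, -1, 1], [-1, 0, 1]],  # edges v0v1, v1v2, v0v2
    2: [[1, 1, -1]],                          # the 2-cell v0v1v2
}


def delta_K(r, i):
    """Coboundary of sigma_i^r in K, as (j, coefficient) pairs over (r+1)-cells."""
    rows = D.get(r + 1, [])
    return [(j, row[i]) for j, row in enumerate(rows) if row[i] != 0]


def delta_cell(cell):
    """Coboundary in K x I of the cell (r, i, tag)."""
    r, i, tag = cell
    out = defaultdict(int)
    if tag == "I":
        for j, c in delta_K(r, i):          # formula (6.2)
            out[(r + 1, j, "I")] -= c
    else:
        out[(r, i, "I")] += 1 if tag == "1" else -1   # leading terms of (6.1)
        for j, c in delta_K(r, i):          # assumed form of the remaining terms
            out[(r + 1, j, tag)] += c
    return dict(out)


def delta_chain(chain):
    """Extend delta_cell linearly to a chain given as {cell: coefficient}."""
    out = defaultdict(int)
    for cell, a in chain.items():
        for target, c in delta_cell(cell).items():
            out[target] += a * c
    return {cell: v for cell, v in out.items() if v != 0}


ALL_CELLS = [(r, i, tag) for r, n in ((0, 3), (1, 3), (2, 1))
             for i in range(n) for tag in ("0", "1", "I")]
```

With these signs, `delta_chain(delta_cell(c))` is the empty chain for every `c` in `ALL_CELLS`, which is the relation used in the proof of (6.2).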
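Theorem 1. Let $C_0$ and $C_1$ be $n$-chains in $K = K^n$, and let $D_0$ and $D_1$ be the
corresponding chains in $K \times 0$ and $K \times 1$. Then $D_0 + D_1$ (as a chain in $K \times I$)
is part of a cocycle if and only if $C_0 \sim C_1$ in $K$.

Say

$$C_0 = \sum a_i \sigma_i^n, \qquad C_1 = \sum b_i \sigma_i^n.$$

Consider any $n$-chain

(6.3) $D = D_0 + D_1 + \sum h_j (\sigma_j^{n-1} \times I);$

then, by (6.1) and (6.2),

(6.4) $\delta D = -\sum a_i (\sigma_i^n \times I) + \sum b_i (\sigma_i^n \times I) - \sum h_j\, \partial_{ij}^n (\sigma_i^n \times I) = \sum_i \Big[ b_i - a_i - \sum_j h_j\, \partial_{ij}^n \Big] (\sigma_i^n \times I).$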


8 $K - K'$ is in general not a subcomplex of $K$, i.e., is not closed in $K$.
















Suppose $D_0 + D_1$ is part of a cocycle $D$; then (6.4) set $= 0$ gives

$$\delta\Big(\sum h_j \sigma_j^{n-1}\Big) = \sum h_j\, \partial_{ij}^n\, \sigma_i^n = \sum (b_i - a_i)\, \sigma_i^n = C_1 - C_0,$$

and $C_0 \sim C_1$. Conversely, suppose $C_1 - C_0 = \delta(\sum h_j \sigma_j^{n-1})$; then the last set of
equations shows that the bracket in (6.4) vanishes, and hence $D$, defined by
(6.3), is a cocycle.


II. The theorems 


7. The extension theorem. We shall prove

Theorem 2. Let $f$ be a normal map of the subcomplex $K'$ of $K = K^{n+1}$ into $S_0^n$.
Then $f$ can be extended over $K$ if and only if the chain

(7.1) $D' = \sum_{\sigma_i^n \text{ in } K'} d_f(\sigma_i^n)\, \sigma_i^n$

in $K'$ is part of a cocycle.

First suppose $D'$ is part of a cocycle $D = \sum \alpha_i \sigma_i^n$:

(7.2) $\alpha_i = d_f(\sigma_i^n)$ ($\sigma_i^n$ in $K'$), $\qquad \sum_i \alpha_i\, \partial_{ji}^{n+1} = 0$ (all $j$).

$f$ maps $(K')^{n-1}$ into $P_0$; set $f(p) = P_0$ in $K^{n-1}$. Let $f$ map each $\sigma_i^n$ not in $K'$
into $S_0^n$ with the degree $\alpha_i$ (see II, Theorem 2); then (7.2) holds for all $\sigma_i^n$. Consider
any $(n + 1)$-cell $\sigma_j^{n+1}$ of $K - K'$. Using (3.1), we find

(7.3) $d_f(\partial\sigma_j^{n+1}) = d_f\Big(\sum_i \partial_{ji}^{n+1} \sigma_i^n\Big) = \sum_i \partial_{ji}^{n+1}\, d_f(\sigma_i^n) = \sum_i \partial_{ji}^{n+1}\, \alpha_i = 0.$

Hence $f$, considered only in $\partial\sigma_j^{n+1}$, is homotopic to zero (II, Theorem 1), and $f$
may be extended over $\sigma_j^{n+1}$ (see §4). Thus we extend $f$ throughout $K$.

Now suppose $f$ is extended throughout $K$. By Lemma 2, we deform $f$ into
a normal map, leaving $(K')^{n-1}$, and hence also $K'$, fixed. Call the new map $f$
again, and define the $\alpha_i$ and $D$ by (7.2). Then $D'$ is part of $D$. By §4, $f$, in
each $\partial\sigma_j^{n+1}$, is homotopic to zero; hence (7.3) holds, and $D$ is a cocycle.

Remark. If $f$ is any map of $K'$ into $S_0^n$, we may deform it into a normal map
$\phi$, by Lemma 2. From Lemma 1, it is apparent that $f$ can be extended over $K$
if and only if $\phi$ can be. Define $D'$ by (7.1). By Theorem 2, $\delta D'$ has zero coefficients
over cells of $K'$, and is therefore a chain, which is clearly a cocycle, of
$K'' = K - K'$. By Theorem 3, Remark, if $f$ is also deformed into the normal
map $\psi$, defining the chain $C'$ of $K'$, then $C' \sim D'$ in $K'$, and hence for some
$H$ in $K'$,

$$C' - D' = (\delta H)' = \delta H - (\delta H)''.$$

Therefore $\delta C' - \delta D' = -\delta[(\delta H)'']$, which lies in $K''$. Thus the cohomology class
in $K''$ of $\delta D'$ is uniquely determined by $f$, and we have (using Theorem 2): $f$ may
be extended over $K$ if and only if its cohomology class thus defined in $K''$ is $\sim 0$
in $K''$.




8. The classes of maps of $K^n$ into $S_0^n$. If we put two maps of $K^n$ into $S_0^n$
into the same class if they are homotopic, the maps fall into classes, the homotopy
classes. To any normal map $f$ of $K^n$ into $S_0^n$ we let correspond a chain $C_f$ as
in (7.1).

Theorem 3. The normal maps $\phi$ and $\psi$ of $K = K^n$ into $S_0^n$ are homotopic if
and only if $C_\phi \sim C_\psi$.

Set $\Phi(p \times 0) = \phi(p)$, $\Phi(p \times 1) = \psi(p)$; then $\phi$ is homotopic to $\psi$ if and only
if $\Phi$ may be extended through $K \times I$ (see §4). If $D_0$ and $D_1$ correspond to
$C_\phi$ and $C_\psi$ in $K \times 0$ and $K \times 1$, Theorem 2 shows that this is possible if and
only if $D' = D_0 + D_1$ is part of a cocycle in $K \times I$. By Theorem 1, this is true
if and only if $C_\phi \sim C_\psi$.

Remark. If $K$ is of any dimension and $\phi$ and $\psi$ are homotopic, then $C_\phi$ and
$C_\psi$ are cocycles and $C_\phi \sim C_\psi$. The first statement follows from Theorem 2;
the second follows on considering $\phi$ and $\psi$ in $K^n$ alone.

Theorem 4. The classes of maps of $K^n$ into $S_0^n$ are in (1-1) correspondence
with the elements of the $n$-th cohomology group of $K$ with integer coefficients. The
correspondence is given by deforming the map $f$ into a normal one and taking the
cohomology class of the resulting cocycle. In particular, $f$ is homotopic to zero if
and only if the corresponding cohomology class is zero.

The deformation is possible, by Lemma 2. The cohomology class is uniquely
determined by $f$, and non-homotopic maps determine different classes, by
Theorem 3. Finally, to each cohomology class corresponds a map; we take a
cocycle $C$ of the class, and let $f$ map each $\sigma^n$ normally into $S_0^n$ with the degree
equal to its coefficient in $C$ (see II, Theorem 2).
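A minimal worked case may help fix ideas. The Python sketch below is an illustration only: it assumes the hypothetical complex $K$ = boundary of a triangle ($n = 1$), with oriented edges $v_0v_1$, $v_1v_2$, $v_0v_2$, and a 1-chain stored as a triple of integer degrees on those edges. Since $K$ has no 2-cells, every 1-chain is a cocycle, and the test of Theorem 3 reduces to asking whether the difference of two chains is the coboundary of a 0-chain.

```python
def coboundary0(h):
    """Coboundary of the 0-chain h = (h0, h1, h2), as coefficients on v0v1, v1v2, v0v2."""
    h0, h1, h2 = h
    return (h1 - h0, h2 - h1, h2 - h0)


def class_invariant(c):
    """Signed sum of c around the circuit v0 -> v1 -> v2 -> v0 (v0v2 is traversed backwards)."""
    return c[0] + c[1] - c[2]


def cobounding_chain(c, d):
    """Return a 0-chain h with c - d = coboundary0(h), or None if c and d are not cohomologous."""
    diff = tuple(x - y for x, y in zip(c, d))
    if class_invariant(diff) != 0:
        return None
    # Choose h0 = 0 and integrate the difference along v0 -> v1 -> v2.
    h = (0, diff[0], diff[0] + diff[1])
    assert coboundary0(h) == diff
    return h
```

In this example two assignments of degrees to the three edges are cohomologous exactly when they have the same signed total around the circuit, so the classes are indexed by a single integer, in agreement with the first cohomology group of a circle.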


9. The Theorem of Hurewicz. Let $Q_0$ be a fixed point of a space $S$. Then
the classes of maps of $S_0^r$ into $S$ for which $P_0$ goes into $Q_0$ form an abelian group,
the $r$-th homotopy group of $S$.⁹ If $f$ maps $\sigma^n$ [or $S_0^n$] into $S$, and $f(p) = Q_0$ in
$\partial\sigma^n$ [$f(P_0) = Q_0$], we may call the corresponding homotopy element the degree
$d_f(\sigma^n)$ [$d_f(S_0^n)$] of $f$. (If $S = S_0^n$, the $n$-th homotopy group is the group of integers,
as was seen in II, so that this is a natural generalization of the term degree.)
The fundamental formula (3.1) holds still. The theorems of the preceding
paper become matters of definition. The proofs in the present paper hold without
change, and we have a new version of the Theorem of Hurewicz:

Theorem 5. Theorems 2, 3 and 4 hold if we replace $S_0^n$ by any locally contractible
space $S$ whose $r$-th homotopy groups vanish for $r < n$, and replace the
integers by the $n$-th homotopy group of $S$ as coefficient group in the chains and
cohomology classes.

Hurewicz also shows that in the above space $S$ the $n$-th homotopy group is
the same as the $n$-th homology group with integer coefficients.


HARVARD UNIVERSITY. 


9 See Hurewicz, loc. cit. We assume a knowledge of the fundamental properties of
homotopy groups.











ON THE EXTREME POINTS OF CONVEX SETS 
By G. BALEY PRICE


Introduction. A convex set is a set such that if it contains two points, it 
contains the segment joining these points [1, p. 2, and 2. Numbers in square 
brackets refer to the bibliography at the end]. Minkowski defined certain 
points of convex sets which he called extreme points [1, pp. 15-16; 3, p. 157]. 
They are related to certain other points which are here called extreme points 
in the sense of distance to distinguish them from the former, which are called 
extreme points in the sense of Minkowski. A detailed study is made of these 
two types of extreme points of convex sets in abstract normed linear spaces. 

In the first place, it is necessary to distinguish two types of normed linear 
spaces on the basis of the convexity properties of spherical neighborhoods (§1). 
A normed linear space such that the segment joining any two points of a spherical 
neighborhood is interior to the neighborhood except at most for the given points 
themselves is called a space L*. All other normed linear spaces are classed 
together and denoted by L. The study of extreme points is far simpler in spaces 
L* than in spaces L, and the results are more complete. An example considered
in §10 shows that the property of being a space L* may depend on the
properties of the distance function alone and not on the linearity properties 
of the space. 

In §2 the existence of extreme points is considered. An approximation 
theorem first proved by Minkowski for euclidean 3-space is extended to spaces 
L* and L in §3. Two kinds of convex sets are distinguished in §4 on the basis
of the relation of the two kinds of extreme points, and it is shown that the set 
of extreme points in the sense of Minkowski may be either closed or not closed. 
In §5 the closed convex hull of a given set is considered, and Minkowski’s Approximation
Theorem (§3) is extended.

A general theorem on compact sets is established in §6. It is shown that in a 
complete metric space a set is compact if it is possible to approximate uniformly to 
it by means of closed compact sets. This theorem and Minkowski’s Approximation
Theorem (§5) enable us to show in §7 that the closed convex hull of a
compact set in a Banach space is compact. 

The significance of Minkowski’s Approximation Theorem is considered briefly 
in §8. A series of theorems is given in §9 which establish more precisely the 
relation between a convex set and its extreme points. Some examples are 
considered in §10. 

The paper may be considered a study in the geometry of abstract space. 


1. Linear spaces and extreme points. A space which is linear and normed
will be designated by L, its elements or points represented by $x, y, \cdots$, and


Received September 10, 1936. 


the distance between $x$ and $y$ by $\|x - y\|$. Any spherical neighborhood
$\|x - x_0\| \le r$ is convex, since the distance function satisfies the triangle inequality.
If the space L has the additional property that no point of the
boundary $\|x - x_0\| = r$ of the sphere $\|x - x_0\| \le r$ is an interior point of a
segment joining two other points of the sphere, it will be designated by L* and
called a space L with non-flat spherical boundaries. It can be shown at once
that no point of the boundary of a sphere is an interior point of a segment joining
two interior points of the sphere.

Any euclidean space of $n$ dimensions is a space L*. A sufficient condition
that a space L be a space L* is that the distance function have the properties
of the distance function in Hilbert space.

There are spaces L which are not L*; for example, the space of continuous
functions $f(s)$, $a \le s \le b$, in which the distance between $f(s)$ and $g(s)$ is max
$|f(s) - g(s)|$ on $a \le s \le b$. For consider the functions $f(s)$ such that $|f(s)| \le
M$. They form a spherical neighborhood, and two functions $f(s)$, $g(s)$ in it such
that $f(a) = g(a) = M$ are on its boundary. But all the functions $\theta f(s) + (1 - \theta)
g(s)$, $0 \le \theta \le 1$, are also contained in the boundary. This space has flat spherical
boundaries, i.e., they contain segments.
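The contrast between flat and non-flat spherical boundaries can be seen numerically. The sketch below is an illustration only: functions are replaced by hypothetical three-point samples, the maximum norm stands in for the distance of the example above, and the euclidean norm stands in for a space L*.

```python
import math


def sup_norm(v):
    """Maximum norm of a sampled function."""
    return max(abs(t) for t in v)


def euclid_norm(v):
    """Euclidean norm of the same samples."""
    return math.sqrt(sum(t * t for t in v))


def segment_point(f, g, theta):
    """The point theta*f + (1 - theta)*g of the segment joining f and g."""
    return [theta * a + (1 - theta) * b for a, b in zip(f, g)]


# Two sampled functions with f(a) = g(a) = M = 1 and |f|, |g| <= 1 elsewhere.
f = [1.0, 0.5, -0.2]
g = [1.0, -0.3, 0.8]

# Maximum norm: every point of the segment stays on the unit sphere.
sup_values = [sup_norm(segment_point(f, g, k / 10)) for k in range(11)]

# Euclidean norm: rescale f and g to the unit sphere and look at the midpoint.
u = [t / euclid_norm(f) for t in f]
v = [t / euclid_norm(g) for t in g]
midpoint_norm = euclid_norm(segment_point(u, v, 0.5))
```

Every entry of `sup_values` equals 1 up to rounding, so the whole segment lies in the boundary, while `midpoint_norm` is strictly less than 1, so the euclidean midpoint falls inside the sphere.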

We shall now define two kinds of extreme points of convex sets. 

(1) An extreme point in the sense of Minkowski [3, p. 157] of a convex set C
is a point $x_M$ which is not an interior point of any segment joining two points of C.

(2) An extreme point in the sense of distance with respect to a point $y$ of a convex
set C is a point $x_D(y)$ of C whose distance from $y$ is a maximum.

As a consequence of the definitions which we have made, we have at once
the following theorem.

Theorem 1.1. In a space L* every point of the boundary of a sphere is an extreme
point in the sense of Minkowski.

Notation. In the future C will designate a set which is closed, compact, and
convex. The set of extreme points $x_M$ of C will be designated by $E_M$, and the
set of extreme points $x_D$ of C with respect to the points of a set S by $E_D(S)$.


2. Existence of extreme points. We proceed to establish the existence of the 
extreme points which we have defined. 

Theorem 2.1. There exists at least one extreme point $x_D(y)$ of any set C with
respect to each point $y$.

This theorem is obvious, for the distance from $y$ to a point $x$ of C is a continuous
function of $x$, defined for $x$ on a closed, compact set C.

Theorem 2.2. The set of extreme points $E_D(y)$ of C for a fixed $y$ is compact
and closed.

Since $E_D(y)$ is a subset of C, it is compact. That it is closed follows from
the remarks made in proof of the last theorem.

Remark. Every set C contains at least one pair of points $x_1$, $x_2$ such that
$E_D(x_1)$ contains $x_2$ and $E_D(x_2)$ contains $x_1$. The segment joining such a pair
of points is a diameter of the set.
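For a finite point set in the plane the Remark can be checked directly. The sketch below is an illustration with hypothetical names; the finite list stands in for the set C (for a polygon, the maximum distance from any point is attained at a vertex, so the vertices suffice), and a pair at the greatest mutual distance is one pair of the kind described.

```python
import math
from itertools import combinations


def farthest_points(points, y, tol=1e-12):
    """Points of the list at maximum distance from y: a finite stand-in for E_D(y)."""
    best = max(math.dist(p, y) for p in points)
    return [p for p in points if math.dist(p, y) >= best - tol]


def diametral_pair(points):
    """A pair of points of the list at the greatest mutual distance."""
    return max(combinations(points, 2), key=lambda pq: math.dist(*pq))


# Hypothetical sample set.
sample = [(0.0, 0.0), (4.0, 0.0), (1.0, 3.0), (2.0, 1.0)]
x1, x2 = diametral_pair(sample)
```

For this sample the pair found is (4, 0) and (1, 3), and each point of the pair belongs to the farthest-point set of the other.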










Theorem 2.3. In a space L* any point $x_D$ of $E_D(S)$ is also a point $x_M$ of $E_M$.

The point $x_D$ is related to a point $y$ of S in such a way that the sphere with
center $y$ and radius $\|x_D - y\|$ has C in its interior except for certain points on
its boundary. From the definition of a space L* it follows that $x_D$ belongs to $E_M$.

This theorem together with Theorem 2.1 proves that the set $E_M$ of a set C
in a space L* is not empty.


3. Minkowski’s Approximation Theorem. In this section we shall establish 
for spaces L* a theorem first proved by Minkowski [3, p. 160] for euclidean 
3-space, and we shall give its generalization in spaces L. 

The convex hull of a finite set of points $x_1, \cdots, x_n$ will be called a polyhedron
$P_n$. Its set of extreme points $E_M$ is composed of the points themselves
or a subset of them.

Lemma 3.1. Let a set C in L be given and also the parallel set C′ at distance
$d > 0$. No segment which joins a point of the boundary of C′ to an interior point
of C′ contains a second boundary point of C′.

The parallel set C′ at distance $d$ from C may be defined as the set composed
of all points whose distance from C is equal to or less than $d$ (in this case the
notation does not imply that C′ is compact). Let $x_1$ be any point of the boundary
of C′, and $x_2$ any interior point. Then there exist points $y_1$ and $y_2$ of C
such that $\|x_1 - y_1\| = d$, $\|x_2 - y_2\| < d$. The distance from $\theta x_1 + (1 - \theta)x_2$
to C is not greater than its distance to $\theta y_1 + (1 - \theta)y_2$. But

$$\|\{\theta x_1 + (1 - \theta)x_2\} - \{\theta y_1 + (1 - \theta)y_2\}\| \le \theta\,\|x_1 - y_1\| + (1 - \theta)\,\|x_2 - y_2\| < d$$

for $0 \le \theta < 1$. Thus the only point of the segment on the boundary of C′
is $x_1$, and the lemma is established.

Theorem 3.1. In a space L* let any set C with extreme points $E_M$ be given and
any $\epsilon > 0$. Then there exist an $N(\epsilon)$ and a sequence of polyhedra $P_1, P_2, \cdots$ with
$P_1 \subset P_2 \subset \cdots$, and with all the extreme points in the sense of Minkowski of each
one contained in $E_M$, such that for $n \ge N(\epsilon)$ the distance from any point of C to
$P_n$ is less than $\epsilon$.

Let $D(x, P_n)$ denote the distance from $x$ to $P_n$; it is a continuous function of
$x$, for $|D(x, P_n) - D(y, P_n)| \le \|x - y\|$.

Let $x_1$ be any point of $E_M$, and let $x_2$ be any point of the set $E_D(x_1)$; at least
one exists by Theorem 2.1. Consider $D(x, P_2)$ for $x$ on C. It is a continuous
function which is defined on a closed, compact set, and which vanishes on $P_2$.
Unless C is identical with $P_2$, there is a set of points $X_3 \subset$ C, but $X_3 \not\subset P_2$,
at which it takes on its maximum value $d_3 > 0$. The set $X_3$ is closed and compact
and lies on the boundary of $S_2$, the parallel set to $P_2$ at the distance $d_3$,
and we shall show that it contains at least one point $x_3$ of $E_M$. For let $x_3$ be a
point of $X_3$ whose distance from $x_2$ is maximum; such a one exists and $\|x_3 - x_2\|
> 0$. Now $x_3$ is an end point of any segment of C which contains $x_3$ and a point
of C interior to $S_2$ by Lemma 3.1; all points of C are interior to $S_2$ except those





















of $X_3$, which are on its boundary. Also $x_3$ cannot be an interior point of a
segment which contains only points of $X_3$, since these points lie in the interior or
on the boundary of the sphere $\|x - x_2\| \le \|x_3 - x_2\|$. We make use of the
fact here that spherical boundaries in a space L* are non-flat. It follows from
the arguments given that $x_3 \in E_M$.

We observe next that $D(x_2, P_1) \ge D(x_3, P_2)$, for $D(x_3, P_1) \ge D(x_3, P_2)$,
since $P_1 \subset P_2$, and $D(x_2, P_1) \ge D(x_3, P_1)$, since $x_2$ is a point of $E_D(x_1)$.

Suppose now that this process has been repeated until $n$ points $x_1, \cdots, x_n$
of $E_M$ have been obtained with $D(x_2, P_1) \ge D(x_3, P_2) \ge \cdots \ge D(x_n, P_{n-1})$.
Then it can be repeated to obtain a point $x_{n+1}$ of $E_M$ unless C is identical with
$P_n$. For since C contains points distinct from $P_n$, $D(x, P_n)$ for $x$ on C takes
on its maximum value $d_{n+1} > 0$ at a set of points $X_{n+1} \subset$ C, but $X_{n+1} \not\subset P_n$.
The set $X_{n+1}$ is closed and compact and lies on the boundary of the parallel
set $S_n$ to $P_n$ at the distance $d_{n+1}$. The same arguments that were used before
will show that a point $x_{n+1}$ of $X_{n+1}$ whose distance from $x_n$ is a maximum belongs
to $E_M$.

Also $D(x_n, P_{n-1}) \ge D(x_{n+1}, P_n)$, for $D(x_{n+1}, P_{n-1}) \ge D(x_{n+1}, P_n)$ since
$P_{n-1} \subset P_n$, and $D(x_n, P_{n-1}) \ge D(x_{n+1}, P_{n-1})$ because of the definition of $x_n$.

Thus either there exists a value of $n$ such that C is identical with $P_n$, or there
is an infinite sequence of points $x_1, x_2, \cdots$ of $E_M$ with the corresponding polyhedra
$P_1, P_2, \cdots$ and

(3.1) $D(x_2, P_1) \ge D(x_3, P_2) \ge \cdots \ge 0.$

Also it is clear that $D(x_n, P_k) \ge D(x_n, P_{n-1})$ for $k = 1, \cdots, n - 1$ (since
$P_k \subset P_{n-1}$), from which it follows that

(3.2) $\|x_n - x_k\| \ge D(x_n, P_{n-1}) \qquad (k = 1, \cdots, n - 1;\ n = 2, 3, \cdots).$

The proof will be complete if we can show that the sequence of numbers in
(3.1) approaches zero. But if they do not, they have a limit $\delta > 0$ and

(3.3) $D(x_n, P_{n-1}) \ge \delta$

for all values of $n$. Also the sequence $x_1, x_2, \cdots$ has at least one limit point,
since its points belong to the compact set C. Then it is possible to select a sub-sequence
which approaches this limit. But no such sub-sequence can approach
a limit, because (3.3) and (3.2) show that the distance from any point of it to
all the preceding is not less than $\delta$. Thus the assumption has led to a contradiction,
and the sequence of numbers in (3.1) approaches zero. The proof of the
theorem is complete.
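The construction in this proof is a greedy one: each new vertex is a point of C farthest from the polyhedron built so far. The Python sketch below illustrates it in the euclidean plane under simplifying assumptions that are not in the paper: C is replaced by a finite list of candidate points (here the vertices of a regular 12-gon), any maximizer of the distance is accepted rather than the particular one farthest from the previous vertex, and all function names are hypothetical.

```python
import math


def cross(o, a, b):
    """Twice the signed area of the triangle o, a, b."""
    return (a[0] - o[0]) * (b[1] - o[1]) - (a[1] - o[1]) * (b[0] - o[0])


def convex_hull(pts):
    """Vertices of the convex hull in counter-clockwise order (monotone chain)."""
    pts = sorted(set(pts))
    if len(pts) <= 2:
        return pts
    lower, upper = [], []
    for p in pts:
        while len(lower) >= 2 and cross(lower[-2], lower[-1], p) <= 0:
            lower.pop()
        lower.append(p)
    for p in reversed(pts):
        while len(upper) >= 2 and cross(upper[-2], upper[-1], p) <= 0:
            upper.pop()
        upper.append(p)
    return lower[:-1] + upper[:-1]


def dist_to_segment(p, a, b):
    """Distance from p to the segment ab."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    length2 = dx * dx + dy * dy
    if length2 == 0:
        return math.dist(p, a)
    t = max(0.0, min(1.0, ((p[0] - a[0]) * dx + (p[1] - a[1]) * dy) / length2))
    return math.dist(p, (a[0] + t * dx, a[1] + t * dy))


def dist_to_hull(p, pts):
    """D(p, P): distance from p to the convex hull of pts."""
    hull = convex_hull(pts)
    if len(hull) == 1:
        return math.dist(p, hull[0])
    if len(hull) == 2:
        return dist_to_segment(p, hull[0], hull[1])
    edges = list(zip(hull, hull[1:] + hull[:1]))
    if all(cross(a, b, p) >= 0 for a, b in edges):
        return 0.0  # p lies in the polygon
    return min(dist_to_segment(p, a, b) for a, b in edges)


def greedy_polyhedra(candidates, start, tol=1e-12):
    """Return the chosen points x_1, x_2, ... and the gaps D(x_{n+1}, P_n)."""
    chosen, gaps = [start], []
    for _ in range(len(candidates)):
        x = max(candidates, key=lambda q: dist_to_hull(q, chosen))
        d = dist_to_hull(x, chosen)
        if d <= tol:
            break  # every candidate already lies in the current polyhedron
        chosen.append(x)
        gaps.append(d)
    return chosen, gaps


# Hypothetical test set: the vertices of a regular 12-gon on the unit circle.
ngon = [(math.cos(2 * math.pi * k / 12), math.sin(2 * math.pi * k / 12)) for k in range(12)]
points, gaps = greedy_polyhedra(ngon, ngon[0])
```

On this example the first gap is the diameter 2, the gaps never increase, which is the chain of inequalities (3.1), and the process stops once all twelve vertices have been chosen.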

An examination of the proof just given will show that the approximating
properties of the polyhedra $P_1, P_2, \cdots$ are in no way dependent on the fact
that the points $x_1, x_2, \cdots$ belong to $E_M$. The hypothesis that C is in L* was
used only to establish the fact that $x_1, x_2, \cdots$ belong to $E_M$. It will be seen
at once that this proof establishes also the following theorem.

Theorem 3.2. In a space L let any set C and any $\epsilon > 0$ be given. Then there










exist an $N(\epsilon)$ and a sequence of polyhedra $P_1, P_2, \cdots$ with $P_1 \subset P_2 \subset \cdots$,
all of which are contained in C, such that for $n \ge N(\epsilon)$ the distance from any point
of C to $P_n$ is equal to or less than $\epsilon$.

Remark 3.1. The set C is the limit of the sequence of polyhedra P;, Ps, - - - 
in two senses: (a) take all points x such that z e P,, for n 2 N(x) and close this 
set by adding its limit points; the set obtained is C; (b) P,, — C according to the 
definition of Blaschke (see Theorem 9.3 below and [2, p. 60)). 

Remark 3.2. By means of the approximating polyhedra, we can set up a 
denumerable set of points everywhere dense in C; hence C is separable (other 
proofs are known). An examination of Carathéodory’s proof of Blaschke’s 
Auswahlsatz shows that it is valid in any separable set; hence the validity of the 
Weierstrass-Bolzano Cluster Point Theorem for points in a closed convex set 
implies the validity of the same theorem for closed convex sets [2, pp. 62-66]. 

Remark 3.3. Let S; and Se: be two closed, compact sets with convex hulls 
H,, Hy. If the distance from any point of S; to S: is at most d, then the dis- 
tance from any point of H, to He is at most d, for the parallel set to H2 at the 
distance d obviously includes S,. 

Remark 3.4. The convergence to zero of the sequence of numbers in (3.1)
is a necessary condition that the limit of the sequence of polyhedra be a set C,
but it is easy to give an example with linear sets to show that it is not sufficient.
We consider here a condition that is sufficient. Let a set of points $x(1), x(2), \cdots$
be given; from it form a new sequence $x(k_1) = x(1)$, $x(k_2) = x(2)$, and in general
$x(k_n)$ equal to the first element of the original sequence not contained in the
convex hull of those that precede it. Let $P_n$ be the convex hull of the first
n points of the new sequence. A sufficient condition that the limit of the
$P_1, P_2, \cdots$ be a set C, the limit being taken as in Remark 3.1 (a) above, is
that the points $x(k_1), x(k_2), \cdots$ belong to some set C. This condition is
obviously necessary.
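
The selection rule of Remark 3.4 can be illustrated for linear sets, where the convex hull of finitely many points is simply an interval. The following is a minimal sketch under that assumption; the function names `select_new_points` and `nested_hulls` are hypothetical and are not part of the paper, and the input points are assumed to be real numbers with x(1) different from x(2).

```python
def select_new_points(points):
    """Subsequence x(k_1), x(k_2), ... of Remark 3.4 for points on a line.

    A point is kept when it lies outside the convex hull (an interval)
    of the points that precede it in the original sequence.
    """
    selected = []
    lo = hi = None                      # current hull [lo, hi]; None while empty
    for x in points:
        if lo is None or x < lo or x > hi:
            selected.append(x)
            lo = x if lo is None else min(lo, x)
            hi = x if hi is None else max(hi, x)
    return selected


def nested_hulls(selected):
    """P_n = convex hull of the first n selected points, given as (lo, hi)."""
    return [(min(selected[:n]), max(selected[:n]))
            for n in range(1, len(selected) + 1)]


sequence = [0, 1, 0.5, 2, 1.5, -1, 3]
new_sequence = select_new_points(sequence)
hulls = nested_hulls(new_sequence)
```

In this one-dimensional setting the hulls form an increasing chain of intervals, which mirrors the inclusions $P_1 \subset P_2 \subset \cdots$ used throughout the section.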


4. The two kinds of sets C. As shown by Theorem 2.3, some of the points 
which are extreme in the sense of distance are also extreme in the sense of Min- 
kowski. On the basis of this relationship we are able to distinguish two kinds 
of sets C. 

THEOREM 4.1. The set $E_D(S)$ of a set C in a space L is closed if S is compact
and closed.

Let x be any limit point of $E_D(S)$. Then there is a sequence $x_1, x_2, \cdots$,
$x_k \in E_D(S)$, whose limit is x. We shall prove that $x \in E_D(S)$.

By hypothesis there is a point $y_k \in S$ such that $x_k \in E_D(y_k)$, $k = 1, 2, \cdots$.
Since S is compact and closed by hypothesis, we can select a sub-sequence from
the $y_k$ which has a limit y, $y \in S$. Then the limit of the sequence of the
corresponding $x_k$ is x, and we may suppose that this selection has been made
beforehand so that $y_1, y_2, \cdots$ approaches y.

Suppose that x is not a point of C at maximum distance from y. Then there 
is a point z, distinct from x, which has this property, and 




$$\| z - y \| = \| x - y \| + d, \qquad d > 0.$$

Furthermore

$$\| y_n - z \| \geq \| y - z \| - \| y_n - y \| \geq \| x - y \| + d - \| y_n - y \|. \tag{4.1}$$

But

$$\| x_n - y_n \| \leq \| x - y \| + \| x - x_n \| + \| y - y_n \|. \tag{4.2}$$

Since $x_n \to x$, $y_n \to y$, there exists an n sufficiently large, say N, so that

$$\| x - x_N \| < d/4, \qquad \| y - y_N \| < d/4.$$

Then from (4.1) and (4.2) we have

$$\| y_N - z \| \geq \| x - y \| + 3d/4,$$
$$\| x_N - y_N \| \leq \| x - y \| + d/2.$$

These two inequalities contradict the assumption that $x_N \in E_D(y_N)$; hence,
there exists no point z of C whose distance from y is greater than that of x.
Then $x \in E_D(S)$. The proof is complete.

Definition 4.1. In a space L*, if there exists a closed compact set S such that the
set $E_M$ of a set C is identical with $E_D(S)$, we say C is a set Cs. In all other cases,
C is called a set Cx.

As a result of this definition and Theorem 4.1, we have at once the following 
theorem. 

THEOREM 4.2. In a space L* a sufficient condition that the set $E_M$ of C be closed
is that C be a set Cs.

THEOREM 4.3. In spaces L* there exist both sets Cs and Cx. 

Any set with a finite number of extreme points $E_M$, such as a polyhedron, is a
set Cs. In particular, every linear set C is a set Cs, the two end points being
extreme points in both senses. The sphere $\| x - x_0 \| \leq r$ is a set Cs also. Its
boundary points $\| x - x_0 \| = r$ form $E_M$ (see Theorem 1.1), and S may be
taken as the single point $x_0$.
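
For a polygon in the euclidean plane the sets $E_D(y)$ can be computed directly, which gives a small illustration of a set Cs. The sketch below assumes that it suffices to search the vertices, since the distance from a fixed point is a convex function and so attains its maximum over a polygon at a vertex; the helper names `farthest_points` and `extreme_in_distance` are hypothetical.

```python
import math


def farthest_points(candidates, y, tol=1e-12):
    """Points of `candidates` at maximum euclidean distance from y, i.e. E_D(y)."""
    dists = [math.dist(p, y) for p in candidates]
    dmax = max(dists)
    return {p for p, d in zip(candidates, dists) if dmax - d <= tol}


def extreme_in_distance(candidates, S):
    """Union of E_D(y) over all y in the finite set S, i.e. E_D(S)."""
    result = set()
    for y in S:
        result |= farthest_points(candidates, y)
    return result


# Vertices of the unit square; these are its extreme points in the sense of Minkowski.
square = [(0.0, 0.0), (1.0, 0.0), (1.0, 1.0), (0.0, 1.0)]

from_corner = extreme_in_distance(square, [(0.0, 0.0)])
from_all_corners = extreme_in_distance(square, square)
```

Taking S to be the four vertices themselves recovers every vertex, so for the square the set $E_M$ coincides with $E_D(S)$ for a closed compact S, as Definition 4.1 requires.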

We shall now give an example of a set Cx. On a circle mark a point x, and
let $x_1$ and $x_2$ be the end points of a segment which has x for its mid-point and is
perpendicular to the plane of the circle. Let $x_3$ be the point on the circle
diametrically opposite x; let $x_4$ be one of the points on the circle which bisect
the arcs joining $x_3$ and x. In general, let $x_k$, $k = 5, 6, \cdots$, be the point of the
circle which bisects the arc joining $x_{k-1}$ and x. Then the sequence of points
$x_1, x_2, \cdots$ has x as its single limit point.

Let $P_n$ denote the convex hull of $x_1, \cdots, x_n$. Then consider the set C of
points p such that either (a) $p \in P_n$ for $n \geq N(p)$, or (b) p is a limit point of the
points (a). Then C is closed and compact, lies in a space L* (see §1), and it is
convex also (see Remark 3.4). The points $x_1, x_2, \cdots$ are the extreme points
$E_M$ of C, but $E_M$ is not closed, since their limit point x is not an extreme point.
Hence, C is a set Cx, for if it were a set Cs, its extreme points $E_M$ would form
a closed set according to Theorem 4.2.

The reader will readily construct plane sets C with points of $E_M$ which are
not extreme points in the sense of distance. Thus not every plane set C is a
set Cs. In spite of this fact, the following theorem is easily proved.

THEOREM 4.4. The set $E_M$ of every plane set C is closed.

Let $p_1, p_2, \cdots$, $p_n \in E_M \subset C$, have the limit point p. Then the points $p_n$ and
p belong to the boundary of C. Let l be the supporting line of C through p.
Then C lies entirely on one side of l. If p is the only point of l in C, we see at
once that $p \in E_M$, and the theorem is true in this case. If the theorem is false,
the only possibility is that p is an interior point of a segment of l which is
contained in C. But in this case p would not be a limit point of points of $E_M$,
contrary to hypothesis. Thus the theorem is true in all cases.


5. On the convex hull of a set. We shall consider now the definition of the 
closed convex hull of a set S, and also two methods of constructing it. 

Definition 5.1. The closed convex hull of a set S is the set contained in all closed 
convex sets which contain S. 

The justification of this definition is the fact that the product of any number 
of closed convex sets is a closed convex set. Its weakness is that it does not 
enable us to establish directly many properties of the closed convex hull of S. 

Definition 5.2. The convex hull [4, p. 359] of S is the set of points

$$r_1 x_1 + \cdots + r_n x_n,$$

where $x_i \in S$, $r_i \geq 0$, and $r_1 + \cdots + r_n = 1$.
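
As a small numerical illustration of Definition 5.2, the following sketch forms a point $r_1 x_1 + \cdots + r_n x_n$ from points given by coordinates. It assumes points in a finite-dimensional coordinate space and exact rational weights; the function name `convex_combination` is hypothetical.

```python
from fractions import Fraction


def convex_combination(points, weights):
    """Return r_1 x_1 + ... + r_n x_n for weights r_i >= 0 with sum 1."""
    if not points or len(points) != len(weights):
        raise ValueError("need one weight per point")
    if any(w < 0 for w in weights) or sum(weights) != 1:
        raise ValueError("weights must be non-negative and sum to 1")
    dim = len(points[0])
    return tuple(sum(w * p[j] for w, p in zip(weights, points))
                 for j in range(dim))


triangle = [(0, 0), (2, 0), (0, 2)]
weights = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]
point = convex_combination(triangle, weights)
```

Exact fractions are used so that the condition $r_1 + \cdots + r_n = 1$ can be checked without rounding error.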

THEOREM 5.1. The closed convex hull of S is the closure of the convex hull of S.

Still another method can be used to construct the closed convex hull of S 
in the special case that S is compact and lies in L. 

Let $\bar{S}$ be the closure of S. Then $\bar{S}$ is also compact [5, p. 89]. Let x be any
point of L, and $x_1$ one of the points of $\bar{S}$ at maximum distance from x (it exists
since $\bar{S}$ is compact). Let $x_2$ be one of the points of $\bar{S}$ at maximum distance
from $x_1$. The convex hull of $x_1$, $x_2$ is a polyhedron $P_2$. We can continue
as in the proof of Minkowski's Approximation Theorem (the only difference is
that C is replaced by $\bar{S}$) and show that either $\bar{S}$ is contained in some polyhedron
$P_n$, or that there exists a denumerable sequence of polyhedra $P_1 \subset P_2 \subset \cdots$
with all the points $E_M$ of each one contained in $\bar{S}$. Let C be defined as the
closure of the set of points x such that $x \in P_n$ for $n \geq N(x)$. It will be shown
later that this set is compact; hence, the notation C is justified.

THEOREM 5.2. Let C and $P_1, P_2, \cdots$ be the sets whose construction we have
just explained. Then the closed convex hull of the compact set S in L is C.
Furthermore, given any $\epsilon > 0$, there exists an $N(\epsilon)$ such that $D(x, P_n) < \epsilon$ for $n \geq N(\epsilon)$
and every $x \in C$. Finally, the set $E_M$ of each polyhedron $P_n$ belongs to $\bar{S}$.

This theorem may be considered an extension of Minkowski's Approximation
Theorem. Not only can we approximate to the convex hull of S by polyhedra,
but we can do so by means of polyhedra whose sets $E_M$ belong to $\bar{S}$.


6. A theorem on compact sets. Let D(x, G) denote the distance from x
to the closed compact set G.

THEOREM 6.1. Given a sequence $G_1, G_2, \cdots$ of closed, compact sets in a metric
space, and a set G for which the axiom of completeness is satisfied in addition.
If for every $\epsilon > 0$ there exists an $N(\epsilon)$ such that $D(x, G_n) < \epsilon$ for $n \geq N(\epsilon)$ and
$x \in G$, then G is compact.

Since the sets $G_n$ are compact, they are totally bounded [8, p. 108]. This
fact and the hypotheses of the theorem show that G is totally bounded. Then
since G is complete by hypothesis, it is compact [8, pp. 107-108].

In the next section this theorem will be applied to sets in a Banach space. 


7. The convex hull of a compact set in a Banach space. In this section we 
shall use the results of the last two sections to establish the following theorem. 

THEOREM 7.1. The convex hull of a compact set S in a Banach space is itself 
compact. 

Since S is compact, we can use the method of Theorem 5.2 for constructing
its closed convex hull C. The polyhedra $P_n$ are closed and compact and satisfy
all the other hypotheses for the sets $G_n$ in Theorem 6.1. The set C satisfies
all the hypotheses for the set G in the same theorem. Thus Theorem 7.1
follows immediately from Theorems 5.2 and 6.1.

This theorem, although discovered independently by the author, was first 
proved by Mazur [6]. 


8. The significance of Minkowski’s Approximation Theorem. Let S be a 
compact set in a linear metric space L. Then it is well known that S is sepa- 
rable, i.e., that there exists a denumerable set of points which is everywhere 
dense in S. We are therefore led to inquire what additional information is 
given by Minkowski’s Approximation Theorem. The answer is the following 
(see Theorem 5.2): Let a compact set S in L be given. Then there exists a
denumerable set of points $x_1, x_2, \cdots$, $x_i \in \bar{S}$, such that for each $\epsilon > 0$ there
is an $N = N(\epsilon)$ for which $\| x - (r_1 x_1 + \cdots + r_N x_N) \| < \epsilon$, where $r_i \geq 0$,
$r_1 + \cdots + r_N = 1$, and x is any point in the closed convex hull of S.
Furthermore, the set of points $x_1, x_2, \cdots$ in general is not everywhere dense in S.

The set of points $x_1, x_2, \cdots$ is of course not unique. This fact, for example,
indicates clearly the great variety of sets of functions, linear combinations of 
which can be used to approximate to all the functions of a given set. 


9. Theorems on the set $E_M$. We shall now prove several theorems which
will establish more precisely the relation between C and $E_M$.

THEOREM 9.1. Let C in L* be given. Then the closed convex hull of $E_M$ is C.

Let the closed convex hull of $E_M$ be C'. Then since $E_M \subset C$, it is clear that
$C' \subset C$, and the proof will be complete if we can show that $C \subset C'$.

Suppose $C \not\subset C'$. Then D(x, C'), which is defined and continuous for x in the
closed, compact set C, takes on its maximum value d > 0 on a closed compact
set $X \subset C$, $X \not\subset C'$. Then the set X lies on the boundary of the parallel set
S to C' at the distance d. Let x be a point of X at maximum distance from some
point y of C'. Then by the arguments used in the proof of Theorem 3.1, we
can show that x is a point of $E_M$ of C. But this is impossible, because $E_M$ is
contained entirely in C'. As a result of this contradiction we conclude that
$C \subset C'$, and the proof is complete.

COROLLARY 9.1. A set C in L* is determined by any subset $E'_M$ of $E_M$ whose
closure contains $E_M$.


THEOREM 9.2. Let $x_1, \cdots, x_n$ be points of a set C in L and $r_1, \cdots, r_n$ numbers
greater than zero whose sum is 1, and let $x = r_1 x_1 + \cdots + r_n x_n$. Then $x \in C$,
but if n > 1, x is not a point of $E_M$.

If n > 1, then $0 < r_i < 1$, $i = 1, \cdots, n$, and we can find positive numbers
$\epsilon_1, \cdots, \epsilon_n$ such that $0 < r_i \pm \epsilon_i < 1$, and such that $\epsilon_n = \epsilon_1 + \cdots + \epsilon_{n-1}$.
Then x is the mid-point of the segment joining the points

$$(r_1 - \epsilon_1) x_1 + \cdots + (r_{n-1} - \epsilon_{n-1}) x_{n-1} + (r_n + \epsilon_n) x_n$$

and

$$(r_1 + \epsilon_1) x_1 + \cdots + (r_{n-1} + \epsilon_{n-1}) x_{n-1} + (r_n - \epsilon_n) x_n,$$

which belong to C, and by definition (1) of §1, x is not an extreme point.
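
The construction in this proof can be checked numerically. The sketch below is an illustration only: it makes one particular choice of the numbers $\epsilon_i$ (the proof merely requires that suitable ones exist), assumes points given by rational coordinates, and uses the hypothetical names `combo` and `midpoint_split`.

```python
from fractions import Fraction


def combo(points, weights):
    """Linear combination sum_i weights[i] * points[i], coordinate by coordinate."""
    dim = len(points[0])
    return tuple(sum(w * p[j] for w, p in zip(weights, points))
                 for j in range(dim))


def midpoint_split(points, r):
    """Return two points a, b of the convex hull whose mid-point is sum_i r_i x_i.

    Assumes n >= 2 and 0 < r_i < 1 with sum 1. The choice of eps below is one
    admissible choice: 0 < r_i - eps_i and r_i + eps_i < 1 for every i, and
    eps_n = eps_1 + ... + eps_(n-1).
    """
    n = len(points)
    margin = min(min(ri, 1 - ri) for ri in r)
    small = margin / (2 * (n - 1))
    eps = [small] * (n - 1) + [small * (n - 1)]
    minus = [ri - e for ri, e in zip(r[:-1], eps[:-1])] + [r[-1] + eps[-1]]
    plus = [ri + e for ri, e in zip(r[:-1], eps[:-1])] + [r[-1] - eps[-1]]
    return combo(points, minus), combo(points, plus)


pts = [(0, 0), (4, 0), (0, 4)]
r = [Fraction(1, 2), Fraction(1, 4), Fraction(1, 4)]
x = combo(pts, r)
a, b = midpoint_split(pts, r)
mid = tuple((s + t) / 2 for s, t in zip(a, b))
```

Both sets of modified weights are again positive and sum to 1, so a and b lie in the convex hull, and x is their mid-point.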

The following theorem is well known [7, §9, p. 200].

THEOREM 9.3. Let S be an arbitrary closed bounded set in euclidean N-space
and C its closed convex hull. If x is any point of C, then there exist points $x_i \in S$
and numbers $r_i$, $r_i > 0$, $i = 1, \cdots, n$, with $r_1 + \cdots + r_n = 1$, such that
$x = r_1 x_1 + \cdots + r_n x_n$, and $n \leq N + 1$.

THEOREM 9.4. Let S be an arbitrary closed bounded set in euclidean N-space,
and C its closed convex hull. Then $E_M \subset S \subset C$.

The proof follows at once from the two preceding theorems. Let x be a point
of C which is not a point of S. Then by Theorem 9.3, there are points $x_1, \cdots, x_n$
of S such that $x = r_1 x_1 + \cdots + r_n x_n$. Also n > 1, for otherwise x would
be a point of S contrary to hypothesis. Then by Theorem 9.2, x is not a point
of $E_M$. It follows that $E_M \subset S$, and the proof is complete.

THEOREM 9.5. In euclidean N-space let a set C with extreme points $E_M$ be given.
A necessary condition that the closed convex hull C' of $E'_M \subset E_M \subset C$ contain
$x_M$, $x_M \in E_M$, is that $x_M$ be a point or a limit point of $E'_M$.

To prove this theorem, we shall suppose that $x_M$ is not a point of $\bar{E}'_M$, and
show that it does not belong to C'. Suppose $x_M \in C'$. Then $x_M$ is not an
extreme point in the sense of Minkowski of C' because by Theorem 9.4 all such
points belong to $\bar{E}'_M$. Then it follows that $x_M$ is not an extreme point in the
sense of Minkowski of C, for $C' \subset C$. But this contradicts the hypothesis
that $x_M \in E_M \subset C$. Then if $x_M \in C'$, it follows that $x_M \in \bar{E}'_M$, and the proof is
complete.

Remark 9.1. It seems likely that Theorem 9.4 is true for sets C in spaces more 
general than euclidean N-space; hence, we consider the following propositions: 
















(A) Let S be an arbitrary closed compact set in L, and C its closed convex hull.
Then $E_M \subset S \subset C$.

(B) Let a set C in L with extreme points $E_M$ be given. A necessary condition
that the closed convex hull C' of $E'_M$, $E'_M \subset E_M \subset C$, contain $x_M$, $x_M \in E_M$, is that
$x_M$ be a point of the closure of $E'_M$.

An examination of the proof of Theorem 9.5 will show that the following 
corollary is true. 

COROLLARY 9.2. For every set C in L, (A) implies (B).

Throughout the remainder of this section, we shall treat sets C in L for which 
(B) holds, no assumption being made about (A). The theorems established 
will have content because of Theorem 9.5. Among other results, we shall obtain 
the following restricted converse of Corollary 9.2 (see Corollary 9.5 below): 
For every set C in L*, (B) implies (A). 

COROLLARY 9.3. If C is in L, and if (B), a necessary condition that the closed
convex hull of $E'_M \subset E_M \subset C$ be C is that the closure of $E'_M$ contain $E_M$.

COROLLARY 9.4. If C is in L*, and if (B), then the closure of the set of points
$x_1, x_2, \cdots$ obtained in the proof of Minkowski's Approximation Theorem 3.1
contains $E_M$.

THEOREM 9.6. If $C_1, C_2, \cdots$ is a sequence of closed, compact, convex sets in
L* which have the limit C in L* in the sense of Blaschke, if (B), and if $x_M \in E_M \subset C$,
then there exists a sequence of extreme points $x_M \in C_1$, $x_M \in C_2$, $\cdots$ whose limit
is $x_M$.

Let $v_n$ be the limit inferior of numbers $d_n$ such that the distance from any
point of C to $C_n$, and from any point of $C_n$ to C, is equal to or less than $d_n$.
From the hypothesis that $C_n \to C$ in the sense of Blaschke, it follows that $v_n \to 0$
as $n \to \infty$.

Suppose that the theorem is false. Then there exists an $\epsilon > 0$ such that
for every $N \geq 1$ there exists a set $C_k$ with $k \geq N$ which has no extreme point
in the neighborhood $\| x - x_M \| < \epsilon$.

Let S, $S_n$ be the sets of points x of C, $C_n$ such that $\| x - x_M \| \geq \epsilon/2$,
$\| x - x_M \| \geq \epsilon$ respectively. Let the closed convex hulls of S, $S_n$ be H, $H_n$. There
exists an n, say $N_1$, such that $v_n < \epsilon/2$ for $n \geq N_1$. Then we can assert that
the distance from any point of $S_n$ to S does not exceed $v_n$ at least for $n \geq N_1$.
Furthermore, by Remark 3.3 the distance from any point of $H_n$ to H does not
exceed $v_n$ for $n \geq N_1$.

From the hypothesis that proposition (B) holds, it follows that the distance
from $x_M$ to H is positive. Call this distance d. Then there exists an n, say
$n = N_2$, such that $v_n < d/2$ for $n \geq N_2$. Then for $n \geq N$, where N is the
larger of the integers $N_1$ and $N_2$, the distance from any point of $H_n$ to H is
less than d/2, and therefore the distance from $x_M$ to $H_n$ is greater than d/2.

Finally, from the assumption that the theorem is false as stated above it
follows that there exists a set $C_k$, with $k \geq N$, all of whose extreme points lie
in $S_k$. Then $H_k$ is identical with $C_k$, because $S_k$ contains only points of $C_k$,
and it contains all of its extreme points $E_M$ (see Theorem 9.1). Thus from the
statement made above we see that the distance from $x_M$ to $C_k$ is greater than
d/2. But this contradicts the fact that $v_k < d/2$. Thus the assumption that
the theorem is false has led to a contradiction, and the theorem is established.

Remark 9.2. In the euclidean plane let $C_n$ be the convex hull of the points
(0, 1 + 1/n), (0, -1 - 1/n), (1/n, 0), (-1/n, 0). The limit set C is the line
segment joining (0, 1) and (0, -1), and these two points are its only extreme
points. The point (0, 0) is also a limit point of points $x_M$ of the sets $C_n$. Thus
not every limit point of points $x_M$ of the sets $C_n$ is a point $x_M$ of C. For linear
sets, however, every limit point of extreme points is an extreme point of the
limit set C.

COROLLARY 9.5. If S is any compact set in a space L*, and if (B), then the
set $E_M$ of the closed convex hull C of S is contained in $\bar{S}$.

The proof of this corollary follows from Theorems 9.6 and 5.2, for it was shown
in the latter that it is possible to construct a sequence of polyhedra whose sets
$E_M$ belong to $\bar{S}$, and whose limit in the sense of Blaschke is C.

From this corollary we obtain the following result, stated above.

COROLLARY 9.6. For every set C in L*, (B) implies (A).


10. Some examples. Consider again the functions f(s) discussed in §1 as an
example of a space L which is not L*. The functions $| f(s) | \leq M$ form a closed,
not compact, convex set. Every function f(s) such that max $| f(s) | = M$ is an
extreme element in the sense of distance, but it can be shown easily that the
only extreme elements in the sense of Minkowski are the two functions
$f(s) = \pm M$. The closed convex hull of the set $E_M$ is not the given closed convex set
(see Theorem 9.1).

Let x be the real number triple $(x_1, x_2, x_3)$, and let $\| x \| = \max | x_i |$,
$i = 1, 2, 3$. Then the "spherical neighborhood" $\| x \| \leq r$ is the cube with vertices
at the points $(\pm r, \pm r, \pm r)$; it is a closed, compact, convex set. The space is a
space L but not L*. We thus have an example of a space which is an L* with
the usual euclidean distance function and not an L* with another one. In both
cases, however, the vertices of the cube form its set $E_M$, and their closed convex
hull is the cube itself. Thus the conditions in Theorem 9.1 are sufficient, but
not necessary. We see also that the extreme points in the sense of distance of
the closed convex hull of a set S need not belong to S (see Corollary 9.5).
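
The cube example can be verified with a few lines of arithmetic. The sketch below takes r = 1 and S to be the set of eight vertices (both choices are assumptions made for illustration, as are the helper names); it checks that a face centre is at maximum distance from the origin in the norm $\max | x_i |$ although it is the mid-point of two other points of the cube.

```python
from itertools import product


def max_norm(x):
    """The norm ||x|| = max |x_i| of a coordinate triple."""
    return max(abs(t) for t in x)


def midpoint(a, b):
    return tuple((s + t) / 2 for s, t in zip(a, b))


r = 1
vertices = list(product((-r, r), repeat=3))      # the set S: eight vertices

# Every point of the cube has all coordinates in [-r, r], so the largest
# distance from the origin over the cube is attained at the vertices.
radius = max(max_norm(v) for v in vertices)

face_point = (r, 0, 0)                           # centre of one face
is_farthest = max_norm(face_point) == radius
is_midpoint = midpoint((r, r, 0), (r, -r, 0)) == face_point
```

The face centre is thus extreme in the sense of distance (relative to the origin) without being a vertex, so it does not belong to S.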

So far we have characterized an extreme point $x_M$ by the original definition
in §1 and by Theorem 9.2. One might expect to characterize $x_M$ at least in
euclidean space also by some property of the supporting plane through the
point, but this seems impossible. Consider, for example, a plane set C whose
boundary contains two intersecting straight line segments and an arc of a circle
tangent to one of them. Then we see that there may be either one or many
supporting lines through a point $x_M$ which have a single point in common with
C, or a single one through $x_M$, which, however, has a segment in common with C.
If a supporting line contains only a single point of C, then that point is a point
$x_M$, however.










BIBLIOGRAPHY 


1. BONNESEN AND FENCHEL, Theorie der konvexen Körper, Ergebnisse der Mathematik und ihrer Grenzgebiete, Berlin, vol. 3 (1934), no. 1.
2. BLASCHKE, Kreis und Kugel, Veit und Comp., Leipzig, 1916.
3. MINKOWSKI, Gesammelte Abhandlungen, Teubner, Leipzig and Berlin, 1911.
4. GARRETT BIRKHOFF, Integration of functions with values in a Banach space, Transactions of the American Mathematical Society, vol. 38 (1935), pp. 357-378.
5. SIERPINSKI, General Topology, The University of Toronto Press, Toronto, 1934.
6. MAZUR, Über die kleinste konvexe Menge, die eine gegebene kompakte Menge enthält, Studia Mathematica, vol. 2 (1930), pp. 7-9.
7. CARATHÉODORY, Über den Variabilitätsbereich der Fourier'schen Konstanten von positiven harmonischen Funktionen, Rendiconti del Circolo Matematico di Palermo, vol. 32 (1911), pp. 193-217.
8. HAUSDORFF, Mengenlehre, Berlin, 1927.


BROWN UNIVERSITY.








ABELIAN GROUPS WITHOUT ELEMENTS OF FINITE ORDER* 
By REINHOLD BAER 


An abelian group which is written so that its symbols are combined by
addition and which has no elements of finite order other than 0¹ may be called
completely reducible, if it is a direct sum of groups of rank one. For every
group is contained in a completely reducible group of the same rank. There 
exist furthermore direct irreducible groups of every finite rank and the groups 
of rank 1 are exactly the subgroups of the additive group of the rational numbers 
and therefore irreducible. 

The structure of a completely reducible group is uniquely determined by the 
ranks of the differences of certain characteristic subgroups. A survey of the 
structures of all subgroups of completely reducible groups would involve the 
solution of the general structure problem, since every group is contained in a 
completely reducible group. But it is possible to characterize a class of com- 
pletely reducible subgroups (of completely reducible groups) which are iso- 
morphic with a direct summand of the whole group. 

Every property of a completely reducible group which refers to finite subsets 
or to subgroups of finite rank also holds true for separable groups, i.e., for groups 
whose finite subsets are contained in completely reducible direct summands. 
Countable separable groups are completely reducible. But there exist separable 
groups which are not completely reducible, e.g., vector groups like the additive 
group of all the sequences of integers. Further criteria for complete reduci- 
bility, for separability and for the complete reducibility of separable groups are 
given. There exists in particular a characterization of the direct summands of 
finite rank which holds true in every separable group and which is valid in a 
group of finite rank if, and only if, this group is completely reducible. Further- 
more, every direct summand of finite rank of a separable group is completely 
reducible. 

The subsets b' and b'' of the group J are isotype in the group J, if there
exists a proper automorphism of J which maps b' upon b''. The classes of
isotype elements of a separable group are determined by complete sets of invariants
and an enumeration of the characteristic and of the strictly characteristic sub- 
groups is based on this classification of the elements. The existence of char- 
acteristic subgroups which are not strictly characteristic and the existence of 
elements which are not isotype though contained in the same characteristic 
and in the same strictly characteristic subgroups are closely related phenomena; 
but neither is a consequence of the other. 


* Received November 3, 1936; presented to the American Mathematical Society
September 3, 1936.

¹ The word "group" is substituted for this longer statement wherever there is no danger
of confusion.




A subgroup S of the group J is completely reducible in J, if some direct 
decompositions of J induce complete reductions of S. Not even all subgroups 
of rank one of a completely reducible group J are completely reducible in J. 
But it is possible to give criteria for the complete reducibility of subgroups of 
finite rank in separable groups. If the subgroup S of J is completely reducible 
in J, and if the rank of S is finite, then the type of S in J can be described by 
means of invariants which are derived from the existence of characteristic
subgroups of S in characteristic subgroups of J.

All properties and concepts used are invariants and their definitions are based
on the concept of multiplicity of an element x in the group J, i.e., the l.c.m.
of all the positive integers n such that $x \equiv 0 \bmod nJ$. The calculus with these
generalized numbers (in the sense of E. Steinitz) is the basis of the methods 
applied here. 

Previous work in this domain concerns mainly direct sums of a finite number 
of infinite cyclic groups. They form a rather special, though important, class 
of completely reducible groups and their theory is embodied in the theory of 
completely reducible and separable groups as developed here. 


Chapter I. Preliminaries 


1. Dependence, rank and rational multipliers.² Let J be an abelian group
whose elements are combined by addition and which contains no non-zero
element of finite order. If x is an element of J and n an integer, then nx = 0
implies that n = 0 or x = 0. An element x of J is therefore dependent on
the subset S of J, if there exists a positive integer n such that nx is contained
in the subgroup of J which is generated by the elements in S. A subset of J is
dependent, if at least one of its elements depends on the others. A subset of J
is therefore independent if, and only if, all its finite subsets are independent,
and the finite subset $b_1, \cdots, b_k$ is independent if, and only if, $\sum_{i=1}^{k} c_i b_i = 0$
implies that all the integral coefficients $c_i$ are 0.

The rank of J is the smallest (finite or infinite) number r(J) such that there
exists a subset of J on which every element of J is dependent and which
contains r(J) elements. There always exist greatest independent subsets of J.
If G is a greatest independent subset of J, then every element of J is dependent
on G and G contains exactly r(J) elements.

The subset C of the group J is closed (in J), if C contains every element of J
which is dependent on C. Closed subsets are subgroups and a subgroup S of
J is closed in J if, and only if, the class group J/S does not contain elements $\neq 0$
of finite order.

Since the intersection of any number of closed subgroups is also a closed 
subgroup, there always exists a smallest closed subgroup which contains a given 


² For proofs of the facts mentioned in this section, see R. Baer, The subgroup of the
elements of finite order of an abelian group, Ann. of Math., vol. 37 (1936), pp. 766-781.
















set S: the closed subgroup, generated by S. It contains exactly those elements 
which are dependent on S. 

In groups of rank one any pair of elements is dependent. The elements
$x \neq 0$ and $y \neq 0$ are dependent if, and only if, there exist integers n and m
such that

$$nx = my \neq 0.$$

The ratio of n and m is uniquely determined by the dependent elements x and y;
and to given x, n, m there exists at most one solution y of the equation nx = my.
If the equation nx = my has a solution y in J, the notation $y = \frac{n}{m} x$ may be used.


By this definition a multiplication of the elements of J with rational numbers 
has been introduced. Since it is possible that rb’ exists in J, but rb” does not, 
the rational numbers are not operators in the usual sense. These rational 
multipliers satisfy the following rules. 

Suppose that r and s are rational numbers, x and y elements in J. 


If rx and ry exist in J, then r(x + y) exists in J and satisfies

$$r(x + y) = rx + ry.$$

If rx and sx exist in J, then (r + s)x exists in J and satisfies

$$(r + s)x = rx + sx.$$

If rx and s(rx) exist in J, then (sr)x exists in J and satisfies (sr)x = s(rx).
If rx and sx exist in J, and if the denominators of r and s are relatively prime,
then (rs)x exists in J and satisfies

$$(rs)x = r(sx) = s(rx).$$


A proof of the last formula may be added. If $r = mn^{-1}$, $s = hk^{-1}$ and n and k
are relatively prime, then there exist integers k', n' such that nn' + kk' = 1
and therefore

$$mhx = mhkk'x + mhnn'x = kn(hk'rx + mn'sx).$$
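
The identity just proved yields an explicit formula for (rs)x, namely (rs)x = hk'(rx) + mn'(sx), which uses only integral multiples of rx and sx. The following sketch checks it in the group of integers; the function names `bezout` and `product_multiple` are hypothetical, and n and k are assumed to be positive and relatively prime.

```python
def bezout(n, k):
    """Return integers (n1, k1) with n*n1 + k*k1 == gcd(n, k) (extended Euclid)."""
    old_r, cur_r = n, k
    old_s, cur_s = 1, 0
    old_t, cur_t = 0, 1
    while cur_r != 0:
        q = old_r // cur_r
        old_r, cur_r = cur_r, old_r - q * cur_r
        old_s, cur_s = cur_s, old_s - q * cur_s
        old_t, cur_t = cur_t, old_t - q * cur_t
    return old_s, old_t


def product_multiple(m, n, h, k, rx, sx):
    """Given rx = (m/n)x and sx = (h/k)x with n, k relatively prime,
    return (rs)x = h*k'*(rx) + m*n'*(sx), where n*n' + k*k' = 1."""
    n1, k1 = bezout(n, k)
    if n * n1 + k * k1 != 1:
        raise ValueError("denominators must be relatively prime")
    return h * k1 * rx + m * n1 * sx


# In the group of integers take x = 6, r = 3/2, s = 5/3:
# rx = 9 and sx = 10 exist, and (rs)x = (15/6) * 6 = 15.
result = product_multiple(3, 2, 5, 3, 9, 10)
```

Only additions and integral multiples of elements already known to exist are used, which is why the product (rs)x exists in J whenever rx and sx do.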


It is important to note that the converses of these rules are not true. 

These formulas show that the closed subgroup of J which is generated by the
element $x \neq 0$ is simply isomorphic with the additive group of those rational
numbers which are multipliers of x. Groups of rank one are therefore exactly
the subgroups of the additive group of all the rational numbers. Hence they
may be called rational groups.

The group J is completely reducible, if J is a direct sum of rational groups.
If J is the direct sum of the rational groups $J_\nu$ and $b_\nu \neq 0$ an element of $J_\nu$,
the elements $b_\nu$ form a basis of J. The subset B of J is therefore a basis of
J if, and only if, B is a greatest independent subset of J and an equation

$$nx = \sum_{i=1}^{k} c_i b_i,$$

where n is a positive integer, $x \neq 0$ an element in J, the $c_i$ are integers and the
$b_i$ different elements in B, implies that every $(c_i n^{-1}) b_i$ exists in J.




J is complete, if J = nJ for every positive integer n. Complete subgroups 
are direct summands. Complete groups are direct sums of groups which are 
isomorphic with the additive group of all the rational numbers. Every greatest 
independent subset of a complete group is a basis. Every group is contained 
in an essentially uniquely determined smallest complete group. 


2. Multiplicities and derived invariants. In the following a certain
generalization of the multiplicative set of the positive integers will be needed.³ The
principal concepts and properties of these generalized numbers will therefore
be enumerated.

If, for every prime number p, v(p) is either a non-negative integer or the
symbol $\infty$, then

$$v = \prod_p p^{v(p)}$$

is such a generalized number v and v is uniquely determined by its p-values v(p).

If S is any set of (ordinary or generalized) numbers, the product of the
numbers in S is the number whose p-value is the sum of the p-values of the numbers
in S. Here the sum of an infinity of positive integers is $\infty$ and the sum of $\infty$
and anything is $\infty$. The p-value of the g.c.d. (l.c.m.) of the numbers in the
set S is the minimum (maximum) of the p-values of the numbers in S.
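
These rules translate directly into arithmetic on p-values. The sketch below is a simplified model, not the general object: it assumes that only finitely many primes have a non-zero p-value, represents the symbol $\infty$ by `math.inf`, and uses a hypothetical class name `GeneralizedNumber`.

```python
import math

INF = math.inf   # stands for the symbol "infinity" as a p-value


class GeneralizedNumber:
    """A generalized number stored by its p-values.

    Primes that are not listed have p-value 0. Only finitely many non-zero
    p-values can be stored, which is a restriction of this sketch.
    """

    def __init__(self, pvalues):
        self.pv = {p: e for p, e in pvalues.items() if e != 0}

    def value(self, p):
        return self.pv.get(p, 0)

    def _combine(self, other, op):
        primes = set(self.pv) | set(other.pv)
        return GeneralizedNumber(
            {p: op(self.value(p), other.value(p)) for p in primes})

    def __mul__(self, other):          # p-values add
        return self._combine(other, lambda a, b: a + b)

    def gcd(self, other):              # minimum of the p-values
        return self._combine(other, min)

    def lcm(self, other):              # maximum of the p-values
        return self._combine(other, max)

    def divides(self, other):          # v(p) <= w(p) for every prime p
        return all(e <= other.value(p) for p, e in self.pv.items())


a = GeneralizedNumber({2: INF, 3: 1})   # 2^infinity * 3
b = GeneralizedNumber({2: 2, 5: 1})     # the ordinary integer 20
```

For instance, the product of a and b has p-values $\infty$, 1, 1 at the primes 2, 3, 5, while their g.c.d. is the ordinary integer 4.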

v is a divisor of w, w a multiple of v, i.e., v / w, if there exists a solution x of
the equation w = xv. v / w if, and only if, $v(p) \leq w(p)$ for every prime number p
(where v(p) is the p-value of v and w(p) the p-value of w). If v / w, there may
exist many solutions of the equation w = xv. The g.c.d. w,v and the l.c.m.
w,v of all the solutions x of w = xv are also solutions of this equation, the
smallest and the greatest, respectively. If v / w, the p-value of








w,v is w,v is 

w(p) — v(p) w(p) — v(p) if w(p) is finite 
a a if w(p) is infinite, v(p) finite 
oo 0 if v(p) is infinite. 


The infinite part of the number v is v,, = v,v and the finite part is vf = vv, . 
The symbols v,, and v; are relatively prime, the p-values of v,, either 0 or © and 
the p-values of v; are all finite. We have v = vy, . 

1 is the g.c.d. of all the numbers. Every number is the l.c.m. of ordinary
integers and the l.c.m. of all numbers is the number without finite p-values.
It will be convenient to add to these numbers a symbol $\infty$ which is different
from all numbers and a multiple of every number.

The numbers v and w have the same genus, i.e., | v | = | w |, if there exist
ordinary positive integers m and n such that mv = nw. Hence | v | = | w | if,
and only if, $v_\infty = w_\infty$ and for almost every p the p-value of v is the same as the
p-value of w.


³ E. Steinitz has used this generalization for enumeration of the finite fields.
















If $| v_i | = | w_i |$, $i = 1, 2$, then $| v_1 v_2 | = | w_1 w_2 |$ and the g.c.d. (l.c.m.) of
$v_1$, $v_2$ has the same genus as the g.c.d. (l.c.m.) of $w_1$, $w_2$.


Product, g.c.d. and l.c.m. of a finite number of genera are therefore uniquely 
determined genera. 

If the three genera a, b, c satisfy a = bc, then b is a divisor of a and a a
multiple of b, i.e., $b \leq a$. If $b \leq a$ and $a \leq b$, then a = b. If $a \leq b$ and $a \neq b$,
then a < b. The genus a is a divisor of the genus b if, and only if, there exist
numbers of genus a which are divisors of certain numbers of genus b.

DEFINITION 2.1. The multiplicity $m(x) = m(x < J)$ of the element x in the 
group J is $\infty$, if $x = 0$, and is the l.c.m. of all the positive integers n such that 
$x \equiv 0 \bmod nJ$ (i.e., such that $n^{-1}x$ exists in J), if $x \neq 0$. 

If $x \neq 0$, the genus $|x| = |x < J|$ of x in J is the genus of the multiplicity 
of x in J. 

If x is an element of the subgroup S of J, then $m(x < S)$ is a divisor of 
$m(x < J)$ and, if $x \neq 0$, then $|x < S|$ is a divisor of $|x < J|$. If S is a closed 
subgroup of J, the elements of S have the same multiplicity and the same 
genus in S and in J. 

If S is a (closed) subgroup of J, x an element of J, the multiplicity $m(x < J/S) 
= m(S + x < J/S)$ of x mod S is a multiple of $m(x < J)$ and, if $x \not\equiv 0 \bmod S$, 
the genus $|x < J/S| = |S + x < J/S|$ of x mod S is a multiple of its 
genus in J. 

If n is a positive integer, then $m(nx) = n\,m(x)$ and $n^{-1}x$ exists in J if, and 
only if, n is a divisor of $m(x < J)$. If r is a positive rational number such that 
rx exists in J, then $m(rx) = r\,m(x)$. 
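As a small illustration of Definition 2.1, the following sketch computes multiplicities in the free abelian group of integer vectors, where every multiplicity of a nonzero element is an ordinary integer (the g.c.d. of the coordinates). The choice of this particular group and the names `multiplicity`, `divide` and `times` are assumptions made for the example only.

```python
import math
from functools import reduce


def multiplicity(x):
    """m(x) for an integer vector x: the l.c.m. of all n with n^{-1}x integral.

    For a nonzero vector this equals the g.c.d. of the coordinates; the zero
    vector gets the symbol infinity, as in Definition 2.1.
    """
    if not any(x):
        return math.inf
    return reduce(math.gcd, (abs(c) for c in x))


def divide(x, n):
    """Return n^{-1}x if it exists among the integer vectors, else None."""
    if all(c % n == 0 for c in x):
        return tuple(c // n for c in x)
    return None


def times(n, x):
    """Return the multiple n*x."""
    return tuple(n * c for c in x)
```

With `x = (12, 18, -30)` one gets `multiplicity(x) == 6`, the quotient `divide(x, n)` exists exactly for the divisors n of 6, and `multiplicity(times(5, x)) == 5 * multiplicity(x)`, in line with m(nx) = n m(x).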

(2.2) (a) If $x = \sum_{i=1}^{k} x_i$, then $m(x)$ is a multiple of the g.c.d. of the $m(x_i)$ and, 
if x, $x_i$ are all $\neq 0$, then $|x|$ is a multiple of the g.c.d. of the $|x_i|$. 

(b) If $x = \sum_{i=1}^{k} x_i$, and if the elements $x_i \neq 0$ are contained in different 
components of a direct decomposition of the group J, then $m(x)$ is the g.c.d. of the $m(x_i)$ 
and $|x|$ the g.c.d. of the $|x_i|$. 

(c) The elements $x \neq 0$ and $y \neq 0$ generate isomorphic closed subgroups of the 
group J if, and only if, $|x| = |y|$. 

(d) The closed subgroup of J, generated by the element $x \neq 0$, is isomorphic 
with a subgroup of the closed subgroup of J, generated by the element $y \neq 0$, if, 
and only if, $|x| \le |y|$. 

Proof. (a) and (b) are consequences of the corresponding facts for the p-values 
of the multiplicities. If x and y are contained in the same group of 
rank one, there exist integers n and u such that $nx = uy \neq 0$ and this implies 


4 The assumption that there do not exist elements $\neq 0$ of finite order in J is not needed 
for this definition. If J is an abelian group without elements of infinite order, the multiplicity 
is essentially Prüfer's "Höhe". 








ABELIAN GROUPS WITHOUT ELEMENTS OF FINITE ORDER 73 


the necessity of the conditions in (c) and (d). If finally the condition of 
(c) [respectively of (d)] is satisfied, there exist integers $n \neq 0$, $u \neq 0$ such that 
$m(nx) = m(uy)$ [respectively $m(nx) \mid m(uy)$]. If therefore $r \neq 0$ is a rational 
number, then $r(uy)$ exists if, and only if, $r(nx)$ exists [respectively $r(uy)$ exists, 
if $r(nx)$ exists]. Hence in mapping $r(nx)$ upon $r(uy)$ a required isomorphism is defined. 

By (c) all the elements # 0 of a rational group have the same genus. The 
genus of the elements # 0 of the rational group R is called the genus | R | of R. 

(c) and (d) imply that the rational groups R and $R'$ are isomorphic, if R is 
isomorphic with a subgroup of $R'$ and $R'$ with a subgroup of R. 

DEFINITION 2.3. If f(x) is a property, then (J, f(x)) is the (not necessarily closed) 
subgroup of J, generated by the elements x of J which satisfy f(x). 

f(x) is an additive property, if⁵ 

$$\Big(\sum_v J_v,\ f(x)\Big) = \sum_v \big(J_v,\ f(x)\big).$$


The following properties f(x) will be used: $g \mid m(x < J)$; the order of x mod 
the subgroup S of J is a finite divisor of g; $s \le |x < J|$; $s < |x < J|$; 
$|x < J| \not< s$; $|x < J| \not\le s$. Here g is an (ordinary or generalized) number and 
s a genus. 

All these properties (except the second) are additive. If f(x) is one of the 
three first properties, every element of (J, f(x)) has the property f(x). 

Since $(J, g \mid m(x < J))$ is the intersection of the groups nJ for finite divisors 
n of g, and since its structure depends only on g and on the structure of J, it 
may be denoted by gJ. 

The closed subgroup of J, generated by gJ, is exactly $(J, |g| \le |x < J|)$ 
and the groups $(J, s \le |x < J|)$ are therefore closed subgroups of J. 

If f(x, g, S) is the second property, and if $g \mid m(x < J)$ for every x in S, then 
(J, f(x, g, S)) is the join of the subgroups $n^{-1}S$ of J for finite divisors n of g. 
Since under this assumption the structure of (J, f(x, g, S)) depends only on g 
and on the structure of S, and since every group is contained in a complete group, 
this property may be used for defining $g^{-1}S$. If $g_\infty = 1$, then $g^{-1}(gJ) = J$; 
if furthermore $J = (J, |g| \le |x|)$, then $g(g^{-1}J) = J$. Thus under these 
assumptions the structures of J and gJ and the structures of J and $g^{-1}J$ determine 
each other. Note that for a rational group R and a prime number p either 
$p^\infty R = R$ or $p^\infty R = 0$. 

Finally, $m(x < g^{-1}J) = g\,m(x < J)$ and $g\,m(x < gJ) = m(x < J)$. 


(2.4) (a) $(J, s < |x|) \le (J, s \le |x|)$, $(J, |x| \not\le s) \le (J, |x| \not< s)$, 
$(J, s < |x|) \le (J, |x| \not\le s)$, $(J, s \le |x|) \le (J, |x| \not< s)$. 


5 $\sum_v J_v$ is here and in the future the direct sum of the groups $J_v$. 
















(b) Every class of $J(s)^* = (J, s \le |x|)/(J, s < |x|)$ is contained in exactly 
one class of $J(s)^{**} = (J, |x| \not< s)/(J, |x| \not\le s)$. Thus a homomorphism of 
$J(s)^*$ upon the whole group $J(s)^{**}$ is defined and this homomorphism is an 
isomorphism if, and only if, 

(b*) $(J, s < |x|)$ is exactly the intersection of $(J, s \le |x|)$ and of $(J, |x| \not\le s)$. 

These statements are consequences of the definitions and of the following fact. 
If y is an element of $(J, |x| \not< s)$, then y is the sum of elements $y_i$ and of 
elements $z_j$ such that $s \le |y_i|$ and $s \not\le |z_j| \not< s$. Since the $y_i$ are therefore 
contained in $(J, s \le |x|)$ and the $z_j$ in $(J, |x| \not\le s)$, this implies that every 
element of $(J, |x| \not< s)$ is mod $(J, |x| \not\le s)$ congruent to an element of 
$(J, s \le |x|)$. 

DEFINITION 2.5. The direct decomposition $J = \sum_v J_v$ is a partial reduction of J, 
if for every v all the elements $\neq 0$ of $J_v$ have the same genus in $J_v$ (and in J). 

Every complete reduction of the group J is by (2.2c) also a partial reduction. 
If $J = \sum_v J_v$ is a partial reduction of the group J, let J(t) be the sum of all 
those $J_v$ whose elements $\neq 0$ have the genus t. Then $J = \sum_t J(t)$ is also a 
partial reduction of J, since every element $\neq 0$ in J(t) has the genus t. Since, 
therefore, elements $\neq 0$ in different components J(t) have different genera, 
this decomposition is a smallest partial reduction of J. 


(2.6) If $J = \sum_t J(t)$ is a smallest partial reduction of J, then 

$$(J, t \le |x|) = \sum_{t \le s} J(s), \qquad (J, t < |x|) = \sum_{t < s} J(s),$$

$$(J, |x| \not< t) = \sum_{s \not< t} J(s), \qquad (J, |x| \not\le t) = \sum_{s \not\le t} J(s)$$

and J(t) represents exactly the classes of $J(t)^*$ and the classes of $J(t)^{**}$. 

These are consequences of (2.2) and (2.4). 
(2.7) Suppose that every finite subset of the group J is contained in a partially 
reducible direct summand of J. 

(a) $(J, t < |x|)$, $(J, |x| \not< t)$ and $(J, |x| \not\le t)$ are closed subgroups of J. 

(b) $(J, t < |x|)$ is for every genus t the intersection of $(J, t \le |x|)$ and 
$(J, |x| \not\le t)$. 

(c) All the elements $\neq 0$ of $J(t)^*$ and of $J(t)^{**}$ have the genus t. 

(d) To every element $b \neq 0$ of J there exists at least one and at most a finite 
number of genera t such that 

$$b \equiv 0 \bmod (J, |x| \not< t), \qquad b \not\equiv 0 \bmod (J, |x| \not\le t).$$

Proof. If b is any element of J, there exists a partially reducible direct summand 
D of J which contains b. If f(x) is one of the three discussed properties, 
then (D, f(x)) is a direct summand of (J, f(x)) and by (2.6) a direct summand of 
D and now (a)-(c) are consequences of (2.6) and (2.4). If $b \neq 0$ and 
$D = \sum_t D(t)$ is a smallest partial reduction of J, then $b = \sum_{i=1}^{k} b_i$, $b_i \neq 0$, $b_i$ in $D(t_i)$, 
$t_i \neq t_j$ for $i \neq j$. The genera t such that $b \equiv 0 \bmod (J, |x| \not< t)$, 
$b \not\equiv 0 \bmod (J, |x| \not\le t)$ are exactly the genera $t_i$ such that $t_j \not< t_i$ for every $j \neq i$, and this 
proves (d). 

THEOREM 2.8. (a) Suppose that $J_i$ is a direct sum of rational groups of genus 
$t_i$. Then $J_1$ and $J_2$ are isomorphic if, and only if, 

$$t_1 = t_2, \qquad r(J_1) = r(J_2).$$

(b) Suppose that $J_i$ is partially reducible. Then $J_1$ and $J_2$ are isomorphic if, 
and only if, for every genus t, $J_1(t)^*$ and $J_2(t)^*$ are isomorphic. 

(c) Suppose that $J_i$ is completely reducible. Then $J_1$ and $J_2$ are isomorphic if, 
and only if, for every genus t 

$$r(J_1(t)^*) = r(J_2(t)^*).$$

Proof. If $J_i$ is a direct sum of rational groups of genus $t_i$, the number of 
summands is $r(J_i)$ and $(J_i, t_i < |x|) = 0$, $J_i = (J_i, t_i \le |x|)$. This proves 
(a). (b) is a consequence of (2.6) and (c) is a consequence of (a), (b), (2.6). 

COROLLARY 2.9. Any two decompositions of a group J into rational direct 
summands are isomorphic. 

This is true since there appear exactly r(J(t)) summands of genus t in such 
a decomposition. 
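Theorem 2.8 (c) and Corollary 2.9 say that a completely reducible group is determined by how many rational summands of each genus occur. A minimal sketch of this bookkeeping follows. It assumes, purely for illustration, that each rational summand is described by the finite set of primes at which its genus has an infinite p-value (so only genera of numbers with finitely many nonzero p-values are covered); the names `invariants` and `isomorphic` are hypothetical.

```python
from collections import Counter


def invariants(summand_genera):
    """Count the rational summands of each genus.

    Each genus is given as an iterable of primes with infinite p-value;
    the counter plays the role of the ranks r(J(t)*) in Theorem 2.8 (c).
    """
    return Counter(frozenset(g) for g in summand_genera)


def isomorphic(a, b):
    """Compare two completely reducible groups by their rank invariants."""
    return invariants(a) == invariants(b)
```

Two lists of summands that differ only in order give the same counter, which mirrors the statement that any two decompositions into rational direct summands are isomorphic.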


Chapter II. Direct summands of finite rank 


3. Direct summands and complete reducibility of groups of finite rank. If 
the group J is the direct sum of its subgroups S and T, i.e., $J = S + T$, then 

$$(J, s \le |x|) = (S, s \le |x|) + (T, s \le |x|),$$

$$(J, s < |x|) = (S, s < |x|) + (T, s < |x|).$$


The subgroup of J(s)* which contains elements of the direct summand S of J 
is therefore a direct summand of J(s)*. Hence a direct summand S of the 
group J satisfies the following conditions. 

CONDITION 3.1. The classes of $J(s)^*$ which contain elements of S form a closed 
subgroup of $J(s)^*$.⁶ 

CONDITION 3.2. $(S, s < |x|)$ is the intersection of S and $(J, s < |x|)$. 

The above formulas imply: 
(3.3) The subgroup S of the direct summand D of the group J satisfies the 
conditions 3.1 and 3.2 in J if, and only if, S satisfies these conditions in D. 


6 It may happen that $(J, s < |x|)$ is not a closed subgroup of J and that consequently 
$J(s)^*$ contains elements $\neq 0$ of finite order. Then the "closed" subgroups of $J(s)^*$ are 
not defined and therefore the following proposition may be substituted for condition 3.1, 
if $(J, s < |x|)$ is not closed: a congruence $nz \equiv s \bmod (J, s < |x|)$, where n is an ordinary 
positive integer and s an element of S whose genus in J is s, has a solution z in J if, and only 
if, there exists a solution z in S. 

Note that this proposition and condition 3.1 are equivalent, if $(J, s < |x|)$ is a closed 
subgroup of J. 
















THEOREM 3.4. The group J of finite rank is completely reducible if, and only if, 
every subgroup S of J which satisfies the conditions 3.1 and 3.2 is a direct sum- 
mand of J. 

Proof. A. Suppose that J is a completely reducible group of finite rank 
and that the subgroup S of J satisfies the conditions 3.1 and 3.2. 

Case 1. J is a direct sum of a finite number of isomorphic groups (of rank 1) 
and r(S) = 1. 

By condition 3.1 S is a closed subgroup of J. Thus if $r(J) = 1$, then $J = S$. 
Suppose now that $r(J) = 2$. Then $J = J' + J''$ and either S is one of these 
direct summands or there exist elements $b' \neq 0$ in $J'$, $b'' \neq 0$ in $J''$ and relatively 
prime positive integers $n'$, $n''$ such that $n'b' + n''b''$ is an element $\neq 0$ 
of S and $m(b') = m(b'') = m(n'b' + n''b'')$. There exist therefore integers 
$k'$, $k''$ such that $k'n' - k''n'' = 1$. Then the elements 

$$c' = n'b' + n''b'', \qquad c'' = k''b' + k'b''$$

form a basis of J, since $m(c') = m(c'') = m(b') = m(b'')$, and since the subgroup 
of J, generated by $c'$ and $c''$, contains $b'$ and $b''$. Thus S is a direct summand of J. 

If, finally, $J = \sum_{i=1}^{r(J)} J_i$, where the $J_i$ are isomorphic rational groups, r(J) any 
finite number, and $s \neq 0$ any element in S, then $s = \sum_{i=1}^{r(J)} s_i$ with $s_i$ in $J_i$. Then 
$s' = \sum_{i=2}^{r(J)} s_i$ generates a closed subgroup $S'$ of $J' = \sum_{i=2}^{r(J)} J_i$ and, by complete 
induction, $S'$ is a direct summand of $J'$. Thus S is contained in a direct summand 
of rank 2 and is therefore, as proved above, a direct summand of J. 
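The change of basis in the rank 2 step rests on solving $k'n' - k''n'' = 1$ for relatively prime $n'$, $n''$. The sketch below finds such integers with the extended Euclidean algorithm and checks that $b'$ and $b''$ are recovered from $c'$ and $c''$. It models $b'$, $b''$ as the unit vectors of the integer plane, which is an assumption made only to have something concrete to compute with; the function names are hypothetical.

```python
def extended_gcd(a, b):
    """Return (g, x, y) with a*x + b*y == g == gcd(a, b) for a, b >= 0."""
    if b == 0:
        return a, 1, 0
    g, x, y = extended_gcd(b, a % b)
    return g, y, x - (a // b) * y


def bezout_pair(n1, n2):
    """Return (k1, k2) with k1*n1 - k2*n2 == 1 for relatively prime n1, n2."""
    g, x, y = extended_gcd(n1, n2)
    if g != 1:
        raise ValueError("n1 and n2 must be relatively prime")
    return x, -y


def new_basis(n1, n2):
    """Return c' = n1*b' + n2*b'' and c'' = k2*b' + k1*b'' as coordinate pairs."""
    k1, k2 = bezout_pair(n1, n2)
    return (n1, n2), (k2, k1)


def recover_old_basis(n1, n2):
    """Express b' = k1*c' - n2*c'' and b'' = n1*c'' - k2*c' in coordinates."""
    k1, k2 = bezout_pair(n1, n2)
    c1, c2 = new_basis(n1, n2)
    b1 = tuple(k1 * u - n2 * v for u, v in zip(c1, c2))
    b2 = tuple(n1 * v - k2 * u for u, v in zip(c1, c2))
    return b1, b2
```

Since the coordinate matrix of $c'$, $c''$ has determinant $n'k' - n''k'' = 1$, the pair generates the same group as $b'$, $b''$, which is what `recover_old_basis` confirms numerically.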

Case 2. J is a completely reducible group of finite rank and r(S) = 1. 

Let s be the genus of S and let S* be the subgroup of J(s)* whose classes 
contain elements of S. Then every element of S is contained in one class of S* 
and every class of S* contains by condition 3.2 exactly one element of S.  S* is 
by condition 3.1 a closed subgroup of rank one of J(s)*. Since J is completely 
reducible and of finite rank, and since therefore, by (2.6), J(s)* is a direct sum 
of a finite number of isomorphic rational groups, it follows from case 1 that S* 
is a direct summand of $J(s)^*$, i.e., $J(s)^* = S^* + T^*$. If T is the subgroup of 
$(J, s \le |x|)$ which contains $(J, s < |x|)$ and satisfies $T^* = T/(J, s < |x|)$, 
then $(J, s \le |x|) = S + T$, since S represents exactly $S^*$. Since, by (2.6), 
$(J, s \le |x|)$ is a direct summand of J, this implies that S is a direct summand 
of J. 

Case 3. J is a completely reducible group of finite rank. 

There exist in S elements $w \neq 0$ such that $|w < S| \not< |s < S|$ for every 
element $s \neq 0$ in S. 

For if $b \neq 0$ is an element in S whose genus in S is s, and if $s < t$, then 
$(S, t \le |x|) < (S, s \le |x|)$ and therefore, since both these subgroups are closed 
subgroups of S, $r((S, t \le |x|)) < r((S, s \le |x|))$. 

Let now w be an element $\neq 0$ in S such that $|w < S| \not< |s < S|$ for every 
$s \neq 0$ in S, let W be the closed subgroup of S, generated by w, and s the genus 
of W. Since $(S, s < |x|) = 0$, and since S satisfies the conditions 3.1 and 3.2, 
W satisfies the conditions 3.1 and 3.2 in J and is therefore as group of rank one 
a direct summand of J, i.e., $J = W + J'$. 

Then $S = W + S'$ where $S'$ is the intersection of S and $J'$, since $W \le S$. 
The subgroup $S'$ of $J'$ satisfies therefore the conditions 3.1 and 3.2 in $J'$, and 
since $r(J') = r(J) - 1$, it can be assumed that $S'$ is a direct summand of $J'$. 
Thus it has been proved by complete induction with regard to r(J) that S is a 
direct summand of J. 

B. Suppose now that J is a group of finite rank such that every subgroup of J 
which satisfies the conditions 3.1 and 3.2 in J is a direct summand of J. 

Since J is a group of finite rank, there exists (as proved in A., case 3) an element 
$w \neq 0$ in J such that $|w < J| \not< |b < J|$ for every element $b \neq 0$ in J. 
The closed subgroup W of J, generated by w, satisfies the conditions 3.1 and 3.2 
and is therefore a direct summand of J, i.e., J = W + J’. Every subgroup 
S of J’ which satisfies the conditions 3.1 and 3.2 in J’ by (3.3) also satisfies them 
in J and is therefore a direct summand of J and of J’. Since r(J’) = r(J) — 1, 
it can therefore be assumed that J’ is completely reducible. Hence the com- 
plete reducibility of J has been proved by complete induction with regard 
to r(J). 

COROLLARY 3.5. If J is a completely reducible group of finite rank, every direct 
summand of J is completely reducible. 

For if the subgroup S of the direct summand D of J satisfies the conditions 
3.1 and 3.2 in D, it follows from (3.3) and Theorem 3.4 that S is a direct sum- 
mand of J and therefore a direct summand of D, i.e., D is by Theorem 3.4 com- 
pletely reducible. 

COROLLARY 3.6. Suppose that all the elements $\neq 0$ of the group J have the 
same genus s in J. 

(a) If J is complete, then J is completely reducible and every closed subgroup of J 
is a direct summand of J. 

(b) If J is not complete, every closed subgroup of J is a direct summand of J if, 
and only if, J is a completely reducible group of finite rank. 

Proof. (a) is a consequence of the fact that every closed subgroup of a com- 
plete group is complete and that complete subgroups are direct summands, 
complete groups direct sums of rational groups (§1). 

Since $J = (J, s \le |x|)$, $0 = (J, s < |x|)$, the closed subgroups of J satisfy 
the conditions 3.1 and 3.2. Every closed subgroup of J is therefore by Theorem 
3.4 a direct summand of J, if J is a completely reducible group of finite rank. 
Suppose now that J is not complete and that every closed subgroup of J is a 
direct summand of J. If r(J) is infinite, then J contains an independent subset 
$b_1, b_2, \dots, b_i, \dots$. The closed subgroup $J'$ of J, generated by these elements, 
is a direct summand of J. Since every subset of the sequence of the $b_i$ generates 
a closed subgroup of J which is a direct summand of J and of $J'$, the elements 
$b_i$ form a basis of $J'$. Since J is not complete, there exists a prime number p 
such that $pJ < J$. The closed subgroup of J and of $J'$ which is generated 
by the elements $b_{i-1} - p b_i$ is a direct summand $J''$ of J and of $J'$. $J''$ does not 
contain $b_1$. But $J^* = J'/J''$ satisfies $pJ^* = J^*$. This is a contradiction, since 
the p-value of $m(b < J)$ is finite for every $b \neq 0$ in J. The rank of J is therefore 
finite. Since every closed subgroup of J is a direct summand of J, Theorem 3.4 
implies the complete reducibility of J. 

THEOREM 3.7. If there exists a (generalized) number g and a finite subset F 
of the group J such that $g \mid m(f < J)$ for every f in F and such that J is generated 
by the elements $n^{-1}f$ with f in F and n any positive integer, dividing g, then J is a 
direct sum of a finite number of rational groups of genus $|g|$. 

Proof. Denote by $f_1, \dots, f_k$ the elements of F. Let $J'$ be a direct sum of 
k rational groups of genus $|g|$ and $f'_1, \dots, f'_k$ a basis of $J'$ such that $g = m(f'_i < J')$. 
There exists a homomorphism $\alpha$ of $J'$ upon the whole group J such that 
$f'_i\alpha = f_i$. If $W'$ is the subgroup of all the elements of $J'$ which are mapped 
upon 0 by $\alpha$, then $J'/W'$ and J are isomorphic. $W'$ is therefore a closed subgroup 
of $J'$. $W'$ is by Corollary 3.6 a direct summand of $J'$ and $J'/W'$ is therefore 
by Corollary 3.5 completely reducible. Hence J is completely reducible 
and J is by Corollary 2.9 a direct sum of rational groups of genus $|g|$. 

COROLLARY 3.8. If there exists a finite subset F of J such that all the elements 
of F have the same genus s in J, and such that J is generated by the rational multiples 
of the elements in F, then J is a direct sum of a finite number of rational groups 
of genus s. 

For if g is the g.c.d. of the numbers $m(f < J)$ with f in F, then $m(f < J) : g = n(f)$ 
is an ordinary positive integer and the elements $f' = n(f)^{-1}f$ for f in F form 
a subset $F'$ of J such that g, $F'$ satisfy the conditions of Theorem 3.7. 

COROLLARY 3.9.⁷ Suppose that all the elements $\neq 0$ of the group J of finite 
rank have the same genus in J. Then J is completely reducible if, and only if, 
J/B is finite for every subgroup B which is generated by the rational multiples of 
the elements of a given greatest independent subset of J. 


Proof. Suppose first that J is a direct sum of a finite number of isomorphic 
rational groups, i.e., $J = \sum_{i=1}^{r(J)} J_i$, and that the subgroup B of J is generated 
by the rational multiples of the elements of a greatest independent subset G of J. 
Then every element $b \neq 0$ of B has the same genus s in B and in J. If therefore 
$J'_i$ is the intersection of B and $J_i$, then $J_i/J'_i$ is a finite cyclic group. The 
direct sum $J'$ of the groups $J'_i$ is a subgroup of B and $J/J'$ is a finite group, 
since r(J) is finite. Since J/B and $(J/J')/(B/J')$ are isomorphic, J/B is also 
a finite group. 

Suppose now that G is a greatest independent subset of J, that B is the subgroup 
of J generated by the rational multiples of the elements in G, and that 
J/B is finite. Let F be a set of elements, containing G and a complete set of 
representatives of the classes of J/B. Then F is finite and satisfies the assumptions 
of Corollary 3.8 and J is therefore completely reducible. 


7 A particular case of this proposition has been communicated to the author by Dr. G. 
Szekeres. 




4. Direct summands of finite rank and separable groups. 

DEFINITION 4.1. The group J is separable, if every finite subset of J is contained 
in a completely reducible direct summand of J. 

Note that the assumptions of (2.7) are satisfied for separable groups. 

THEOREM 4.2. J is a separable group if, and only if, 

(a) every finite subset of J is contained in a direct summand of finite rank; 

(b) every subgroup of finite rank which satisfies the conditions 3.1 and 3.2 is a 
direct summand of J. 

Proof. If H is the direct sum of the rational groups $H_v$, then every element 
of H is contained in the direct sum of a finite number of the groups $H_v$. Every 
finite subset and every subgroup of finite rank of a separable group is therefore 
contained in a completely reducible direct summand of finite rank. Hence 
every separable group satisfies the conditions (a) and (b) by Theorem 3.4. 

If the group J satisfies the conditions (a) and (b), every direct summand of 
finite rank is by (3.3) and Theorem 3.4 completely reducible and J is therefore 
by (a) separable. 

COROLLARY 4.3. If J is a separable group, every finite subset of J is contained 
in a completely reducible direct summand of finite rank and every direct summand 
of finite rank is completely reducible.⁸ 

COROLLARY 4.4. If all the elements $\neq 0$ of J have the same genus in J, then J 
is separable if, and only if, every closed subgroup of finite rank in J is a direct 
summand of J. 

This is a consequence of Theorem 4.2 and Corollary 3.6, since any n elements 
of J generate a closed subgroup of a rank $\le n$. 

COROLLARY 4.5. Denote by C the greatest complete subgroup of the group J. 
Then every closed subgroup of finite rank of J is a direct summand of J if, and 
only if, 

(a) J/C is separable; 

(b) all the elements # 0 of J/C have the same genus in J/C. 

Proof. C is, as a complete subgroup of J, a direct summand of J, i.e., $J = C + J'$. 
Suppose now that every closed subgroup of finite rank of J is a direct 
summand of J. Either $J'$ is of rank one and conditions (a) and (b) are obvious, 
or there exist two different rational closed subgroups S and T of $J'$. The closed 
subgroup U of $J'$, generated by S and T, is a direct summand of J and of $J'$, 
and S and T are direct summands of J and of U. By Corollary 2.9 either S 
and T or S and U/T are isomorphic. In the latter case $U = S' + T$, where 
S and $S'$ are isomorphic. If $s \neq 0$ is an element of $S'$, $t \neq 0$ an element of T, 
the closed subgroup of U, generated by $s + t$, is a direct summand of U and 
therefore $|s + t < U| = |S|$ or $= |T|$. Since the genus of $s + t$ is the g.c.d. 
of the genera of s and of t, this implies that either $|S| \le |T|$ or $|T| \le |S|$. 
Since S and T are not complete, and since the same argument applies to every 
8 The analogous theorem for primary abelian groups may be mentioned. The primary 
abelian group P does not contain elements of infinite height if, and only if, every finite 
subset of P is contained in a finite direct summand of P. 













element s + nt and ns + t with positive integer n, it follows from condition 3.1 
that S and T have the same genus, i.e., that J satisfies (a) and (b). 

Suppose now that J satisfies the conditions (a) and (b), that S is a closed 
subgroup of finite rank in J, and that $S'$ is its greatest complete subgroup. 
Then $S'$ is a direct summand of C, i.e., $C = C' + S'$ and $S = S'' + S'$, where 
$S''$ is the intersection of S and $J' + C'$. The subgroup $S^*$ of J/C which contains 
all those classes, containing elements of S, is exactly represented by $S''$ 
and by Corollary 4.4 is a direct summand of $J^* = J/C = S^* + T^*$. If T is the 
subgroup of $J'$, representing $T^*$, then $J = T + S'' + S' + C'$, i.e., S is a direct 
summand of J. 

LEMMA 4.6. If J is separable and S a direct summand of finite rank of J, then 
J/S is separable. 

Proof. If $F^*$ is a finite subset of $J^* = J/S$, F a subset of J, representing $F^*$, 
then J has by Corollary 4.3 a completely reducible direct summand D of finite 
rank which contains S and F. Since S is a direct summand of J, S is also a 
direct summand of D, i.e., $D = S + T$ and T is completely reducible by Corollary 
3.5. The subgroup $T^*$ of $J^*$, represented by T, is a completely reducible 
direct summand of $J^*$ and contains $F^*$, i.e., J/S is separable. 

THEOREM 4.7. Every countable separable group is completely reducible. 

Proof. Let $b_1, b_2, \dots, b_i, \dots$ be an enumeration of the elements of the 
countable separable group J. It follows by complete induction from Corollary 
4.3 that there exist completely reducible direct summands $J_i$ of J such that 
$r(J_i)$ is finite, $J_{i-1}$ is a direct summand of $J_i$, and the elements $b_j$ with $j \le i$ 
are contained in $J_i$. Then $J_i = S_i + J_{i-1}$, $S_i$ is completely reducible by 
Corollary 3.5 and J the direct sum of the completely reducible groups $J_1, S_2, 
S_3, \dots$, i.e., J is completely reducible. 


Chapter III. Types of elements and subgroups 


5. Types of elements in separable groups. 
DEFINITION 5.1. The element b of the group J is a primitive element of genus s 
(in J), if 

$$b \equiv 0 \bmod (J, s \le |x|), \qquad b \not\equiv 0 \bmod (J, s < |x|),$$

$$m(b < J) = m(b < J/(J, s < |x|)) = m(b < J(s)^*).$$

The subset F of J is primitive (in J), if F is finite, its elements are primitive 
elements in J and different elements of F have different genus in J. 

LEMMA 5.2. The finite subset F of the separable group J is a primitive set in J 
if, and only if, 

(a) different elements of F have different genus in J; 

(b) the closed subgroup $\bar{F}$ of J, generated by F, is a direct summand of J and 
F is a basis of $\bar{F}$. 

Proof. If the conditions (a) and (b) are satisfied, then every element in F 
is $\neq 0$ and generates a closed subgroup of J which is a direct summand of J, 
i.e., every element of F is primitive and F is therefore a primitive set. 
















If b is a primitive element in the separable group J, then the closed subgroup 
of J, generated by b, is by Theorem 4.2 a direct summand of J. If F is a primitive 
set in the separable group J, then F contains as a finite set an element b 
such that $|f| \not< |b|$ for every element f of F. Then $J = \bar{b} + J'$, where $\bar{b}$ 
is the closed subgroup of J, generated by b, and the elements $\neq b$ of F are all 
contained in $J'$. Since by Lemma 4.6 also $J'$ is separable, and since elements 
of $J'$ are primitive in $J'$ if, and only if, they are primitive in J, (b) follows by 
complete induction from the proved facts. 

COROLLARY 5.3. Suppose that the sets $b_1, \dots, b_k$ and $b'_1, \dots, b'_k$ are primitive 
sets in the separable group J. Then there exists a (proper) automorphism $\alpha$ of J 
such that $b_i\alpha = b'_i$ for every i if, and only if, $m(b_i < J) = m(b'_i < J)$ for every i. 

Proof. Since J is separable, there exists a completely reducible direct summand 
D of finite rank which contains all the elements $b_i$ and $b'_i$. By Lemma 5.2 

$$D = \sum_{i=1}^{k} \bar{b}_i + B = \sum_{i=1}^{k} \bar{b}'_i + B',$$

where $\bar{b}$ is the closed subgroup of D, generated by b. Since $b_i$ and $b'_i$ have the 
same multiplicity and therefore the same genus, and since D is a completely 
reducible group of finite rank, Theorem 2.8 and Corollary 2.9 imply that B 
and $B'$ are isomorphic. Since $\bar{b}_i$ and $\bar{b}'_i$ are isomorphic, and since $J = D + J'$, 
there exists an automorphism $\alpha$ of J such that $b_i\alpha = b'_i$ for every i, $u\alpha = u$ 
for u in $J'$, $B\alpha = B'$, and this proves the corollary. 

Another consequence of Lemma 5.2 is 

COROLLARY 5.4. If b is a primitive element of genus s in the separable group J, 
$|b| \neq |c|$ and $m(b < J) \mid m(c < J)$, then $b + c$ is also a primitive element of 
genus s. 

DEFINITION 5.5. s is a regular or singular genus in the group J according as 
1 < r(J(s)*). 

LEMMA 5.6. Suppose that b is a primitive element of genus s in the separable 
group J. 

(a) If s is singular, the primitive elements of genus s are exactly the elements 
$r(b + b')$, where $b' \equiv 0 \bmod (J, s < |x|)$, $m(b < J) \mid m(b' < J)$ and r is a 
rational number $\neq 0$ such that rb exists in J. 

(b) If s is regular, there exists a (primitive) element $b'$ of genus s in J such that 
b, $b'$ is a basis of a direct summand of J. 

Proof. If b is a primitive element of genus s in the separable group J, then 
$J = \bar{b} + J'$, where $\bar{b}$ is the closed subgroup of J generated by b (Lemma 5.2). 
If $b'$ is an element of $(J, s < |x|)$, then $b'$ is contained in $J'$, and if $m(b < J) \mid 
m(b' < J)$, then $m(b + b' < J) = m(b < J)$ and $b \equiv b + b' \bmod (J, s < |x|)$, 
i.e., $b + b'$ and every $r(b + b') \neq 0$ in J are primitive elements of genus s in J. 

If s is regular, then $J'$ contains an element $b''$ such that $b'' \equiv 0 \bmod (J', s \le 
|x|)$, $b'' \not\equiv 0 \bmod (J', s < |x|)$. Since $J'$ is by Lemma 4.6 also separable, 
$b''$ is contained in a completely reducible direct summand D of finite rank for 
$J'$ and D has a rational direct summand of genus s, since $(D, s \le |x|) \neq (D, s < 
|x|)$, and thus (b) is proved. 
















If finally s is singular and $b''$ a primitive element of genus s in J, then $b'' = 
rb + c$, where $rb \neq 0$ is an element of $\bar{b}$ and c an element of $J'$. Since $s = 
|b''| = |rb|$, it follows that $s \le |c|$, and since $(J', s \le |x|) = (J', s < |x|) \le 
(J, s < |x|)$, c is an element of $(J, s < |x|)$. Since $b'' \equiv rb \bmod (J, s < |x|)$, 
i.e., $m(b'' < J) = m(rb < J)$ as $b''$ is a primitive element of genus s, $m(rb < J) \mid 
m(c < J)$ and therefore $c' = r^{-1}c$ exists in J and $b'' = r(b + c')$ has the required 
form. 

We shall employ the notation 


$$m(b, s) = m(b < J, s) = m(b < J/(J, |x| \not< s)),$$

$$m(b, s+) = m(b < J, s+) = m(b < J/(J, |x| \not\le s)).$$


LEMMA 5.7. Suppose that $b_1, \dots, b_k$ is a primitive set in the separable group J 
and that $b = \sum_{i=1}^{k} b_i$. 

(a) If $s \neq |b_i|$ for every i, then $m(b, s) = m(b, s+)$. 

(b) If $|b_j| \not< |b_i| = s$ for every j, then 

$$m(b < J, s) = \infty, \qquad m(b < J, s+) = m(b_i < J).$$

(c) If $|b_j| < |b_i| = s$ for some j, then 

$$m(b < J, s) = \text{g.c.d. of all } m(b_j < J) \text{ with } |b_j| < s;$$

$$m(b < J, s+) = \text{g.c.d. of all } m(b_j < J) \text{ with } |b_j| \le s;$$

$$m(b < J, s) \neq \infty, \qquad m(b < J, s+) \mid m(b < J, s)$$

and $q(b, s) = m(b < J, s) : m(b < J, s+)$ is an ordinary positive integer; $m(b_i < J)$ 
and $m(b < J, s+)$ have the same finite p-value for every prime divisor p of q(b, s). 
Proof. If s is any genus, then b = >> bs = D> db; mod (J, |x| <£ 8). 


s<jbj ss/)i| 

Since the elements $b_i$ form a primitive set in the separable group J, it follows 
from Lemma 5.2 that m(b, s) and m(b, s+) have the values given in the lemma. 
The other statements are consequences of these evaluations and of the fact that 
$|u| < |v|$ implies the existence of a positive integer h such that $m(u < J) \mid 
h\,m(v < J)$. 

Notations. If $b \neq 0$ is any element of the separable group J, s a genus such 
that $m(b < J, s) \neq \infty$, q(b, s) therefore an ordinary positive integer, and g any 
(generalized) number, then 

$(g, b, s)^*$ is the greatest divisor of g which is relatively prime to q(b, s); 

$(g, b, s)^{**}$ = g.c.d. of $(g, b, s)^*$ and $(m(b, s), b, s)^*$;⁹ 

$(g, b, s) = (g, b, s)^* : (g, b, s)^{**}$. 


9 $(g, b, s)^{**}$ = [g.c.d. of $(g, b, s)^*$ and m(b, s)] = [g.c.d. of $(g, b, s)^*$ and m(b, s+)] by 
Lemma 5.7. 










Note that by Lemma 5.7 
(m(b, s), b, s)* = (m(b, s+), b, s)*. 


LEMMA 5.8. Suppose that, for every genus s, r(s) is a number of genus s. Then 
there exists corresponding to every element $b \neq 0$ of the separable group J a primitive 
set $b_1, \dots, b_k$ in J such that 

(a) $b = \sum_{i=1}^{k} b_i$; 

(b) $m(b < J, s) \neq m(b < J, s+)$ if, and only if, (exactly) one of the elements 
$b_i$ is of genus s; 

(c) $m(b < J, s) = \infty$, $m(b < J, s+) = m(b_i < J)$, if $|b_i| = s$ and $m(b < 
J, t) = m(b < J, t+)\ (= \infty)$ for $t < s$. 

(d) If s is a regular genus, $|b_i| = s$ and $m(b < J, s) \neq \infty$, then 

$$m(b_i < J) = m(b, s+)(r(s), b, s).^{10}$$

(e) If s is a singular genus, $|b_i| = s$ and $m(b < J, s) \neq \infty$, then 

$$m(b_i < J) = m(b, s+)(r(s), b, s)h_i,$$

where $h_i$ is an ordinary positive integer, relatively prime to q(b, s) and chosen at 
random in its class $H(h_i, b, s) = H(b_1, \dots, b_k, b, s)$, where H(h, b, s) denotes 
the smallest class of ordinary integers which contains h, complete classes of residues 
mod q(b, s) and with n also every $nn'$ for $n' \mid r(s)_\infty$, $n'$ an ordinary integer. 

Proof. 1. Since J is a separable group, every element $b \neq 0$ of J is contained 
in a completely reducible direct summand D of J. There exists a smallest 
partial reduction of D and, if b(s) is the component of b in the summand, containing 
the elements of genus s, the elements $b(s) \neq 0$ are primitive elements of 
genus s and form therefore a primitive set (in D and in J) whose sum is b, i.e., 
to every element $b \neq 0$ of J there exists a primitive set in J whose sum is b. 

2. If the elements $w, b_1, \dots, b_k$ form a primitive set in J whose sum is b, 
if $|w| = s$ and $m(b, s) = m(b, s+)$, it follows from Lemma 5.7 that $m(b, s) \neq \infty$; 
that there exist elements $b_j$, say $b_1, \dots, b_h$, such that $|b_j| < s$ if, and only if, 
$1 \le j \le h$, $0 < h \le k$; that 

$$c_j = m(b_j) : \{\text{g.c.d. } m(b_j), m(w)\}$$

is for $1 \le j \le h$ an ordinary positive integer; and that 1 is the g.c.d. of $c_1, \dots, c_h$. 
There exist therefore integers $c'_j$ such that $\sum_{j=1}^{h} c'_j c_j = 1$ and a primitive set in J 
is defined by 

$$b'_i = b_i + c'_i c_i w \quad (1 \le i \le h),$$

$$b'_i = b_i \quad (h < i),$$


10 $m(b, s+)(r(s), b, s)$ = l.c.m. of m(b, s+) and $(r(s), b, s)^*$ = l.c.m. of m(b, s+) and of 
all divisors of r(s) which are relatively prime to q(b, s). 




since $|b_i| < |w|$ and $m(b_i) \mid m(c_i' w) = c_i' m(w)$ for $1 \le i \le h$, i.e., $b_i$ and $b_i'$
are primitive elements of the same genus and the same multiplicity in $J$ (Corollary
5.4). Since the sum of this new primitive set is $b$, and since this new primitive
set contains fewer elements than the old, we have the following result.
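
The step used above, that integers whose g.c.d. is 1 admit integer coefficients combining them to 1, can be made explicit with the extended Euclidean algorithm. The following is only a sketch for ordinary integers; the function names `ext_gcd` and `bezout_coefficients` are illustrative and do not occur in the paper.

```python
def ext_gcd(a, b):
    """Return (g, x, y) with a*x + b*y == g == gcd(a, b) for integers a, b >= 0."""
    old_r, r = a, b
    old_x, x = 1, 0
    old_y, y = 0, 1
    while r != 0:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_x, x = x, old_x - q * x
        old_y, y = y, old_y - q * y
    return old_r, old_x, old_y


def bezout_coefficients(values):
    """Return (g, coeffs) with sum(c * v for c, v in zip(coeffs, values)) == g,
    where g is the g.c.d. of all (positive) integers in `values`."""
    g, coeffs = values[0], [1]
    for v in values[1:]:
        g, x, y = ext_gcd(g, v)
        # g_new = g_old * x + v * y, so rescale the earlier coefficients by x
        coeffs = [c * x for c in coeffs] + [y]
    return g, coeffs
```

For instance, `bezout_coefficients([6, 10, 15])` returns g.c.d. 1 together with one admissible choice of coefficients; the choice is not unique.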

A primitive set in $J$ with sum $b$ satisfies the condition (b) if, and only if, it
contains as few elements as possible. There exist therefore to every element
$b \neq 0$ in $J$ primitive sets in $J$ which satisfy (a) and (b).

3. Suppose now that the primitive set $b_1, \dots, b_k$ satisfies the conditions
(a) and (b), that $s$ is the genus of $b_i$, and that $m(b < J, s) \neq \infty$. Then some
of the elements $b_j$ satisfy $|b_j| < s$ and it can be assumed that exactly the
elements $b_j$ with $1 \le j \le h$, $1 \le h$ satisfy this condition.

$(m(b, s), b, s)^* = (m(b, s+), b, s)^*$ is by Lemma 5.7 the g.c.d. of the numbers
$(m(b_j), b, s)^*$ with $1 \le j \le h$ and therefore a true divisor of $(m(b_i), b, s)^*$. Then

$$d_i = (m(b_i), b, s)^* : (m(b, s), b, s)^*$$

is relatively prime to $q(b, s)$.
The numbers

$$c_j' = m(b_j) : \{\text{g.c.d. } m(b_j), m(b_i)\} \quad (1 \le j \le h)$$

are ordinary positive integers whose g.c.d. is $q(b, s)$.
Suppose now that $w$ is an ordinary positive integer, dividing the finite part
$(d_i)_f$ of $d_i$. Then there exists a decomposition

$$w = \prod_{j=1}^{h} w_j$$

such that $w_j$ and $w_{j'}$ are relatively prime for $j \neq j'$ and such that
$w_j \mid m(b_i) : \{\text{g.c.d. } m(b_i), m(b_j)\}$, since $w$ is relatively prime to $q(b, s)$ and since
$m(b, s+)$ is the g.c.d. of the numbers g.c.d. $(m(b_i), m(b_j))$ with $1 \le j \le h$.
Then $c_j'$ and $w_j$ are relatively prime integers and the g.c.d. of the numbers
$c_j' w w_j^{-1}$ with $1 \le j \le h$ is exactly $q(b, s)$. There exist therefore integers $c_j$
such that

$$q(b, s) = \sum_{j=1}^{h} c_j c_j' (w w_j^{-1}) \quad \text{or} \quad q(b, s) w^{-1} = \sum_{j=1}^{h} c_j c_j' w_j^{-1} \quad (\not\equiv 0 \bmod 1).$$


Since $|b_j| < s$ and $w_j m(b_j) \mid c_j' m(b_i)$ for $1 \le j \le h$, a primitive set is defined by

$$b_j' = b_j \quad (h < j \neq i),$$
$$b_j' = b_j + n c_j c_j' w_j^{-1} b_i \quad (1 \le j \le h),$$
$$b_j' = (1 - n q(b, s) w^{-1}) b_i \quad (j = i),$$

where $n$ is any given integer. This primitive set satisfies (a), (b), and

$$m(b_j') = m(b_j) \quad (j \neq i),$$
$$m(b_j') = |w - n q(b, s)|\, w^{-1} m(b_i) \quad (j = i).$$







Since it is possible to apply this construction¹¹ upon several indices $i$ without
changing the effect on the other indices, we have the following result.

(5.8.3) Suppose that the elements $b_i$ form a primitive set, satisfying (a) and (b)
and that for $m(b, |b_i|) \neq \infty$ $n_i$ is any integer and $w_i \mid d_i = (m(b_i), b, |b_i|)^* :
(m(b, |b_i|), b, |b_i|)^*$. Then there exists a primitive set $b_i'$ which satisfies (a)
and (b) and

$$m(b_i') = m(b_i), \quad \text{if } m(b, |b_i|) = \infty,$$
$$m(b_i') = |w_i - n_i q(b, |b_i|)|\, w_i^{-1} m(b_i), \quad \text{if } m(b, |b_i|) \neq \infty.$$


4. Suppose that $b_1, \dots, b_k$ is a primitive set, satisfying (a) and (b), and
that the indices $i$, $h$ and the numbers $w$, $w_j$, $c_j$, $c_j'$ for $1 \le j \le h$ and $d_i$ have
the same meaning as during the proof of (5.8.3). Suppose furthermore that the
genus $s$ of $b_i$ is regular for the group $J$. Then there exists by Lemma 5.6b an
element $e$ in $J$ such that $b_i, e$ is a basis of a direct summand of $J$ and satisfies
$m(b_i < J) = m(e < J)$. Since $|b_j| < s$ and $w_j m(b_j) \mid c_j' m(b_i) = c_j' m(e)$, and
since $e$ is a primitive element of genus $s$ in $J$, it follows from Lemma 5.2 and
Corollary 5.3 that a primitive set in $J$ is defined by

$$b_j' = b_j \quad (h < j \neq i),$$
$$b_j' = b_j + c_j c_j' w_j^{-1} e \quad (1 \le j \le h),$$
$$b_j' = b_j - w^{-1} q(b, s) e \quad (j = i).$$

This primitive set satisfies (a), (b) and

$$m(b_j') = m(b_j) \quad (j \neq i),$$
$$m(b_j') = [\text{g.c.d. of } m(b_i) \text{ and } w^{-1} q(b, s) m(e)] = w^{-1} m(b_i) \quad (j = i),$$

since $m(b_i) = m(e)$ and since $w$ and $q(b, s)$ are relatively prime.

Since it is possible to apply this construction upon several indices $i$ without
changing the effect on the other indices, we have the following result.

(5.8.4) Suppose that the elements $b_1, \dots, b_k$ form a primitive set which satisfies
(a) and (b), and for every $i$ such that $m(b, |b_i|) \neq \infty$ and $|b_i|$ is a regular
genus for $J$ the integer $w_i \mid d_i = (m(b_i), b, |b_i|)^* : (m(b, |b_i|), b, |b_i|)^*$. Then
there exists a primitive set $b_1', \dots, b_k'$ which satisfies (a) and (b) and $m(b_i') =
m(b_i)$, if $m(b, |b_i|) = \infty$ or $|b_i|$ is a singular genus for $J$, or $m(b_i') = w_i^{-1} m(b_i)$,
if $m(b, |b_i|) \neq \infty$ and $|b_i|$ is a regular genus.

5. Suppose that $b_1, \dots, b_k$ is a primitive set which satisfies (a) and (b)
and that $m(b, |b_i|) \neq \infty$ if, and only if, $1 \le i \le z$, $0 \le z \le k$. Since $r_i =
m(b, |b_i|+)(r(|b_i|), b, |b_i|)$ has the genus $|b_i|$ for $1 \le i \le z$, there exists an
ordinary positive integer $n_i$ which is relatively prime to $q(b, |b_i|)$ such that
$r_i \mid n_i m(b_i)$ and there exist integers $n_i'$, $n_i''$ such that $1 = n_i' q(b, |b_i|) + n_i'' n_i$.
There exists therefore by (5.8.3) a primitive set $b_1', \dots, b_k'$ which satisfies (a)

¹¹ Since $m(rb_i) = m(b_i)$, if numerator and denominator of $r$ are divisors of the infinite
part of $m(b_i)$, it suffices for this construction to assume that $w$ is a divisor of $d_i$.













and (b) and

$$m(b_i') = m(b_i) \quad (z < i),$$
$$m(b_i') = |n_i n_i''|\, m(b_i) \quad (1 \le i \le z).$$


Thus it follows from Lemma 5.7 that

(5.8.5) There exists a primitive set $b_1, \dots, b_k$ which satisfies (a), (b), (c) and
the additional condition (d*).

(d*) If $m(b, |b_i|) \neq \infty$, then

$$m(b_i) = m(b, |b_i|+)(r(|b_i|), b, |b_i|) h_i,$$

where $h_i$ is an ordinary positive integer which is relatively prime to $q(b, |b_i|)$.

6. Since the numbers $h_i$ in (5.8.5d*) satisfy the condition imposed on the
numbers $w_i$ in (5.8.3), it follows that these numbers $h_i$ can be chosen at random
in their class of residues mod $q(b, |b_i|)$; and since $m(b_i) = p\,m(b_i)$, if $p$ is a
prime number which divides the infinite part $r(|b_i|)_\infty$ of $m(b_i)$, it follows
that $h_i$ can be chosen at random in its class $H(h_i, b, |b_i|)$.

7. If $m(b, |b_i|) \neq \infty$ and $|b_i|$ is a regular genus, then the numbers $h_i$ in
(5.8.5d*) satisfy the condition imposed upon the numbers $w_i$ in (5.8.4), and the
existence of a primitive set which satisfies (a) to (e) follows therefore from
(5.8.5) and the facts proved in heading 6.

Let $s$ be a singular genus for the separable group $J$; $n$ an ordinary positive
integer which is relatively prime to the infinite part of the numbers of genus $s$;
$Y = Y(J, n, s)$ the subgroup of $J$ generated by $nJ$ and $(J, |x| \nleq s)$; and
$Z = Z(J, n, s)$ the group of those classes of $J/Y(J, n, s)$ which contain elements
of $(J, |x| \nless s)$.

Since $J$ is a separable group and $s$ a singular genus, it follows from Theorem
2.8 that there exists a rational subgroup $R$ of genus $s$ and a subgroup $J'$ such
that

$$J = R + J', \quad (J, |x| \nless s) = R + (J', |x| \nleq s), \quad (J, s \le |x|) = R + (J', s < |x|),$$
$$(J', |x| \nless s) = (J', |x| \nleq s), \quad (J', s < |x|) = (J', s \le |x|).$$

Since every class of $Z(J, n, s)$ contains elements of $R$, i.e., primitive elements
of genus $s$ and since every primitive element of genus $s$ is contained in a class of
$Z$, it follows that $Z(J, n, s)$ and $R/nR$ are isomorphic, i.e., since $n$ is relatively
prime to the infinite part of the numbers of genus $s$, $Z(J, n, s)$ is a cyclic group
of the finite order $n$.
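
The last assertion, that $R/nR$ is cyclic of order $n$, can be checked by hand in a simple special case. The sketch below assumes (this is an illustration, not the general situation of the text) that $R$ is the group of rationals whose denominators are powers of one fixed prime $p$, and that $n$ is prime to $p$. The function name `residue` is hypothetical.

```python
from fractions import Fraction
from math import gcd


def residue(x, p, n):
    """Class of x in R/nR as an integer in range(n).

    Assumption for this sketch: R = Z[1/p] (denominators are powers of p)
    and gcd(n, p) == 1, so that p is invertible modulo n.
    """
    if gcd(n, p) != 1:
        raise ValueError("n must be relatively prime to p")
    x = Fraction(x)
    a, d = x.numerator, x.denominator
    k = 0
    while d % p == 0:      # strip the power of p from the denominator
        d //= p
        k += 1
    if d != 1:
        raise ValueError("x does not lie in Z[1/p]")
    inv_p = pow(p, -1, n)  # inverse of p modulo n (Python 3.8+)
    return (a * pow(inv_p, k, n)) % n
```

With $p = 2$ and $n = 15$, the element $1/2$ falls into the class of 8, since $1/2 - 8 = 15 \cdot (-1/2)$ lies in $nR$; the integers $0, \dots, 14$ give 15 distinct classes and the class of 1 generates all of them.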

$nJ$, $(J, |x| \nleq s)$ and $(J, |x| \nless s)$ are characteristic subgroups of $J$, and
every (proper) automorphism of $J$ induces therefore an automorphism of
$Z(J, n, s)$. Two elements $z$ and $z'$ of $Z$ are called conjugate, if there exists an
automorphism of $J$ which maps $z$ upon $z'$. The elements of $Z$ are thus distributed
into classes of conjugate elements.

Lemma 5.9. Suppose that $s$ is a singular genus of the separable group $J$ and
that the positive integer $n$ is relatively prime to the infinite part of the numbers of
genus $s$.










(a) The elements $z$ and $z'$ of $Z(J, n, s)$ are conjugate if, and only if, there exist
positive integers $h$, $h'$ which are divisors of the infinite part of the numbers of genus $s$
and which satisfy $hz = h'z'$.

(b) The elements $z$ and $z'$ of $Z(J, n, s)$ are conjugate if, and only if, the primitive
elements of genus $s$, representing $z$, have the same multiplicities as those
representing $z'$.

(c) The primitive elements $b$ and $b'$ of genus $s$ represent conjugate elements of
$Z(J, n, s)$ if, and only if,

(c') $b$ and $b'$ have the same order $q$ mod $Y(J, n, s)$;

(c'') there exist ordinary integers $k$ and $k'$ which are relatively prime to $q$ and
satisfy

$$k \equiv k' \bmod q, \quad m(kb < J) = m(k'b' < J).$$


Proof. If $h$, $h'$ are ordinary positive integers, dividing the infinite part of the
numbers of genus $s$, and if $z$ and $z'$ are elements of $Z(J, n, s)$ such that $hz =
h'z'$, then $z'$ contains a primitive element $b'$ of genus $s$ and the primitive element
$b = h^{-1}h'b'$ of genus $s$ in $z$ satisfies $m(b) = m(b')$, i.e., $z$ and $z'$ are represented
by primitive elements of the same multiplicity.

If $z$ and $z'$ are elements of $Z$ which are represented by primitive elements
$b$ and $b'$, respectively, such that $m(b) = m(b')$, there exists by Corollary 5.3 an
automorphism of $J$ which maps $b$ upon $b'$ and therefore $z$ upon $z'$, i.e., $z$ and $z'$
are conjugate.

Suppose that $z$ and $z'$ are conjugate and that the primitive elements $b$ and $b'$
of genus $s$ represent $z$ and $z'$ respectively. Then there exists an automorphism
of $J$ which maps $z'$ upon $z$ and therefore $b'$ upon the element $b''$ representing $z$.
Moreover, $m(b') = m(b'')$. Since $b$, $b''$ are primitive elements of the singular
genus $s$, it follows from Lemma 5.6a that

$$b'' = r(b + c)$$

where $m(b'') = m(rb)$, $rb$ exists in $J$ and $c \equiv 0 \bmod (J, s < |x|)$. Since $b'' \equiv b$
mod $Y(J, n, s)$ and $rb \equiv r(b + c)$ mod $Y$, this implies $b \equiv rb$ mod $Y$ and therefore
$b \equiv rb$ mod $nJ$, $b \equiv rb$ mod $n\bar{b}$, where $\bar{b}$ is the closed subgroup of $J$
generated by $b$. The order $q$ of $b$ mod $Y$ is $nn'^{-1}$ where $n'$ is the g.c.d. of $n$ and $m(b)$.
Since $b$ and $rb$ have the same order mod $Y$, $n'$ is also the g.c.d. of $n$ and $m(rb)$
and therefore $n'^{-1}(r - 1)b \equiv 0 \bmod q\bar{b}$, $q$ and $m(n'^{-1}b)$ are relatively prime. If
$r = r'r''^{-1}$, where $r'$ and $r''$ are relatively prime integers, then $r''$ is relatively
prime to $q$ (as a divisor of $m(n'^{-1}b)$) and $r' \equiv r''$ mod $q$ and furthermore $m(r'b) =
m(r''b')$, i.e., $b$ and $b'$ satisfy the conditions (c') and (c'').

Suppose now that the primitive elements $b$ and $b'$ of genus $s$ satisfy the
conditions (c') and (c''). Since $k$ is relatively prime to $q$, there exists an integer
$k''$ such that $kk'' \equiv 1$ mod $q$ and since $b \equiv b' \equiv 0$ mod $nq^{-1}J$, it follows that
$b \equiv kk''b \equiv k'k''b$ mod $Y(J, n, s)$, $b' \equiv kk''b' \equiv k'k''b'$ mod $Y$, and since $m(kb) =
m(k'b')$, this implies that $kb$ and $k'b'$ and therefore $b$ and $b'$ represent conjugate
elements of $Z$.

If z and z’ are conjugate elements of Z, b a primitive element of genus s, 




representing $z$, then there exist relatively prime integers $h$, $h'$ such that $m(b) =
m(hh'^{-1}b)$ and $hh'^{-1}b$ represents $z'$. Then $h$ and $h'$ divide the infinite part of the
numbers of genus $s$, and $hz = h'z'$. This completes the proof of the lemma.

Notation. If $b \neq 0$ is an element of the separable group $J$, $s$ a singular genus
of $J$ such that $\infty \neq m(b, s) \neq m(b, s+)$, then $n(b, s) = m(b, s) : (m(b, s), b, s)^*$.

COROLLARY 5.10. (a) If $b \neq 0$ is an element of the separable group $J$, $s$ a
singular genus such that $\infty \neq m(b, s) \neq m(b, s+)$, then $b$ represents a certain
element of order $q(b, s)$ of $Z(J, n(b, s), s)$ and therefore a class $C(b, s)$ of conjugate
elements of $Z(J, n(b, s), s)$.

(b) Suppose that, for every genus $t$, $r(t)$ is a number of genus $t$ and that the
elements $b$ and $b'$ of the separable group $J$ satisfy $\infty \neq m(b, s) = m(b', s) \neq m(b, s+)
= m(b', s+)$ for a certain singular genus $s$. Then $n(b, s) = n(b', s)$ and $q(b, s) =
q(b', s)$. Moreover, $C(b, s) = C(b', s)$ if, and only if, $H(b_1, \dots, b_k, b, s) =
H(b_1', \dots, b_{k'}', b', s)$, where $b_1, \dots, b_k$ and $b_1', \dots, b_{k'}'$ are primitive sets in $J$ with
sum $b$ and $b'$ respectively which satisfy the conditions (b) to (e) of Lemma 5.8.
$H(b_1, \dots, b_k, b, s) = H(b, s)$ depends therefore only on $b$, $s$, and the numbers $r(t)$,
representing the genera $t$, but not on the particular representation of $b$ by a
"canonical" primitive set.

Proof. If $b \neq 0$ is an element of the separable group $J$, then there exists a
primitive set $b_1, \dots, b_k$, satisfying the conditions (a) to (e) of Lemma 5.8. If
$s$ is a singular genus such that $\infty \neq m(b, s) \neq m(b, s+)$, then exactly one of the
elements $b_i$, say $b_1$, has the genus $s$ in $J$. Then $b \equiv b_1$ mod $Y(J, n(b, s), s)$ and
now the corollary is a consequence of Lemma 5.8 and Lemma 5.9.

DEFINITION 5.11. The subsets $S$ and $T$ of the group $J$ are isotype (have the same
type in $J$) if there exists a (proper) automorphism of $J$ which maps $S$ upon $T$.

THEOREM 5.12. The elements $b$ and $b'$ of the separable group $J$ are isotype in $J$
if, and only if,¹²

(a) $m(b, s) = m(b', s)$ and $m(b, s+) = m(b', s+)$ for every genus $s$;

(b) $C(b, s) = C(b', s)$ for every singular genus $s$ such that $\infty \neq m(b, s) =
m(b', s) \neq m(b, s+) = m(b', s+)$ and therefore $n(b, s) = n(b', s)$, $q(b, s) =
q(b', s)$.

Proof. $m(b, s)$, $m(b, s+)$ and $C(b, s)$ are invariants of the element $b$ in the
group $J$, since for their definitions only characteristic subgroups of $J$ have been
used. If the conditions (a) and (b) are satisfied by the elements $b$ and $b'$,
there exists by Lemma 5.8 and by Corollary 5.10 a primitive set $b_1, \dots, b_k$ with
sum $b$ and a primitive set $b_1', \dots, b_k'$ with sum $b'$ in $J$ such that $m(b_i < J) =
m(b_i' < J)$ for $1 \le i \le k$. There exists therefore by Corollary 5.3 an
automorphism $\alpha$ of $J$ such that $b_i\alpha = b_i'$, i.e., such that $b\alpha = b'$.

COROLLARY 5.13. Suppose that $b_i$ is an element of the separable group $J_i$ and
that $J_1$ and $J_2$ are isomorphic. Then there exists an isomorphism of $J_1$ upon $J_2$
which maps $b_1$ upon $b_2$ if, and only if,

(a) $m(b_1 < J_1, s) = m(b_2 < J_2, s)$ and $m(b_1 < J_1, s+) = m(b_2 < J_2, s+)$
for every genus $s$;

¹² A reason for the so different nature of the invariants $m(b, s)$, $m(b, s+)$ on the one
hand and the invariants $C(b, s)$ on the other will be given in Corollary 6.11.










(b) the primitive elements of genus $s$, representing $C(b_1, s)$, have the same
multiplicities as those representing $C(b_2, s)$ for every singular genus $s$ of $J_1$ and of $J_2$
such that

$$\infty \neq m(b_i < J_i, s) \neq m(b_i < J_i, s+).$$

For an isomorphism of $J_2$ upon $J_1$ maps $b_2$ upon an element $b_2'$ such that
$b_1$ and $b_2'$ satisfy in $J_1$ the conditions (a) and (b) of Theorem 5.12 if, and only if,
the conditions (a) and (b) of the Corollary 5.13 are satisfied. This follows from
Lemma 5.9 and Corollary 5.10.

COROLLARY 5.14. The elements $b$ and $b'$ of the separable group $J$ satisfy the
condition (a) of Theorem 5.12 if, and only if,

(1) $m(b, s) \neq m(b, s+)$ if, and only if, $m(b', s) \neq m(b', s+)$;

(2) $\infty \neq m(b, s) \neq m(b, s+)$ if, and only if, $\infty \neq m(b', s) \neq m(b', s+)$;

(3) $m(b, s+) = m(b', s+)$, if $m(b, s) \neq m(b, s+)$.

Or (3) can be replaced by

(3') $m(b, s+) = m(b', s+)$, if $\infty = m(b, s) \neq m(b, s+)$;

(3'') $q(b, s) = q(b', s)$, if $\infty \neq m(b, s) \neq m(b, s+)$.

This follows from Lemma 5.7 and Lemma 5.8, since there exists but a finite
number of genera $s$ such that $m(b, s) \neq m(b, s+)$ and since $m(b, s+) \neq m(b, s)
\neq \infty$ implies that $m(b, s)$ is the g.c.d. of those $m(b, t+)$ with $t < s$ and $m(b, t) \neq
m(b, t+)$.


6. Characteristic subgroups of separable groups. 

DEFINITION 6.1. The subgroup $S$ of the group $J$ is a characteristic subgroup of
$J$, if $S$ is mapped upon itself by every proper automorphism of $J$. $S$ is regular, if
every proper or improper automorphism of $J$ maps $S$ upon a subgroup of $S$. A
characteristic subgroup of $J$ which is not regular is singular.

Regular subgroups are usually called strictly characteristic, singular ones 
characteristic, but not strictly characteristic. The above notation has been 
adopted for brevity. 

$nJ$ and $(J, s \le |x|)$ are examples of regular subgroups. Intersection and
join of characteristic or regular subgroups are characteristic or regular subgroups.

Lemma 6.2. Suppose that $b \neq 0$ is an element of the characteristic subgroup $S$
of the separable group $J$ and that the genus $s$ satisfies $m(b, s) \neq m(b, s+)$. Then
the element $w$ of $J$ is contained in $S$, if one of the following conditions is satisfied:

1. $s < |w|$, $m(b, s+) \mid m(w)$.

2. $s = |w|$ and $2m(b, s+) \mid m(w)$

or $m(b, s+) \mid m(w)$ and $S$ is regular
or $s$ is regular
or $m(b, s) \neq \infty$ and $q(b, s)$ is odd.

Two particular instances of this lemma will be needed for its proof.
(6.2.1) Suppose that the characteristic subgroup $S$ of the separable group $J$ contains
the sum $b$ of the primitive set $b_1, \dots, b_k$ in $J$.

(a) $2b_i$ is contained in $S$.

(b) $b_i$ is contained in $S$, if $S$ is regular or $|b_i|$ is regular.










Proof. By Lemma 5.2 there exists a subgroup $J'$ such that $J = \sum_{i=1}^{k} \bar{b}_i + J'$,
where $\bar{b}_i$ is the closed subgroup of $J$, generated by $b_i$. A proper automorphism
$\alpha$ of $J$ is therefore defined by $b_j\alpha = b_j$ for $j \neq i$, $u\alpha = u$ for $u$ in $J'$, $b_j\alpha =
-b_j$ for $j = i$, and an improper automorphism $\beta$ is defined by $b_j\beta = b_j$ for $j \neq i$,
$u\beta = u$ for $u$ in $J'$, $b_j\beta = 0$ for $j = i$. Then $b - b\alpha = 2b_i$ is contained in $S$.
If $S$ is regular, then $b - b\beta = b_i$ is contained in $S$. If finally $|b_i|$ is regular,
there exists by Lemma 5.6b an element $w$ in $J'$ such that $m(w < J) = m(b_i < J)$
and $J' = \bar{w} + J''$, where $\bar{w}$ is the closed subgroup of $J$ generated by $w$. Then
a proper automorphism $\gamma$ of $J$ is defined by $b_j\gamma = b_j$ for $j \neq i$, $u\gamma = u$ for $u$ in
$J'$, $b_j\gamma = b_i + w$ for $j = i$, and therefore $b - b\gamma = -w$ is contained in $S$. But
since $-w$ and $b_i$ are primitive elements of the same genus and the same
multiplicity in $J$, it follows from Corollary 5.3 that $b_i$ is contained in $S$.
(6.2.2) If the characteristic subgroup $S$ of the separable group $J$ contains the
primitive element $b$, then $S$ contains every element $c$ such that $m(b) \mid m(c)$.
Proof. $c$ is by Lemma 5.8 the sum of a primitive set $c_1, \dots, c_k$ and Lemma
5.7 implies $m(b) \mid m(c) \mid m(c_i)$. If $|c_i| = |b|$, then there exists an ordinary
positive integer $h$ such that $hm(b) = m(hb) = m(c_i)$. Since $hb$ is an element of
$S$, it follows from Corollary 5.3 that $c_i$ is contained in $S$. If $|b| < |c_i|$, then
it follows from Corollary 5.4 that $b + c_i$ is a primitive element of the same genus
and the same multiplicity as $b$ and it follows therefore from Corollary 5.3 that
$b + c_i$ and consequently $c_i$ are elements of $S$.
(6.2.3) If the assumptions:

$b_1, \dots, b_k$ is a primitive set in the separable group $J$, complying with the
conditions of Lemma 5.8;

$b = \sum_{i=1}^{k} b_i$ is an element of the characteristic subgroup $S$ of $J$; the genus $s$ of $b_1$
is singular for $J$;
$\infty \neq m(b, s) \neq m(b, s+)$;
$m(b_1 < J) = m(b, s+)fh$, where $h$ is an ordinary positive integer which is
relatively prime to $q(b, s)$;
are satisfied, then $S$ contains $q(b, s)h^{-1}b_1$ and there exists an integer $h'$ which is
relatively prime to $q(b, s)$ such that $h^{-1}b_1 + h' \sum_{i=2}^{k} b_i$ is contained in $S$.


Proof. By Lemma 5.8 there exists a primitive set $b_1', \dots, b_k'$ with sum $b$
in $J$ such that

$$m(b_i') = m(b_i) \quad (i \neq 1),$$
$$m(b_i') = m(b, s+)f(h + q(b, s)) \quad (i = 1).$$

Corollary 5.3 implies therefore that $(h + q(b, s))h^{-1}b_1 + \sum_{i=2}^{k} b_i$ and consequently
$q(b, s)h^{-1}b_1$ are elements of $S$. Since $h$ and $q(b, s)$ are relatively prime, there





exist integers $h'$, $q'$ such that $hh' + q'q(b, s) = 1$ and

$$h^{-1}b_1 + h' \sum_{i=2}^{k} b_i = h'b + q'q(b, s)h^{-1}b_1$$

is an element of $S$.
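
The final identity of this proof is a coefficient comparison and can be verified mechanically. The sketch below treats $b_1, \dots, b_k$ as formally independent, represents each side by its list of rational coefficients, and obtains $h'$ as a modular inverse; the names `bezout_pair` and `both_sides` are illustrative only.

```python
from fractions import Fraction


def bezout_pair(h, q):
    """Return integers (h1, q1) with h*h1 + q1*q == 1.

    Assumes q >= 1 and gcd(h, q) == 1; pow raises ValueError otherwise.
    """
    h1 = pow(h, -1, q)        # inverse of h modulo q (Python 3.8+)
    q1 = (1 - h * h1) // q    # exact, because h*h1 is congruent to 1 mod q
    return h1, q1


def both_sides(h, q, k):
    """Coefficient vectors (over b_1, ..., b_k) of the two sides of
    h^{-1} b_1 + h' (b_2 + ... + b_k) = h' b + q' q h^{-1} b_1."""
    h1, q1 = bezout_pair(h, q)
    left = [Fraction(1, h)] + [Fraction(h1)] * (k - 1)
    right = [Fraction(h1)] * k          # h' * b, with b = b_1 + ... + b_k
    right[0] += Fraction(q1 * q, h)     # + q' q h^{-1} b_1
    return left, right
```

Since $hh' \equiv 1$ mod $q(b, s)$, the integer $h'$ found this way is automatically relatively prime to $q(b, s)$, as the statement of (6.2.3) requires.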

Proof of Lemma 6.2. Suppose that $t$ is the genus of $w$. Then it is possible
to choose the number $r(u)$ of genus $u$ in such a way that $r(u) \mid m(w < J)$ for
$u < t$. There exists by Lemma 5.8 and (6.2.3) a primitive set $b_1', \dots, b_k'$ in $J$
whose sum $b'$ is an element of $S$ such that $s$ is the genus of $b_1'$ and $m(b_1' < J) =
m(b, s+)$, if $m(b, s) = \infty$, $m(b_1' < J) = m(b, s+)(r(s), b, s)$, if $m(b, s) \neq \infty$.

If $s < t$ and $m(b, s+) \mid m(w)$, then $m(b_1') \mid m(w)$ and $b_1' + w$ is therefore by
Corollary 5.4 a primitive element of genus $s$, satisfying $m(b_1') = m(b_1' + w)$.
Corollary 5.3 implies therefore that $b' + w$ and consequently $w$ are elements of $S$.

If $s$ is regular or $S$ is regular, then $b_1'$ is by (6.2.1) an element of $S$. If $m(b, s+)
\mid m(w)$, then $m(b_1') \mid m(w)$ and $w$ is therefore by (6.2.2) an element of $S$.

$2b_1'$ is by (6.2.1) an element of $S$. If $2m(b, s+) \mid m(w)$, then $m(2b_1') \mid m(w)$
and $w$ is therefore by (6.2.2) an element of $S$.

COROLLARY 6.3. If the primitive set $b_1, \dots, b_k$ in the separable group $J$ is a
subset of the characteristic subgroup $S$ of $J$ and if the element $b \neq 0$ in $J$ satisfies
l.c.m. $(|b_1|, \dots, |b_k|) \le |b|$, g.c.d. $(m(b_1), \dots, m(b_k)) \mid m(b)$, then $b$ is an
element of $S$.

Proof. From the assumptions there exist relatively prime positive integers
$h_i$ such that $m(b_i) \mid h_i m(b) = m(h_i b)$. Since $b_i$ is an element of $S$, (6.2.2) implies
that $h_i b$ is an element of $S$. Since the $h_i$ are relatively prime, there exist integers
$h_i'$ such that $\sum_{i=1}^{k} h_i' h_i = 1$ and $b = \sum_{i=1}^{k} h_i' h_i b$ is an element of $S$.

DEFINITION 6.4. If $S$ is a subgroup of the separable group $J$, $s$ a genus, then
$g(S, s) = g(S < J, s)$ is the g.c.d. of all the multiplicities $m(b < J)$ of primitive
elements $b$ of genus $s$ in $J$ which are elements of $S$.

Note that $g(S, s) = \infty$, if $S$ does not contain primitive elements of genus $s$.

THEOREM 6.5. (a) If $S$ is a characteristic subgroup of the separable group $J$,
then $S$ contains every primitive element $b$ of genus $s$ in $J$ such that $g(S, s)
\mid m(b < J)$.

(b) If $S$ is a regular subgroup of the separable group $J$ and $T$ a characteristic
subgroup of $J$, then $S \le T$ if (and only if) $g(T, s) \mid g(S, s)$ for every genus $s$.

(c) If $S$ and $T$ are regular subgroups of the separable group $J$, then $S = T$ if
(and only if) $g(T, s) = g(S, s)$ for every genus $s$.

Proof. $m(b < J)$ has the same infinite part for every primitive element $b$
of genus $s$ in $J$. If the prime number $p$ does not divide the infinite part of the
numbers of genus $s$, and if $g(S, s) \neq \infty$, there exists a primitive element $b_p$
of genus $s$ in $J$ which is an element of $S$ such that the $p$-values of $m(b_p < J)$
and $g(S, s)$ are equal (and finite). Let now $b$ be a primitive element of genus $s$
in $J$ such that $g(S, s) \mid m(b < J)$. $S$ contains a primitive element $b'$ of genus $s$.
If $m(b' < J) \mid m(b < J)$, then $m(b < J) = m(hb' < J)$ for some integer $h \neq 0$,










and $b$ is an element of $S$ by Corollary 5.3. If $m(b' < J)$ is not a divisor of
$m(b < J)$, there exists a positive integer $k$ which is relatively prime to the
infinite part of $m(b)$ such that $m(b' < J) \mid m(kb < J) = km(b < J)$. $kb$ is
therefore an element of $S$. Let $h$ be the smallest positive integer such that $hb$
is an element of $S$. Then $h \mid k$. Furthermore there exists in $S$ by Corollary 5.3
a multiple of $b$ whose multiplicity has the same $p$-value as $m(b_p < J)$. If
$k \neq 1$ and $p \mid k$, there exists in $S$ an element $wp^{-1}b$, where $w$ is a positive integer
relatively prime to $p$. If $w'w + k'k$ = g.c.d. of $w$ and $k$, then $(w'w + k'k)b$ is an
element of $S$ and $0 < w'w + k'k < k$. That being impossible, $k = 1$, and $b$ is an
element of $S$.

If $S$ is regular and the sum $b$ of the primitive set $b_1, \dots, b_k$ an element of $S$,
then by (6.2.1) every $b_i$ is contained in $S$. If the characteristic subgroup $T$ of
$J$ satisfies $g(T, s) \mid g(S, s)$, it follows from (a) that $S \le T$, i.e., (b) is true. (c)
is a consequence of (b).

THEOREM 6.6. Let, for every genus $s$, $g(s)$ be either a (generalized) number or the
symbol $\infty$. Then there exists a regular subgroup $S$ of the separable group $J$ such
that $g(S < J, s) = g(s)$ for every genus $s$ if, and only if,

(a) $g(s) = \infty$, if $(J, s < |x|) = (J, s \le |x|)$;

(b) if $g(s) \neq \infty$, then

(b1) $|g(s)| \le s$;
(b2) the infinite part of $g(s)$ is the infinite part of the numbers of genus $s$;
(b3) the finite part $g(s)_f \mid g(t)$ for every $t < s$;

(c) if $g(t) \neq \infty$, $t < s$ and $(J, s < |x|) \neq (J, s \le |x|)$, then $g(s) \neq \infty$.

Proof. A. Suppose that $S$ is a regular subgroup of the separable group $J$.
If $(J, s < |x|) = (J, s \le |x|)$, there do not exist primitive elements of genus $s$
in $J$, i.e., $g(S, s)$ satisfies (a). That $g(S, s)$ satisfies (b1) and (b2) is obvious.
(b3) and (c) are consequences of Corollary 6.3.

B. Suppose that the numbers $g(s)$ satisfy the conditions (a) to (c). Then put

$$S(s) = (J;\ s \le |x|,\ g(s) \mid m(x < J)).$$

Thus for every genus $s$ a regular subgroup of $J$ has been defined. $S(s) = 0$ if,
and only if, $g(s) = \infty$ (by (c)). $S(s)$ consists exactly of the elements $x \neq 0$ with
$s \le |x|$, $g(s) \mid m(x)$. (a), (b1), (b2) and (c) imply that $g(S(s) < J, s) = g(s)$.
Now let $S$ be the subgroup of $J$, generated by all these subgroups $S(s)$. As
the join of regular subgroups, $S$ is a regular subgroup of $J$. Since by Lemma 5.8
every element of $S(s)$ is the sum of primitive elements $b$ such that $s \le |b|$,
$g(s) \mid m(b)$, it follows that every primitive element $s$ of genus $t$ which is
contained in $S$ has the form $s = u + v + w$, where

$u = \sum u_i$, $u_i$ primitive, $|u_i| < t$, $g(|u_i|) \mid m(u_i)$;
$v = \sum v_i$, $v_i$ primitive of genus $t$, $g(t) \mid m(v_i)$;
$w = \sum w_i$, $w_i$ primitive, $|w_i| \nleq t$, $g(|w_i|) \mid m(w_i)$.


Then $s \equiv u + v$ mod $(J, |x| \nleq t)$, and since $s$ is a primitive element of genus $t$
in the separable group $J$, it follows from Lemma 5.2 that $m(u + v) \mid m(s)$.

$g(t) \mid m(v_i)$ implies $g(t) \mid m(v)$.

$g(t)_\infty = m(s)_\infty$, since $t$ is the genus of $s$ and $g(s)$ satisfies (b2).

$g(t)_f \mid g(|u_i|) \mid m(u_i)$, since $|u_i| < t$ and $g(s)$ satisfies (c) and (b3).
Therefore $g(t)_f \mid m(u)$. Thus it has been proved that $g(t) \mid m(u + v) \mid m(s)$ and
therefore that $g(t) \mid g(S < J, t)$. Since $S(t) \le S$, it follows that $g(S, t) \mid
g(S(t), t) = g(t)$, i.e., $g(s) = g(S, s)$.

COROLLARY 6.7. Suppose that $b \neq 0$ is an element of the separable group $J$
and that $S$ is the smallest regular subgroup of $J$ which contains $b$. Then

$g(S, s) = \infty$, if $m(b, s+) = \infty$ or if $(J, s \le |x|) = (J, s < |x|)$;

if $m(b, s+) \neq \infty$ and $(J, s \le |x|) \neq (J, s < |x|)$, then
$g(S, s) = m(b < J, s+)$, if $m(b < J, s) = \infty$,

and if $m(b < J, s) \neq \infty$, then $g(S, s)_f = m(b < J, s+)_f$ and $g(S, s)_\infty$ is the
infinite part of the numbers of genus $s$.

Proof. As a consequence of Theorems 6.6 and 6.5c there exists one and only
one regular subgroup $S$ of $J$ such that $g(S, s)$ has the value given in the Corollary
6.7. This regular subgroup $S$ contains the element $b$, since the elements $b_i$ of
a primitive set with sum $b$ are by Lemma 5.7 and by Theorem 6.5a contained
in $S$. If finally $T$ is a regular subgroup of $J$ which contains $b$, then $S \le T$ by
Lemma 6.2 and Theorem 6.5a, i.e., $S$ is the smallest regular subgroup of $J$
which contains $b$.

COROLLARY 6.8. The elements $b$ and $b'$ of the separable group $J$ are contained
in the same regular subgroups of $J$ if, and only if, $m(b, s) = m(b', s)$ and $m(b, s+)
= m(b', s+)$ for every genus $s$.

For the smallest regular subgroup of J which contains b depends by Corollary 
6.7 only on the values of m(b, s) and m(b, s+). 

Thus the type of an element is generally not completely determined by the 
regular subgroups which contain the element. 

COROLLARY 6.9. If $J$ is a separable group, the following statements are
equivalent:

(a) Two elements of $J$ are isotype in $J$ if, and only if, they are contained in the
same regular subgroups of $J$.

(b) There does not exist a genus $w$ with the following properties:

(b1) $w$ is a singular genus for $J$;
(b2) there exists a genus $s$ such that $s < w$ and $(J, s < |x|) \neq (J, s \le |x|)$;
(b3) there exists an ordinary positive integer $n$ which is relatively prime to the
infinite part of the numbers of genus $w$ such that $Z(J, n, w)$ contains elements
of order $n$ which are not conjugate.

This is a consequence of Corollary 6.8, Lemma 5.8, Corollary 5.10b and
Theorem 5.12.

Notation. If $b \neq 0$ is an element of the separable group $J$, $s$ a genus, then
$q(b, s)'$ is either 1 or 2 and $q(b, s)' = 2$ if, and only if,










(1) $s$ is singular for $J$ and 2 relatively prime to the infinite part of the numbers
of genus $s$;

(2) $m(b, s) \neq m(b, s+)$ and either

(21) $m(b, s) \neq \infty$, $q(b, s)$ even or
(22) $m(b, s) = \infty$.

THEOREM 6.10. Suppose that $b \neq 0$ is an element of the separable group $J$,
that $C$ is the smallest characteristic subgroup of $J$ containing $b$, and that $R$ is the
greatest regular subgroup of $J$ contained in $C$.

(a) $C = R$ if, and only if, at most one of the numbers $q(b, s)' = 2$.

(b) If $R < C$, then

(b1) $C/R$ is a cyclic group of order 2 (and consists therefore of the elements
$R$ and $R + b$);

(b2) $g(R < J, s) = \infty$, if $(J, s < |x|) = (J, s \le |x|)$ or if $m(b, s+) = \infty$,
$g(R < J, s)_f = q(b, s)'m(b, s+)_f$ and
$g(R < J, s)_\infty$ is the infinite part of the numbers of genus $s$, if $(J, s < |x|) \neq
(J, s \le |x|)$ and $m(b, s+) \neq \infty$.

Proof. Since $q(b, s)' \mid q(b, s)$, if $m(b, s) \neq \infty$, there exists by Theorem 6.6
one and by Theorem 6.5c only one regular group $R$ such that the values of
$g(R, s)$ are those given in (b2). By Lemma 6.2, $R \le C$. $2b$ is an element of
$R$ by Lemma 5.8 and Theorem 6.5a.

If every $q(b, s)' = 1$, then $R$ is by Corollary 6.7 the smallest regular subgroup
of $J$ containing $b$, and therefore $R = C$ is a regular subgroup of $J$.

If at least one of the $q(b, s)' = 2$, then $b$ is not an element of $R$, as follows from
Lemma 6.2 and Lemmas 5.7, 5.8. But $2b$ is an element of $R$ and the elements
of $R$ and $R + b$ form therefore a group $C'$ which contains $b$ and satisfies $R <
C' \le C$. $C'/R$ is a cyclic group of order 2.

Let $b_1, \dots, b_k$ be a primitive set with sum $b$ and satisfying the conditions
of Lemma 5.8. If $s$ is the genus of $b_i$, then $b_i$ is an element of $R$ if, and only if,
$q(b, s)' = 1$. If $q(b, |b_i|)' = 2$ for $1 \le i \le h$, then $1 \le h \le k$ and

$$b \equiv b' = \sum_{i=1}^{h} b_i \bmod R.$$

Let $\alpha$ be a proper automorphism of $J$. Since $|b_i|$ is for $1 \le i \le h$ a singular
genus, it follows from Lemma 5.6 that $b_i\alpha = r_i(b_i + b_i')$, where $r_i$ is a rational
number $\neq 0$ whose numerator and denominator are divisors of the infinite
part of the numbers of genus $|b_i|$ and where $b_i'$ is an element of $(J, |b_i| < |x|)$
satisfying $m(b_i) \mid m(b_i')$.

Now $s < t$ implies that $q(b, t)'m(b, t+)_f \mid m(b, s+)_f$ (if $q(b, t)'$ is defined),
since the 2-value of $m(b, t+)$ is smaller than the 2-value of $m(b, t)$ and therefore
than the 2-value of $m(b, s+)$, if $q(b, t)' = 2$. Furthermore, $b_i'$ is the sum of a
primitive set $b_{i1}', \dots$ and, since $|b_i| < |b_{ij}'|$, it follows that

$$q(b, |b_{ij}'|)'m(b, |b_{ij}'|+)_f \mid m(b, |b_i|+)_f \mid m(b_i) \mid m(b_i') \mid m(b_{ij}'),$$

i.e., $b_{ij}'$ and consequently $b_i'$ is an element of $R$.







If $r_i = r_i'r_i''^{-1}$, then $r_i'$, $r_i''$ are both odd (if relatively prime) and since $2b_i$ and
(by Corollary 5.3) $2r_i^{-1}b_i$ are elements of $R$, it follows that $b_i - b_i\alpha \equiv 0$ mod $R$.
Since $R$ is a characteristic subgroup, $C'$ is therefore also a characteristic
subgroup, and since $b$ is an element of $C' \le C$, it follows that $C' = C$.

If exactly one $q(b, s)' = 2$, i.e., $h = 1$, then $b$ is the sum of a primitive set in
$J$ which is contained in $C$. $C$ is therefore generated by the primitive elements
in $J$ which are contained in $C$, i.e., $C$ is by Theorem 6.6 (and its proof) regular.

If $2 \le h$ (in the above notation), then $b_1$ and $b_2$ are not contained in $R$ and
therefore not in $C$, which is singular by (6.2.1).

COROLLARY 6.11. The elements $b$ and $b'$ of the separable group $J$ are contained
in the same characteristic subgroups of $J$ if, and only if, $m(b, s) = m(b', s)$ and
$m(b, s+) = m(b', s+)$ for every genus $s$, i.e., if, and only if, $b$ and $b'$ are contained
in the same regular subgroups of $J$.

Proof. If $b$ and $b'$ are contained in the same characteristic subgroups of $J$,
then they are contained in the same regular subgroups of $J$. If they are
contained in the same regular subgroups, then $m(b, s) = m(b', s)$ and $m(b, s+) =
m(b', s+)$ for every genus $s$ by Corollary 6.8. If $b$ and $b'$ satisfy these identities,
then it follows from Theorem 6.10, since $q(b, s)' = q(b', s)'$, that $R(b) = R(b')$,
where $R(u)$ is the greatest regular subgroup of the smallest characteristic
subgroup $C(u)$ of $J$ which contains $u$. If furthermore $b_1, \dots, b_k$ is a primitive
set with sum $b$ and $b_1', \dots, b_{k'}'$ a primitive set with sum $b'$, $b_i$ not contained
in $R(b)$, then $2b_i$ is contained in $R(b)$, $q(b, |b_i|)' = 2$ and therefore $q(b', |b_i|)' =
2$, i.e., $|b_i| = |b_i'|$ and $b_i - b_i' \equiv 0$ mod $R$, i.e., $C(b) = C(b')$. Note that $C(b)$
is completely determined by $R(b)$ and the set of genera $s$ such that $q(b, s)' = 2$.

Corollary 6.12. There exist singular subgroups of the separable group J if,
and only if, there exist at least two different singular genera s such that the 2-values 
of the numbers of genus s are finite. 

This is a consequence of Theorem 6.10, since the join of regular subgroups 
is a regular subgroup, and since every characteristic subgroup S is the join of 
the groups C(s) with s in S.

Survey of the singular subgroups. If C is any characteristic subgroup of the 
separable group J, then C contains a greatest regular subgroup R(C) and is 
contained in a smallest regular subgroup S(C). By Lemma 6.2 and Theorem 
6.10 all the elements ≠ 0 of S(C)/C and C/R(C) are of order 2.

Suppose now that C is singular and b* ≠ 0 an element of C* = C/R(C).
Then denote by S(b*) the set of all the genera s such that q(b, s)’ = 2 for every
element b in the class b*. S(b*) contains at least two different genera and there
exists in b* an element b_0 such that m(b_0, s) ≠ m(b_0, s+) if, and only if, s is
contained in S(b*) and q(b_0, s)’ = 2, if s is contained in S(b*). S(b* + c*)
contains exactly those genera which are contained either in S(b*) or in S(c*)
but not in both these sets.

Thus in the set T = T(C) of sets S(b*) with b* in C* an addition has been
defined, such that T becomes an abelian group (S(0) = 0) and such that the
correspondence S(b*) between C* and T is an isomorphism.
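
The addition of sets of genera described here is the symmetric difference. The following is a minimal Python sketch of that operation, assuming genera are represented by arbitrary hashable labels; the labels `s1`, `s2`, `s3` and the helper names `add` and `span` are hypothetical and serve only as an illustration.

```python
def add(s, t):
    """Sum of two sets of genera: the genera lying in exactly one of them."""
    return frozenset(s) ^ frozenset(t)


def span(generators):
    """All finite sums (iterated symmetric differences) of the given sets."""
    elems = {frozenset()}          # the empty set plays the role of S(0)
    for g in generators:
        elems |= {add(e, g) for e in elems}
    return elems


# Hypothetical genus labels; any hashable values would do.
gens = [frozenset({"s1", "s2"}), frozenset({"s2", "s3"})]
T = span(gens)
```

Every element of `T` is its own inverse under `add`, which matches the statement that all elements ≠ 0 of C* are of order 2.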








96 REINHOLD BAER 

Finally if s is an element of S(b*), then s is a singular genus for J and the 
2-value of g(R(C), s) is positive and finite. 

If conversely C* is an abelian group whose elements ≠ 0 are of order 2, R a
regular subgroup of the separable group J and S(b*) a function of the elements
of C* which has all the mentioned properties, then there exists a characteristic
subgroup C of J such that R = R(C), C* = C/R(C) and S(b*) is the above iso-
morphism between C* and T(C).

Finally C is uniquely determined by R(C) and S(b*).

Characteristic decompositions. A direct decomposition of a group J is a char- 
acteristic decomposition, if all the summands are characteristic subgroups. 
Every characteristic direct summand is regular.” 

The group J is characteristic irreducible, if J is a summand of every charac- 
teristic decomposition. It may happen that J is characteristic irreducible 
and that there exists a characteristic direct summand ≠ 0, ≠ J of J. There
exists at most one characteristic decomposition in characteristic irreducible 
direct summands.” 

Suppose now that the group J is separable and partially reducible. The
set P(J) of all the genera s with (J, s ≤ |x|) ≠ (J, s < |x|) can be decomposed
into sets S such that

every genus is contained in exactly one set S;

every S contains genera;

if s < t are genera in P(J), then they are contained in the same S;

every S is as small as possible.

This decomposition of P(J) is unique. Let now J(S) be the join of the sub-
groups (J, s ≤ |x|) with s in S. Then J is the direct sum of the groups J(S)
and this decomposition is a characteristic decomposition into characteristic 
irreducible, direct summands. 
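
The four conditions on the sets S say that the sets are the smallest classes closed under comparability of genera. The following is a minimal Python sketch of that partition, under the assumption (not taken from the text) that genera are modelled by positive integers ordered by divisibility; `comparable` and `smallest_classes` are hypothetical names.

```python
def comparable(s, t):
    # Stand-in order (an assumption): s <= t when s divides t.
    return s % t == 0 or t % s == 0


def smallest_classes(genera):
    """Partition genera into the smallest classes closed under comparability."""
    parent = {g: g for g in genera}

    def find(g):
        # Union-find root lookup with path halving.
        while parent[g] != g:
            parent[g] = parent[parent[g]]
            g = parent[g]
        return g

    items = list(genera)
    for i, s in enumerate(items):
        for t in items[i + 1:]:
            if comparable(s, t):
                parent[find(s)] = find(t)

    classes = {}
    for g in items:
        classes.setdefault(find(g), set()).add(g)
    return sorted(sorted(c) for c in classes.values())
```

With the stand-in order, `smallest_classes([2, 4, 3, 9, 5])` separates the genera into three classes, while `smallest_classes([2, 3, 6])` yields a single class because 6 is comparable with both 2 and 3.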


7. Types of subgroups. The following concepts and facts concerning abelian 
groups without elements of infinite order will be used in the course of this section. 

If F is an abelian group without elements of infinite order, then the order of F 
is the l.c.m. of the orders of its finite subgroups; the rank m(F) of F is the small- 
est number m such that every finite subgroup of F can be decomposed into a 
direct sum of at most m cyclic groups. (Only groups of finite rank will be
considered.)

If the rank m(F) of the abelian group F without elements of infinite order is
finite, there exists a canonical decomposition of F into a direct sum of m(F)
groups F_i of rank 1 such that the order of F_i is a divisor of the order of F_{i+1}.
The orders of the summands F_i of a canonical decomposition do not depend
on the particular canonical decomposition and determine the structure of F
completely. They are the invariants m_1(F), ..., m_{m(F)}(F) of F.
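
For a finite group given as a direct sum of cyclic groups, the invariants can be computed prime by prime. The following is a minimal Python sketch covering the finite case only; groups of rank 1 with infinite order are outside its scope, and the function names are hypothetical.

```python
def prime_powers(n):
    """Return {p: e} with n equal to the product of p**e."""
    out, p = {}, 2
    while p * p <= n:
        while n % p == 0:
            out[p] = out.get(p, 0) + 1
            n //= p
        p += 1
    if n > 1:
        out[n] = out.get(n, 0) + 1
    return out


def invariants(cyclic_orders):
    """Invariants m_1 | m_2 | ... of a direct sum of finite cyclic groups."""
    by_prime = {}
    for n in cyclic_orders:
        for p, e in prime_powers(n).items():
            by_prime.setdefault(p, []).append(e)
    rank = max((len(v) for v in by_prime.values()), default=0)
    result = [1] * rank
    for p, exps in by_prime.items():
        exps.sort(reverse=True)
        # The largest power of p goes into the last (largest) invariant.
        for k, e in enumerate(exps):
            result[rank - 1 - k] *= p ** e
    return result
```

For instance, the direct sum of cyclic groups of orders 4 and 6 has rank 2 and invariants 2 and 12.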

If F is in particular a finite group, then the groups F_i are cyclic groups and

13 See R. Baer, The decomposition of abelian groups into direct summands, Quart. Jour. 
of Math., (2), vol. 6 (1935), pp. 222-232. 







the subset B of F is a basis of F, if the cyclic subgroups of F, generated by the 
elements in B, form a canonical decomposition of F. 
If F is of finite rank and S a subgroup of F, then S and F/S are of finite rank. 
If A is any abelian group, the subset F(A) of all the elements of finite order 
in A is a subgroup of A and A/F(A) does not contain elements ≠ 0 of finite order.
If J is an abelian group without elements ≠ 0 of finite order, S a subgroup of J,
then the closed subgroup S̄ of J which is generated by S satisfies F(J/S) = S̄/S.
If S is of finite rank, then F(J/S) is of finite rank and m(F(J/S)) ≤ r(S).
If S is a subgroup of the (abelian) group J (without elements ≠ 0 of finite
order), then the defect of an element b ≠ 0 of S in J is


d(b < S < J) = m(b < J)/m(b < S).


An element b # 0 of S has the same genus in S and in J if, and only if, its defect 
is an ordinary integer. 

If R’ ≠ 0 is a subgroup of the rational group R, all the elements ≠ 0 of R’
have the same defect (R:R’). Then (R:R’) is the order of the group R/R’ of
rank 1 and R = (R:R’)^{-1}R’.

Lemma 7.1. (a) The completely reducible group J of finite rank is finite mod
its subgroup S if, and only if,

(a’) r(S) = r(J),
(a’’) the elements ≠ 0 of S have the same genus in S and in J.

(b) If S is a subgroup of the direct sum J of a finite number of isomorphic
rational groups (all of genus s), and if F = J/S is finite, there exist decompositions:

\[
\text{(b+)} \qquad J = \sum_{i=1}^{r(J)} R_i, \qquad S = \sum_{i=1}^{m(F)} m_i(F) R_i + S', \qquad S' = \sum_{i=m(F)+1}^{r(J)} R_i,
\]

where the R_i are groups of genus s, S’ is a greatest closed subgroup of J contained
in S, m_i(F)R_i is for 1 ≤ i ≤ m(F) the intersection of S and R_i, m(F) the finite
rank and m_i(F) the finite invariants of the finite group F = J/S, and m(F) ≤ r(J).
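
In the special case where J is a direct sum of n infinite cyclic groups and S is generated by the rows of an n-by-n integer matrix with nonzero determinant, the invariants of F = J/S can be read off from the determinantal divisors of that matrix. The following Python sketch treats only this special case; it is not the construction used in the proof, and the names `det` and `quotient_invariants` are hypothetical.

```python
from itertools import combinations
from math import gcd


def det(m):
    """Integer determinant by Laplace expansion (adequate for small matrices)."""
    if not m:
        return 1
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, a in enumerate(m[0]):
        if a:
            minor = [row[:j] + row[j + 1:] for row in m[1:]]
            total += (-1) ** j * a * det(minor)
    return total


def quotient_invariants(gens):
    """Invariants > 1 of Z^n / S, where S is spanned by the rows of gens.

    Assumes gens is square with nonzero determinant, so that Z^n / S is finite.
    """
    n = len(gens)
    divisors = [1]                       # d_0 = 1
    for k in range(1, n + 1):
        g = 0                            # d_k = g.c.d. of all k-by-k minors
        for rows in combinations(range(n), k):
            for cols in combinations(range(n), k):
                g = gcd(g, det([[gens[r][c] for c in cols] for r in rows]))
        divisors.append(g)
    factors = [divisors[k] // divisors[k - 1] for k in range(1, n + 1)]
    return [f for f in factors if f != 1]
```

The number of invariants returned is the rank m(F), and the remaining n − m(F) summands correspond to the closed part S’ of the decomposition (b+).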

Proof. If J is a completely reducible group of finite rank and J/S is finite,
then r(J) = r(S), since every element of J has finite order mod S. If b ≠ 0
is an element of S, b̄ the closed subgroup of J generated by b, b̄’ the intersection
of S and b̄, then b̄/b̄’ represents exactly a subgroup of J/S, i.e., (b̄:b̄’) is finite
and b has the same genus in S and in J.

If, conversely, (a’) and (a’’) are satisfied, J is the direct sum of the rational
groups J_i, J’_i the intersection of S and J_i, S’ the direct sum of the groups J’_i,
then S’ ≤ S, (J_i:J’_i) finite, i.e., J/S’ is finite and therefore J/S = (J/S’)/
(S/S’) is finite.

Suppose now that J is a direct sum of a finite number of isomorphic rational
groups of genus s and that J/S = F is finite. There exists a greatest closed
subgroup S’ of J such that S’ ≤ S. S’ is by Corollary 3.5 a direct summand of
J, since r(J) is finite and S’ is closed. If J = S’ + J’’, then S = S’ + S’’,
where S’’ is the intersection of S and J’’. F = J/S and J’’/S’’ are isomorphic
finite groups and there exists therefore a basis B of J’’ mod S’’.

(7.1.1) Every basis B of J’’ mod S’’ is a greatest independent subset of J’’.













For if Σ_{b in B} c_b b = 0 with integer, relatively prime coefficients c_b, then c_b b ≡ 0
mod S’’, and therefore c_b ≡ 0 mod m_i(F), i.e., c_b = 0 and B is independent.

In order to prove that B contains r(J’’) elements, denote by B̄ the closed
subgroup of J’’ generated by B. Since every class of J’’/S’’ contains elements
of B̄, there exists corresponding to every element u ≠ 0 of J’’ an element s(u)
such that s(u) ≡ 0 mod S’’, u ≡ s(u) mod B̄. Since 0 is the only closed sub-
group of J’’ contained in S’’, there exists corresponding to every element s ≠ 0
of S’’ a positive integer w(s) such that w(s)^{-1}s exists in J, but not in S’’.
There exists, therefore, corresponding to every element v of J’’ a chain s_i, t_i
satisfying

\[
v = t_0; \qquad s_i = 0 \ \text{if}\ t_i = 0, \quad s_i = s(t_i) \ \text{if}\ t_i \neq 0; \qquad t_{i+1} = 0 \ \text{if}\ s_i = 0, \quad t_{i+1} = w(s_i)^{-1} s_i \ \text{if}\ s_i \neq 0.
\]

Then v ≡ ∏_{i=0}^{k-1} w(s_i) · s_k mod B̄, if s_i ≠ 0 for i < k, and therefore v ≡ 0 mod B̄,
if at least one s_i = 0.

If none of the s_i = 0, none of the numbers w(s_i) is relatively prime to the
finite order f of the finite group F and there exists therefore a prime number
p | f (p therefore relatively prime to the infinite parts of the multiplicities of
elements in J’’) such that

\[
v \equiv p^i z_i \bmod \bar{B}
\]

has for every i a solution z_i in J’’. Since B̄ is by Corollary 3.5 a direct sum-
mand of J’’, this implies that v ≡ 0 mod B̄, i.e., J’’ = B̄ and this completes
the proof of (7.1.1).

Let now b_1, ..., b_m form a basis of J’’ mod S’’ such that m = m(F) and
m_i(F) is the order of b_i mod S’’. Denote by B_i the closed subgroup of J gener-
ated by b_1, ..., b_i. Then B_1 is a rational group, B_m = J’’ by (7.1.1), and
B_{i+1} = B_i + R_{i+1} by Corollary 3.5. Then b_{i+1} = u_{i+1} + v_{i+1} with u_{i+1} in B_i,
v_{i+1} in R_{i+1}, and the elements v_1 = b_1, v_2, ..., v_m form a basis of J’’ which is also
a basis of J’’ mod S’’. This shows that the groups R_i together with S’ define
a decomposition of J which meets the requirements of (b+).

Remark. The following example shows that it is impossible to omit in Lemma
7.1b the assumption that all the elements ≠ 0 of J have the same genus in J.

A basis of the group J is formed by the elements b’, b’’ with m(b’ < J) = 1,
m(b’’ < J) = 2^∞ and the subgroup S is generated by the elements 3b’ + b’’,
2°'3b’’. Then J/S is a cyclic group of order 3, but there does not exist a de-
composition (b+) of J, since a multiple of b’’ is contained in every basis of J.

DEFINITION 7.2. The direct decomposition

\[
J = \sum_\nu J_\nu + J'
\]

of the group J is a complete reduction of the subgroup S in J, if the groups J_ν are
rational and S = Σ_ν S_ν for S_ν = intersection of S and J_ν.













If there exists a complete reduction of the subgroup S in J, then S is com- 
pletely reducible in J. 

If the subgroup S of J is completely reducible in J, then S is completely
reducible. The converse is not true, since e.g., rational subgroups S of separable
groups J are completely reducible in them if, and only if, all the elements ≠ 0 in
S are primitive in J. It is a consequence of Lemma 7.1b that S is completely
reducible in J, if J/S is finite, r(J) = r(S), J is completely reducible and all
the elements ≠ 0 in J have the same genus in J.

Corollary 7.3. Suppose that J and its subgroup S have the same finite rank.

(a) S is completely reducible in J, all the elements ≠ 0 of J have the same genus
g in J and all the elements ≠ 0 of S have the same genus s (≤ g) in S if, and only
if, there exists a subgroup T of J such that

(1) S ≤ T ≤ J,

(2) all the elements t ≠ 0 of T have the same defect d = d(t < T < J) in J
(and therefore J = d^{-1}T),

(3) T is a direct sum of isomorphic rational groups,

(4) all the elements ≠ 0 of S have the same genus in S and in T.

(b) If the subgroup S of J satisfies the conditions of (a), then there exist de-
compositions into rational groups:

\[
\text{(b+)} \qquad J = \sum_{i=1}^{r(J)} J_i, \qquad S = \sum_{i=1}^{r(J)} S_i,
\]

where S_i = intersection of S and J_i, |J_i| = g, |S_i| = s, (J_i:S_i) = m_i(J/S) =
i-th invariant of the group J/S without elements of infinite order for 1 ≤ i ≤
m(J/S).

Note that either all the numbers (J_i:S_i) are ordinary integers (the case dealt
with in Lemma 7.1) or all the (J_i:S_i) have the same genus and m(J/S) =
r(J) = r(S).

Proof. Suppose first that S is completely reducible in J, that all the elements
≠ 0 of J have the same genus g, and that all the elements ≠ 0 of S have the
same genus s in S. Then there exist decompositions into rational groups

\[
J = \sum_{i=1}^{r(J)} R_i, \qquad S = \sum_{i=1}^{r(J)} R'_i, \qquad |R_i| = g, \quad |R'_i| = s,
\]

where R’_i is the intersection of S and R_i.
Denote by d the g.c.d. of all the d(s < S < J) with s ≠ 0 in S and by d*
the g.c.d. of the numbers (R_i:R’_i). If s ≠ 0 is an element of S, then
s = Σ_{i=1}^{r(J)} s_i with s_i in R’_i. Then m(s < S) is the g.c.d. of the numbers
m(s_i < R’_i) and m(s < J) is the g.c.d. of the numbers m(s_i < R_i). If s_i ≠ 0,
there exists a uniquely determined ordinary positive integer h_i which is relatively
prime to the infinite part of the numbers of genus g such that

\[
m(s_i < J) = h_i \, m(s < J)
\]










and a uniquely determined ordinary positive integer k_i which is relatively prime
to the infinite part of the numbers of genus s such that

\[
m(s_i < S) = k_i \, m(s < S).
\]

Then h_i d(s < S < J) = k_i (R_i:R’_i), and since the numbers h_i with s_i ≠ 0 are
relatively prime, and also the numbers k_i with s_i ≠ 0 are relatively prime, it
follows that d(s < S < J) is the g.c.d. of the numbers (R_i:R’_i) with s_i ≠ 0,
i.e., that

\[
d = d^*.
\]

Since r(J) is finite and all the numbers (R_i:R’_i) have the same genus, it fol-
lows that every n_i = (R_i:R’_i)/d is an ordinary integer. Denote now by R’’_i
the subgroup of R_i which satisfies n_i R’’_i = R’_i. Then (R_i:R’’_i) = d, (R’’_i:R’_i) =
n_i and the direct sum

\[
T = \sum_{i=1}^{r(J)} R''_i
\]

satisfies (1)-(4).
Suppose secondly that there exists a subgroup T of J which satisfies (1)-(4).
Then there exists by Lemma 7.1 a decomposition

\[
T = \sum_{i=1}^{r(J)} T_i, \qquad S = \sum_{i=1}^{r(J)} S_i,
\]

such that S_i is the intersection of S and T_i, |S_i| = |T_i| = s and the numbers
(T_i:S_i) with 1 ≤ i ≤ m(T/S) are the invariants of the finite group T/S.
If J_i = d^{-1}T_i is the closed subgroup of J generated by T_i, then J = Σ_{i=1}^{r(J)} J_i,
|J_i| = g and (J_i:S_i) = d(T_i:S_i), i.e., these decompositions of J and S are
decompositions (b+).

If finally there exist decompositions (b+) of J and S, then S is completely
reducible in J, all the elements ≠ 0 of J have the same genus in J and all the
elements ≠ 0 of S have the same genus in S.

If J and its subgroup S have the same finite rank; if J is completely reducible;
if all the elements ≠ 0 of J have the same genus in J; and if all the elements
≠ 0 of S have the same genus in S, then it can happen that S is not completely
reducible in J. This is shown by

Example 7.4. Let J be a group with a basis b’, b’’ such that m(b’ < J) =
m(b’’ < J) = p^∞ for a given prime number p. Choose the ordinary integers c_i
in such a way that c = Σ_{i=0}^{∞} c_i p^i is an irrational p-adic number and 0 ≤ c_i < p.
Denote by S the subgroup of J generated by the elements b’, b’’,
p^{-i}(b’ + Σ_{j=0}^{i-1} c_j p^j b’’) for i = 1, 2, .... Then all the multiplicities m(s < S) of elements




s ≠ 0 in S are ordinary integers, S/pS is a cyclic group of order p and S is
therefore not even completely reducible (in S).¹⁴
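
The generators of S in Example 7.4 can be tabulated for finitely many digits. The following is a minimal Python sketch, assuming that the i-th additional generator is p^{-i}(b’ + (c_0 + c_1 p + ... + c_{i-1} p^{i-1}) b’’); the upper limit of the inner sum is not legible in the source, so this reading is an assumption, and the digit list passed in is hypothetical (a finite list cannot represent an irrational p-adic number).

```python
from fractions import Fraction


def generators(p, digits):
    """Coefficient pairs (x, y) standing for x*b' + y*b''.

    digits = [c_0, c_1, ...]; the i-th extra generator uses c_0 .. c_{i-1}.
    """
    gens = [(Fraction(1), Fraction(0)), (Fraction(0), Fraction(1))]  # b', b''
    partial = 0
    for i, c in enumerate(digits, start=1):
        partial += c * p ** (i - 1)      # c_0 + c_1 p + ... + c_{i-1} p^{i-1}
        gens.append((Fraction(1, p ** i), Fraction(partial, p ** i)))
    return gens
```

Under this reading, p times the (i+1)-th generator differs from the i-th generator by the integer multiple c_i b’’, so the generators form an ascending chain of the kind the example describes.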

Corollary 7.5. Suppose that the group J and its subgroup S satisfy:

(1) S and J have the same finite rank; 

(2) J is a direct sum of isomorphic rational groups (of genus g); 

(3) The g.c.d. d(S < J) of the defects d(s < S < J) of the elements s ≠ 0 of
S in J has the same genus as these defects;

(4) dS < J), = 1. 

Then S is completely reducible in J and all the elements ≠ 0 of S have the same
genus in S.

Proof. There exists by (3) to every element s ≠ 0 of S an ordinary integer
h(s) such that d(s < S < J) = h(s)d(S < J). Therefore m(s < J)/d(S < J) =
h(s)m(s < S) by (4) and all the elements ≠ 0 of S have consequently by (2)
the same genus s in S.

If U is the subgroup of J which satisfies U = d(S < J)^{-1}S and S = d(S <
J)U by (4), then all the elements ≠ 0 of U have the same genus in U and in J.
There exist therefore by Lemma 7.1 decompositions

\[
J = \sum_{i=1}^{r(J)} J_i, \qquad U = \sum_{i=1}^{r(J)} U_i,
\]

U_i = intersection of U and J_i. Thus S = d(S < J)U = Σ_{i=1}^{r(J)} d(S < J)U_i and
d(S < J)U_i = intersection of S and J_i, i.e., S is completely reducible in J.
NOTATION. If S is a subset of the group J, then S̄ is the closed subgroup of J
generated by S.
If s ≤ t are genera and S a subgroup of the group J, then

(S < J, t, s) = intersection of (J, t ≤ |x < J|) and (S, s ≤ |x < S|),
(S < J, t, s+) = intersection of (J, t ≤ |x < J|) and (S, s < |x < S|),
(S < J, t, s)* = join of (S < J, t, s) and (S < J, t, s+),
n(S < J, t, s) = r((S < J, t, s)/(S < J, t, s+)),
N(S < J, t, s) = (S < J, t, s)/(S < J, t, s)*.
These notations and Corollary 7.3 imply the following statement.
(7.6) If the formulas

\[
J = \sum_\nu \bar{S}_\nu + J', \qquad S = \sum_\nu S_\nu,
\]

where S_ν is both a rational group and the intersection of S and S̄_ν, give a complete
reduction of S in J, and if for s ≤ t


14 The group S belongs to a class of groups which has been discussed recently by A.
Kurosch, Ann. of Math., vol. 38 (1937), pp. 175-204.

15 Note that the group J and its subgroup S which have been considered as Example 7.4
satisfy (1)-(3).










S(t, s) is the direct sum of those groups S_ν which satisfy s = |S_ν| and t = |S̄_ν|,
then

\[
(S < J, t, s) = \sum_{t \le v} \sum_{s \le u} S(v, u), \qquad (S < J, t, s+) = \sum_{t \le v} \sum_{s < u} S(v, u),
\]
\[
n(S < J, t, s) = r(S(t, s)) = r(\bar{S}(t, s)), \qquad N(S < J, t, s) = \bar{S}(t, s)/S(t, s).
\]

If the ranks n(S < J, t, s) are finite, there exists a decomposition

\[
J = \sum_\nu \bar{R}_\nu + J', \qquad S = \sum_\nu R_\nu
\]

such that the rational group R_ν is the intersection of S and R̄_ν, S(t, s) is the direct
sum of the groups R_ν with s = |R_ν|, t = |R̄_ν|, and the numbers (R̄_ν:R_ν) ≠ 1
with s = |R_ν|, t = |R̄_ν| are exactly the invariants of N(S < J, t, s).
Corollary 7.7. Suppose that S is a subgroup of finite rank of the separable
group J. Then S is completely reducible in J if, and only if, there exists a direct


decomposition S = Si; such that 


t i 
(a) S,, is completely reducible in Sy: ; 
(b) every element # 0 of S = > Si isa primitive clement of genus t in J; 
(c) St _ > Sei . 


Proof. If S is completely reducible in J, then S§ is a completely reducible 
direct summand of J and the groups S(t, s) of (7.6) show the necessity of the 
condition. If conversely the condition is satisfied, then every 5, is by (b) and 
Theorem 4.2 a direct summand of J. S is therefore the direct sum of the groups 
S, and by Theorem 4.2 a direct summand of J. S is completely reducible in J 
as a consequence of this fact and of (a) and (c).

Theorem 7.8. Suppose that the subgroups S and T are both completely re-
ducible in the group J, and that all the ranks n(S < J, t, s) are finite. Then S
and T are isotype in J if, and only if,

(a) n(S < J, t, s) = n(T < J, t, s) for any two genera s ≤ t;

(b) N(S < J, t, s) and N(T < J, t, s) are isomorphic for any two genera
s ≤ t;

(c) J/S and J/T are isomorphic.

Proof. The necessity of the conditions is obvious. If the conditions (a) to (c)
are satisfied, there exist by (7.6) direct decompositions

\[
J = \sum_\nu \bar{S}_\nu + J' = \sum_\nu \bar{T}_\nu + J'', \qquad S = \sum_\nu S_\nu, \qquad T = \sum_\nu T_\nu,
\]

such that the rational groups S_ν are the intersections of S and S̄_ν, the rational
groups T_ν the intersections of T and T̄_ν, the numbers (S̄_ν:S_ν) ≠ 1 are the
invariants of N(S < J, |S̄_ν|, |S_ν|), the numbers (T̄_ν:T_ν) ≠ 1 are the invari-
ants of N(T < J, |T̄_ν|, |T_ν|). (Note that by (a) also the ranks n(T < J, t, s)
are finite.)

By (a), (b) and (7.6) it is possible to denote the groups S_ν in such a way that

\[
|\bar{S}_\nu| = |\bar{T}_\nu|, \qquad |S_\nu| = |T_\nu|, \qquad (\bar{S}_\nu : S_\nu) = (\bar{T}_\nu : T_\nu)
\]




for every ν. Since J’ and (J/S)/F(J/S), J’’ and (J/T)/F(J/T) are isomorphic,
(c) implies that J’ and J’’ are isomorphic. Therefore there exists a proper auto-
morphism of J which maps S̄_ν upon T̄_ν, J’ upon J’’, and since this automorphism
also maps S_ν upon T_ν, i.e., S upon T, S and T are isotype in J.

Corollary 7.9. Suppose that S and T are subgroups of finite rank of the
separable group J, and that S and T are both completely reducible in J. Then S
and T are isotype in J if, and only if, for any two genera s ≤ t

(a) n(S < J,t,s) = n(T < J,t,s); 

(b) N(S < J,t,s) and N(T < J, t, s) are isomorphic. 

Proof. Suppose that the conditions (a) and (b) are satisfied. By Corollary 
4.3 J has a completely reducible direct summand D of finite rank which con-
tains both S and T. Then it follows from (a), (b) and the finiteness of the
ranks n that D/S and D/T are isomorphic, and S and T are therefore isotype in 
J by Theorem 7.8. 

If the ranks n(S < J, t, s) are finite, the ranks n and the groups N satisfy 

(i) m(N(S < J, t, s)) ≤ n(S < J, t, s), and the equality holds, if s < t;

(ii) Le n(S < J, t,s) = rity"); 

t 


(iii) to every invariant m;(N(S < J, t, s)) there exists a number ¢ of genus t 
and a number s of genus s such that t;s = m,(N(S < J, t,s)). 
If Σ_{t,s} n(S < J, t, s) is finite and J is separable, the above conditions are also
sufficient for the existence of a subgroup S of J with these given ranks n and
groups N.

The following particular cases of the Corollary 7.9 may be mentioned. Sup-
pose that J is a separable group, that S and T are subgroups of finite rank,
that all the elements ≠ 0 of S and of T are primitive elements of the given
genus g in J, that either all the elements ≠ 0 in S and in T have in S and T
respectively the genus g; or all the elements ≠ 0 in S and in T have in S and T
the genus s (< g), and that S and T are completely reducible in J. Then S
and T are isotype in J if, and only if,

(a) r(S) = r(T),

(b) F(J/S) and F(J/T) are isomorphic.

Since r(J) = r(S) + r((J/S)/F(J/S)) = r(S) + r(J/S), we have also the
following result. If the subgroups S and T of the separable group J satisfy the
above assumptions, and if the rank of J is finite (and J therefore completely
reducible), then S and T are isotype in J if, and only if, J/S and J/T are
isomorphic.

If finally J is a direct sum of a finite number of infinite cyclic groups, and S is 
any subgroup of J, then J/S is finite and the type of S in J is completely deter- 
mined by the structure of J/S. 

The following remark may give an idea of what happens if subgroups are 
considered which are not completely reducible in the whole group. Let J be a 
complete group. Then the type of the subgroup S of J in J is completely 
determined by the structure of S and the structure of J/S. 







Chapter IV. Reducibility and separability 


8. Decomposition into isomorphic rational groups. 

Lemma 8.1. Suppose that S is a subgroup of the group J and that all the elements
of the class b* ≠ 0 of J/S have the same genus s in J. Then

(a) S ≤ (J, s ≤ |x|).

(b) s is the genus of b* in J/S if, and only if, there exists in b* an element b 
such that m(b < J) = m(b < J/S). 

Proof. If s ≠ 0 is an element of S, b an element of b*, then b and b + s
have both genus s and s is therefore a divisor of the genus of s.

If there exists in b* an element b such that m(b < J) = m(b < J/S), then 
b and b* have the same multiplicity and therefore the same genus. 

Suppose now that s is the genus of b* and b’ an element of J in b*. Then
m(b’ < J) | m(b’ < J/S) = m(b* < J/S) and there exists an ordinary positive
integer h which is relatively prime to m(b’)_∞ such that hm(b’ < J) = m(b’ <
J/S). Denote by h’ the greatest divisor of m(b’ < J/S) such that the same
prime numbers divide h and h’. Then h’ is an ordinary positive integer and
there exists an element z in J such that h’z ≡ b’ mod S. There exists, as above,
an ordinary positive integer k which is relatively prime to m(h’z)_∞ such that
km(h’z < J) = m(h’z < J/S) = m(b’ < J/S) = m. Since m(h’z) = h’m(z),
mh’^{-1} exists and is relatively prime to h’ (by its definition) and k and h are
therefore relatively prime. There exist ordinary integers k’’, h’’ such that
kk’’ + hh’’ = 1 and

\[
b = h''h\,b' + k''k\,h'z \equiv b' \bmod S
\]

satisfies m(b < J) | m | g.c.d.(h’’hm(b’ < J), k’’km(h’z < J)) | m(b < J), i.e.,
m(b < J) = m(b < J/S).
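
The integers k’’, h’’ with kk’’ + hh’’ = 1 used in this step can be obtained by the extended Euclidean algorithm. The following is a minimal Python sketch; the function name `bezout` is hypothetical, and the sketch covers only this arithmetic step, not the multiplicities.

```python
def bezout(k, h):
    """Return (k2, h2) with k*k2 + h*h2 == g.c.d.(k, h)."""
    old_r, r = k, h
    old_s, s = 1, 0
    old_t, t = 0, 1
    while r:
        q = old_r // r
        old_r, r = r, old_r - q * r
        old_s, s = s, old_s - q * s
        old_t, t = t, old_t - q * t
    return old_s, old_t
```

For relatively prime k and h the two coefficients h’’h and k’’k add up to 1, which is why the element b built from them is congruent to b’ mod S.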

Corollary 8.2. Suppose that S < J and that all the elements of J which are
not contained in S have the same genus s in J. Then

(a) (J, s < |x|) ≤ S < J = (J, s ≤ |x|).

(b) All the elements ≠ 0 of J/S have the genus s if, and only if, there exists
corresponding to every element b of J an element b’ of J such that

\[
b \equiv b' \bmod S, \qquad m(b' < J) = m(b' < J/S) \ (= m(b < J/S)).
\]

This follows from Lemma 8.1 and the fact that every element of (J, s < |x|)
is the sum of elements of S.

Definition 8.3. The class Γ_1 consists of all countable groups. If ν is a finite
or infinite ordinal greater than 1, then Γ_ν consists of all groups J satisfying

(a) J does not belong to a class Γ_μ with μ < ν;

(b) There exists a closed subgroup S of finite rank of J such that J/S is a direct
sum of groups which belong to the join Γ^ν of the classes Γ_μ with μ < ν.

The additive group P of the integer p-adic numbers belongs to the second 
class and the direct sum of an infinity of such groups P belongs to the third class. 
The additive group of all the sequences of integers does not belong to any of 
these classes, as will be proved in the section on vector-groups. 

Theorem 8.4. Assume that all the elements of the group J which are not con-
tained in the subgroup S of J have the same genus s. Then J is the direct sum of S
and rational groups (of genus s) if, and only if,

(a) S is a closed subgroup of J;

(b) the elements ≠ 0 of J/T have the genus s in J/T for every closed subgroup
T of J such that S ≤ T and r(T/S) is finite;

(c) J/S belongs to a class Γ_ν.

Proof. A. Suppose that J = S + R, where R is completely reducible.
Then S is a closed subgroup of J. If S ≤ T ≤ J, then T = S + T’, where
T’ is the intersection of T and R. If T is a closed subgroup of J and r(T/S)
is finite, then T’ is a closed subgroup of R and r(T’) is finite. T’ is by Corollary
4.4 a direct summand of R, i.e., R = T’ + R’ and consequently all the elements
≠ 0 of J/T are of genus s in J/T, since J/T and R’ are isomorphic. Since
finally R and J/S are isomorphic, J/S belongs either to Γ_1 or to Γ_2.

B. Suppose now that the conditions (a) to (c) are satisfied by the subgroup
S of J.

1. r(J/S) = 1.

Then there exists by Corollary 8.2 an element b in J such that b ≢ 0 mod S,
m(b < J) = m(b < J/S). The elements of the closed subgroup b̄ of J, gen-
erated by b, represent therefore exactly the classes of J/S and consequently
J = S + b̄.

2. J/S is countable.

Then there exist closed subgroups T_i of J for −1 ≤ i < r(J/S) such that
S = T_{-1}, T_{i-1} < T_i, r(T_i/T_{i-1}) = 1, and such that J is the join of the groups
T_i (if r(J/S) = m + 1 is finite, then J = T_m). Since r(T_i/S) = i + 1, it
follows from 1 that T_i = T_{i-1} + R_i for −1 < i < r(J/S), where R_i is a rational
group of genus s. Since J is the join of the groups T_i, it follows that

\[
J = S + \sum_{0 \le i < r(J/S)} R_i.
\]

3. J/S belongs to the class Γ_ν.

Since for ν = 1 the theorem has been proved in 2, it can be assumed that the
theorem holds true for subgroups S’ of groups J’ which satisfy (a), (b) and such
that J’/S’ belongs to Γ^ν.

Since J/S belongs to Γ_ν, there exists a closed subgroup W of J such that
S ≤ W, r(W/S) is finite and J* = J/W is the direct sum of groups J*_u belong-
ing to Γ^ν. Let J_u be the subgroup of J which contains W and satisfies J_u/W =
J*_u. Then the subgroup W of J_u satisfies the conditions (a), (b), and since J*_u
belongs to Γ^ν, J_u = W + W_u, where W_u is completely reducible. Suppose now
that b is an element of W, that the elements b_i belong to different groups W_u
and that b + Σ b_i = 0. Then every b_i belongs to W, since J* is the direct
sum of the groups J*_u, i.e., b_i = 0, since 0 is the intersection of W and W_u, i.e.,
b = 0. Thus the groups W, W_u are independent and, since every class of J/W
contains elements of the sum of the groups W_u, it follows that

\[
J = W + \sum_u W_u.
\]










Since r(W/S) is finite, W/S is countable and it follows from the case, treated
in 2, that W = S + V, with V completely reducible, and consequently J =
S + R, with R = V + Σ_u W_u completely reducible.


Corollary 8.5. The group J is a direct sum of isomorphic rational groups
(of genus s), if, and only if,

(a) the elements ≠ 0 of J/T all have the same genus s for every closed subgroup
T of finite rank of J;

(b) J belongs to a class Γ_ν.

This follows from Theorem 8.4 by choosing S = 0. 

Remarks. 1. If J is complete, then J is completely reducible and in this 
case the conditions (a) and (b) are not needed. 

2. If J is a closed subgroup of the group P of all the integer p-adic numbers, 
then J/pJ is a group of order p. Since all the elements ≠ 0 in S have the same
genus, J is directly irreducible, and satisfies condition (b) and a weaker form of 
condition (a). This shows that condition (a) cannot be omitted, since there 
exist closed subgroups J of every finite rank in P. 

3. The additive group of all the sequences of integers furnishes an example 
of a group, satisfying condition (a), but not condition (b), as will be proved in 
the section on vector-groups. 

The following criterion provides a handy method for constructing a basis in 
some simple cases. 

Corollary 8.6. Suppose that n ≠ 1 is the product of a finite number of differ-
ent prime numbers, that n is the genus of those numbers whose infinite part is
divisible by all prime numbers which are relatively prime to n and that J is a direct
sum of a finite number of rational groups of genus n. Then the subset B of J is a
basis of J if, and only if, B consists of multiples of the elements of a basis of J
mod nJ.

Proof. If first B is a basis of J, the elements m(b < J);'b with b in B form 
a basis of J mod nJ. Assume conversely that the elements in B are multiples
of the elements of the basis B’ of J mod nJ. B’ is an independent subset of J,
since every relation between elements of B’ implies a relation mod nJ. Denote
now by B̄ the closed subgroup of J generated by B (or by B’). If w is any
element in J, there exists an element w’ in B̄ such that w ≡ w’ mod nJ. The
congruence nz ≡ w mod B̄ has therefore a solution z in J for every element w,
i.e., J/B̄ = n(J/B̄). Corollary 8.5 implies therefore that J = B̄, i.e., that
B’ and B are greatest independent subsets of J. Consequently there exist
corresponding to every element w ≠ 0 in J relatively prime integers k ≠ 0,
k(b) such that

\[
kw = \sum_{b \in B'} k(b)\, b.
\]

k and n are relatively prime, since B’ is a basis of J mod nJ and therefore a
common divisor of k and n divides every k(b). Hence every element k^{-1}k(b)b
exists in J, i.e., B’ and therefore B is a basis of J.
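
The hypothesis "basis of J mod nJ" can be tested mechanically in a concrete model. The following is a minimal Python sketch, assuming (this is an assumption of the sketch, not a statement of the text) that J is realized as r copies of the group of rationals whose denominators are relatively prime to n, so that J/nJ is a direct sum of r cyclic groups of order n; integer row vectors then form a basis mod nJ exactly when their determinant is relatively prime to n. The names `det` and `is_basis_mod_n` are hypothetical.

```python
from math import gcd


def det(m):
    """Integer determinant by Laplace expansion (adequate for small matrices)."""
    if not m:
        return 1
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, a in enumerate(m[0]):
        if a:
            minor = [row[:j] + row[j + 1:] for row in m[1:]]
            total += (-1) ** j * a * det(minor)
    return total


def is_basis_mod_n(vectors, n):
    """True if the r integer row vectors reduce to a basis of (Z/nZ)^r."""
    r = len(vectors)
    if any(len(v) != r for v in vectors):
        return False
    return gcd(det(vectors), n) == 1
```

For n = 6 the rows (5, 0), (0, 1) pass the test, while (2, 0), (0, 1) fail it because 2 is not relatively prime to 6.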










Note that the finiteness of r(J) is a consequence of the validity of the above 
criterion and of the other assumptions of Corollary 8.6. 

Corollary 8.7. Suppose that all the elements of the group J which are not
contained in the subgroup S of J have the same genus s in J. Then the following
three statements are equivalent.

(a) J is a direct sum of S and a completely reducible group.

(b) J/S is a direct sum of rational groups of genus s.

(c) J/S is separable, belongs to a class Γ_ν and all the elements ≠ 0 in J/S have
the genus s.

Proof. If J is the direct sum of S and the completely reducible group R,
then R is a direct sum of groups of genus s and R and J/S are isomorphic, i.e.,
J/S is a direct sum of rational groups of genus s. If J/S is a direct sum of
rational groups of genus s, then J/S is separable, belongs to Γ_1 or to Γ_2 and all
its elements ≠ 0 have the genus s. If finally J/S is separable, belongs to a
class Γ_ν and all the elements ≠ 0 of J/S have the genus s, it follows from Corol-
lary 4.4 that the conditions (a) to (c) of Theorem 8.4 are satisfied by J and its
subgroup S, and J is therefore the direct sum of S and a completely reducible
group.

Remark. Since there exist direct irreducible groups of rank 2 whose elements
≠ 0 all have the same genus, the words “of genus s” cannot be omitted in (b).

Theorem 8.8. If the subgroup S of the group J and the genus s satisfy:

(a) (S, s < |x|) is the intersection of S and (J, s < |x|);

(b) J(s)* = (J, s ≤ |x|)/(J, s < |x|) is a direct sum of rational groups of
genus s;
then (S, s ≤ |x|) is the direct sum of (S, s < |x|) and a completely reducible
group S(s) and r(S(s)) ≤ r(J(s)*), i.e., S(s) is isomorphic with a direct summand
of J(s)*.

Proof.$^{16}$ By condition (b) and Corollary 8.7a, b there exist rational groups
$R_\nu$ of genus s such that

$$(J, s \le |x|) = (J, s < |x|) + \sum_{\nu} R_\nu.$$

It can be assumed that the indices $\nu$ are the ordinal numbers satisfying
$0 < \nu < \sigma$. Put $J_\nu = (J, s < |x|) + \sum_{\mu < \nu} R_\mu$ for $0 < \nu \le \sigma$ and $S_\nu$ =
intersection of $(S, s \le |x|)$ and $J_\nu$. Then $J_1 = (J, s < |x|)$, $S_1 = (S, s < |x|)$
by (a), $J_{\nu+1} = J_\nu + R_\nu$, $J_\nu$ = join of the groups $J_\mu$ with $\mu < \nu$, if $\nu$ is a limit
ordinal, $J_\sigma = (J, s \le |x|)$, $S_\sigma = (S, s \le |x|)$.

Since $S_{\nu+1}/S_\nu$ is isomorphic with a subgroup of $R_\nu$, it follows therefore by
Corollary 8.2 that either $S_\nu = S_{\nu+1}$ or that $S_{\nu+1}/S_\nu$ is a rational group of genus s.
Thus by Theorem 8.4 $S_{\nu+1} = S_\nu + T_\nu$, where either $T_\nu = 0$ or $T_\nu$ is a rational group
of genus s.


16 A similar proof for the complete reducibility of the subgroups of direct sums of
infinite cyclic groups is due to L. Zippin.







The subgroup S' of $(S, s \le |x|)$ which is generated by $(S, s < |x|)$ and the
groups $T_\nu$ is the direct sum of these groups. $S_1 \le S'$. It can therefore be
assumed that $S_\mu \le S'$ for every $\mu < \nu$. If $\nu = \nu' + 1$ is not a limit number,
then $S_\nu = S_{\nu'} + T_{\nu'} \le S'$. If $\nu$ is a limit ordinal, then $S_\nu$ is the join of the
groups $S_\mu$ with $\mu < \nu$ and consequently $S_\nu \le S'$. Hence $S_\sigma \le S'$, i.e.,

$$(S, s \le |x|) = (S, s < |x|) + \sum_{\nu} T_\nu.$$

This completes the proof of the theorem.

COROLLARY 8.9. Let J be a direct sum of rational groups of genus s. Then
the following three statements are equivalent.

(a) The subgroup S of J is a direct sum of rational groups of genus s.

(b) All the elements $\neq 0$ of the subgroup S of J have in S the genus s.

(c) The subgroup S of J is isomorphic with a direct summand of J.

Proof. (a) implies (b). If (b) is satisfied, the conditions of Theorem 8.8
are satisfied, i.e., (b) implies (c). (c) implies (a), since every direct summand
of J satisfies the conditions of Theorem 8.8.

Remark. This corollary implies in particular that every subgroup of a direct 
sum of infinite cyclic groups is a direct sum of infinite cyclic groups. 

COROLLARY 8.10. If the subgroup S of the group J and the genus s satisfy

(a) $J(s)^* = (J, s \le |x|)/(J, s < |x|)$ is a direct sum of rational groups of
genus s;

(b) $(S, s < |x|)$ is the intersection of S and $(J, s < |x|)$;

(c) the subgroup S(s)* of all those classes of J(s)* which contain elements of
$(S, s \le |x|)$ is a direct summand of J(s)*;
there exist subgroups U, V such that

$$(J, s \le |x|) = U + V + (J, s < |x|), \qquad (S, s \le |x|) = U + (S, s < |x|)$$

and the elements of U represent exactly the classes of S(s)*.

Proof. Since conditions (a), (b) and (c) imply the conditions of Theorem
8.8, it follows that

$$(S, s \le |x|) = (S, s < |x|) + U$$

where U is a direct sum of rational groups of genus s. Furthermore, U represents
exactly the classes of S(s)*. By condition (c), J(s)* = T* + S(s)*. By
condition (a) and Corollary 8.7a, b it follows that

$$(J, s \le |x|) = (J, s < |x|) + J(s)$$

where J(s) is a direct sum of rational groups of genus s. If V is the subgroup
of J(s) which represents the classes of T*, then

$$(J, s \le |x|) = (J, s < |x|) + U + V,$$

and this completes the proof.
Note that the above groups U and V are direct sums of rational groups of
genus s. Furthermore, it is a consequence of Corollary 4.4 that condition (c)
is satisfied, if S(s)* is a closed subgroup of finite rank of J(s)* or if J(s)* is
complete and S(s)* a closed subgroup.


9. Partial reducibility.
Notation. If J is any group, then
A(J) is the set of all the genera s such that

$$J(s)^{**} = (J, |x| \nless s)/(J, |x| \nleq s) \neq 0;$$

B(J) is the set of all the genera s such that

$$J(s)^{*} = (J, s \le |x|)/(J, s < |x|) \neq 0;$$

and C(J) is the set of all the genera s such that elements of genus s exist in J.

$A(J) \le B(J)$, since there exists by (2.4b) a homomorphism of J(s)* upon
the whole group J(s)**, and it follows from (2.4b) that A(J) = B(J), if $(J, s < |x|)$
is always the intersection of $(J, s \le |x|)$ and $(J, |x| \nleq s)$.

$B(J) \le C(J)$, since elements of $(J, s \le |x|)$ which are not contained in
$(J, s < |x|)$ have in J the genus s.

CHAIN CONDITION. If $s_1 \le s_2 \le \cdots \le s_i \le \cdots$ is an ascending chain of
genera $s_i$ in the set G of genera, almost all the $s_i$ are equal.

If the set G of genera satisfies the chain condition, every subset of G satisfies
the chain condition and every subset of G contains a "greatest" element g such
that $g \nless s$ for every genus s in G and the chain condition is conversely a
consequence of this condition.
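
For a finite set of genera the "greatest" elements of the preceding paragraph can be computed directly. The sketch below is an illustration under a simplifying assumption that is not part of the paper: a genus is modelled by the frozenset of primes at which its p-value is infinite, and the order of genera is taken to be set inclusion. The names `maximal_genera`, `chain` and `antichain` are hypothetical.

```python
def maximal_genera(genera):
    """Return the elements g of `genera` such that g < s holds for no s in `genera`.

    Assumption: each genus is a frozenset of primes and '<' is proper inclusion.
    """
    return {g for g in genera if not any(g < s for s in genera)}


# An ascending chain of three genera has exactly one maximal element.
chain = {frozenset({2}), frozenset({2, 3}), frozenset({2, 3, 5})}

# Pairwise incomparable genera: every element is maximal.
antichain = {frozenset({2}), frozenset({3}), frozenset({5})}

chain_top = maximal_genera(chain)
antichain_top = maximal_genera(antichain)
```

In a set of pairwise incomparable genera every ascending chain is constant, so such a set satisfies the chain condition however large it is.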

(9.1) If r(J) is finite, the chain condition is satisfied in C(J) (and therefore in
A(J) and in B(J)).

For if $|b| = s < t$, then $(J, t \le |x|) < (J, s \le |x|)$ and since $(J, g \le |x|)$
is a closed subgroup of J and r(J) is finite, this implies
$r((J, t \le |x|)) < r((J, s \le |x|))$, and thus the chain condition holds in C(J), if r(J) is finite.$^{17}$

It may happen that r(J) is finite and B(J) is infinite. This is shown by the
following

Example 9.2. Let $p_1, p_2, \dots$ be an enumeration of the prime numbers and
J a group of rank 2 which contains two independent elements b' and b'' such
that the p-value of $m(b' + p_i b'')$ is 0, if $p \neq p_i$, and infinite, if $p = p_i$. If
$\mathbf{p}_i$ is the genus of $p_i$, then $0 = (J, \mathbf{p}_i < |x|)$, and $(J, \mathbf{p}_i \le |x|)$ consists of all
the rational multiples of $b' + p_i b''$, i.e., B(J) contains every $\mathbf{p}_i$ and is therefore
infinite.

THEOREM 9.3. If the chain condition is satisfied in B(J), then the following
three propositions are equivalent.

(a) J is partially reducible. 

(b) J has the following properties: 





17 Another consequence of this argument is that also the "descending chain condition"
holds in C(J).













(b1) for every genus s, $(J, s < |x|)$ is a direct summand of $(J, s \le |x|)$;

(b2) for every genus s, $(J, s < |x|)$ is the intersection of $(J, s \le |x|)$ and
$(J, |x| \nleq s)$;

(b3) corresponding to every element $b \neq 0$ in J there exists at least one genus
s such that $b \equiv 0 \bmod (J, |x| \nless s)$, $b \not\equiv 0 \bmod (J, |x| \nleq s)$;

(b4) to every element b of J there exists at most a finite number of genera s
such that $b \equiv 0 \bmod (J, |x| \nless s)$, $b \not\equiv 0 \bmod (J, |x| \nleq s)$.

(c) If for every genus s the subgroup J(s) of J satisfies

$$(e_s) \qquad (J, s \le |x|) = (J, s < |x|) + J(s),$$

then J is the direct sum of these groups J(s).$^{18}$

Proof. 1. If J is partially reducible, it follows from (2.6) and (2.7) that J
fulfills the conditions (b1) to (b4).

2. If J satisfies (b2) and J(s) is a solution of $(e_s)$, then J(s) is by (2.4) also
a solution of

$$(\bar{e}_s) \qquad (J, |x| \nless s) = J(s) + (J, |x| \nleq s).$$

3. Suppose that J satisfies (b2) and that, for every genus s, J(s) is a solution
of $(e_s)$. If $b_1, \dots, b_k$ are a finite number of elements $\neq 0$ which belong to
different groups J(s), there exists one among them, say $b_1$, such that
$|b_i| \nless |b_1|$. Then

$$b = \sum_{i=1}^{k} b_i \equiv 0 \bmod (J, |x| \nless |b_1|), \qquad b \equiv b_1 \not\equiv 0 \bmod (J, |x| \nleq |b_1|),$$

since, as proved in 2, $J(|b_1|)$ is a solution of $(\bar{e}_{|b_1|})$. Hence $b \neq 0$ and this
implies

(9.3.3) If J satisfies (b2) and if, for every genus s, J(s) is a solution of $(e_s)$,
then the subgroup of J, generated by the groups J(s), is their direct sum.

4. Suppose that J satisfies the conditions (b1) to (b4) and that, for every
genus s, J(s) is a solution of $(e_s)$.

If $b \neq 0$ is an element of J, denote by A(b) the set of all the genera s such that
$b \equiv 0 \bmod (J, |x| \nless s)$, $b \not\equiv 0 \bmod (J, |x| \nleq s)$. Denote by J' the direct
sum of the groups J(s) (which exists by (9.3.3) in J).

Since every J(s) is also a solution of $(\bar{e}_s)$, there exists to every element $b \neq 0$
and to every genus s in A(b) a uniquely determined element b(s) in J(s) such
that $b \equiv b(s) \bmod (J, |x| \nleq s)$. Since by (b4) the set A(b) is finite, it is possible
to form the sum b' of all the elements b(s) with s in A(b).

Put $b'' = b - b'$ and let w be a genus such that $s \nless w$ for every s in A(b).
Since every $b(s) \neq 0$ and therefore every b(s) is an element of genus s, this
implies that $b \equiv b'' \bmod (J, |x| \nless w)$. If $w \neq s$ for every s in A(b), then
$b(s) \equiv 0 \bmod (J, |x| \nleq w)$, i.e., $b'' - b \equiv 0 \bmod (J, |x| \nleq w)$ and w does
not belong to A(b''). If w is an element of A(b), then $b \equiv b(w) \bmod (J, |x| \nleq w)$
and therefore $b'' \equiv 0 \bmod (J, |x| \nleq w)$, i.e., w does not belong to A(b''). Thus
it has been proved that $b \equiv b'' \bmod J'$, and every element of A(b'') is a true
multiple of at least one genus in A(b).

18 And there exist solutions of the equations $(e_s)$.

5. Suppose finally that the chain condition is satisfied by B(J) and that J
satisfies (b1) to (b4). There exist solutions J(s) of $(e_s)$, their direct sum J'
is a subgroup of J and every element b of J is congruent mod J' to an element
b' of J such that every genus in A(b') is a true multiple of at least one genus in
A(b). Since A(u) is vacuous if, and only if, u = 0, and since every A(u) is finite
and a subset of B(J) = A(J), it follows from the chain condition that J = J',
i.e., (b) implies (c).

6. Since the decomposition of J into the solutions J(s) of $(e_s)$ is a partial
reduction of J, (c) implies (a).

That (a) is not a consequence of (b), if the chain condition in B(J) is not
satisfied, is shown by the following

Example 9.4. Let $p_1, p_2, \dots$ be an enumeration of the prime numbers,
$R_i$ a rational group of genus $r_i = \left|\prod_{j \le i} p_j\right|$, J the additive group of all the vectors
whose i-th coordinate is an element of $R_i$.
The subgroup of all the vectors of J whose coordinates except the i-th are 0
is isomorphic with $R_i$ and may be identified with $R_i$.
Since $r_{i-1} < r_i$, a vector v of J whose i-th coordinate is the first coordinate
$\neq 0$ has the genus $r_i$ in J. Thus C(J) consists exactly of the genera $r_i$,

$$(J, r_i \le |x|) = (J, |x| \nless r_i) = R_i + (J, r_i < |x|), \qquad (J, r_i < |x|) = (J, r_{i+1} \le |x|),$$

and therefore A(J) = B(J) = C(J).

J satisfies all the conditions (b1) to (b4). But since J contains a continuum
of elements and the direct sum of the groups $R_i$ is countable, J is neither
completely nor partially reducible.$^{19}$

That (c) is not a consequence of (a)—and even not of complete reducibility— 
if the chain condition in B(J) does not hold is shown by 

Example 9.5. Denote by $s_1 < s_2 < \cdots < s_i < \cdots$ some ascending chain
of genera, by $J_i$ a rational group of genus $s_i$ and by J the direct sum of the
groups $J_i$. If $b_i \neq 0$ is an element of $J_i$, these elements $b_i$ form a basis of J.
Put $w_i = b_i + b_{i+1}$ and let $\bar{w}_i$ be the closed subgroup of J generated by $w_i$.
The groups $\bar{w}_i$ form a complete set of solutions $\neq 0$ of the equations $(e_s)$, if
$m(b_i) \mid m(b_{i+1})$, but their direct sum W does not contain $b_1$ and is $< J$.
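
The claim that W misses the first basis element reduces to linear algebra over the rationals: an element of W involves only finitely many of the $w_i$, so it would lie in the rational span of $w_1, \dots, w_n$ for some n. The sketch below is an illustration and not part of the paper; it checks for small n that $b_1$ is not in that span, using coordinates with respect to $b_1, \dots, b_{n+1}$. The helper names `rank`, `w_vectors` and `b1_in_span` are hypothetical.

```python
from fractions import Fraction


def rank(rows):
    """Rank over the rationals, by Gaussian elimination with exact arithmetic."""
    m = [[Fraction(x) for x in row] for row in rows]
    ncols = len(m[0]) if m else 0
    r = 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(m)) if m[i][c] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(r + 1, len(m)):
            f = m[i][c] / m[r][c]
            m[i] = [a - f * b for a, b in zip(m[i], m[r])]
        r += 1
    return r


def w_vectors(n):
    """Coordinates of w_1, ..., w_n (w_i = b_i + b_{i+1}) in the basis b_1, ..., b_{n+1}."""
    return [[1 if j in (i, i + 1) else 0 for j in range(n + 1)] for i in range(n)]


def b1_in_span(n):
    """True if b_1 is a rational combination of w_1, ..., w_n."""
    ws = w_vectors(n)
    b1 = [1] + [0] * n
    return rank(ws + [b1]) == rank(ws)


results = [b1_in_span(n) for n in range(1, 8)]
```

Every entry of `results` is `False`: adjoining $b_1$ to $w_1, \dots, w_n$ always raises the rank from n to n + 1.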

That (b3) is not a consequence of the chain condition in B(J) and the
conditions (b1), (b2), (b4), is shown by

Example 9.6. Let $p_1, p_2, \dots$ be an enumeration of the prime numbers,
J' the group generated by the elements $(i_1, \dots, i_k)$ for $k = 1, 2, \dots$, $i_j = 0, 1$,
satisfying the relations $(i_1, \dots, i_k) = (i_1, \dots, i_k, 0) + (i_1, \dots, i_k, 1)$ for
$0 < k$. J is the smallest group between J' and some complete group such that
the p-value of $m((i_1, \dots, i_k) < J)$ is

$$\begin{cases} 0, & \text{if } p \neq p_{2j+i_j} \text{ for every } j \le k, \\ \infty, & \text{if } p = p_{2j+i_j} \text{ for some } j \le k. \end{cases}$$

Every element $(i_1, \dots, i_k)$ is therefore in J the sum of two elements whose
genera are true multiples of $|(i_1, \dots, i_k)|$. Consequently $(J, s < |x|) =
(J, s \le |x|)$ and $(J, |x| \nless s) = (J, |x| \nleq s)$ for every genus s, i.e., A(J)
and B(J) are both vacuous. Thus J is a countable group, the chain condition is
satisfied in A(J) and in B(J), conditions (b1), (b2) and (b4) are satisfied in J.
But (b3) is not satisfied.

19 The separability of this group J will be proved in section 11.

COROLLARY 9.7. If r(J) is finite, the following propositions are equivalent.

(a) J is partially reducible.

(b) For every genus s, $(J, s < |x|)$ is a direct summand of $(J, s \le |x|)$ and

$$r(J(s)^*) = r(J(s)^{**}/F(J(s)^{**})).$$

(c) For every genus s, $(J, s < |x|)$ is a direct summand of $(J, s \le |x|)$ and is
the intersection of $(J, s \le |x|)$ and $(J, |x| \nleq s)$.
(d) If for every genus s the subgroup J(s) of J satisfies

$$(d_s) \qquad (J, s \le |x|) = J(s) + (J, s < |x|),$$

then J is the direct sum of these groups J(s).$^{20}$

Proof. 1. If J is partially reducible, then J(s)* and J(s)** are by (2.6) 
isomorphic groups and thus (b) is a consequence of (a) and (2.6). 

2. Suppose that r(J) is finite and that J satisfies (b). There exists by (2.4)
a homomorphism of J(s)* upon J(s)** and therefore there exists a homo- 
morphism a of J(s)* upon J(s)**/F(J(s)**). If W* is the subgroup of J(s)* 
which is mapped upon 0 by a, then J(s)*/W* and J(s)**/F(J(s)**) are iso- 
morphic. Since therefore W* is a closed subgroup of J(s)* and 


r(J(s)*) = r(W*) + r(J(s)**/F(J(s)**)) = r(W*) + r(J(s)*), 


and since all the ranks are finite, it follows that W* = 0. The homomorphism 
of J(s)* upon J(s)** is therefore an isomorphism and it follows from (2.4) that 
J satisfies (c). 

3. Suppose that r(J) is finite and that J satisfies (c). Then every subgroup
$(J, s \le |x|)$ has finite rank and satisfies (c).

If r(J) = 1, then (c) and (d) are satisfied. It can therefore be assumed
that the group J' satisfies (d), if r(J') < r(J) and if J' satisfies (c).

Suppose now that the subgroups J(s) of J are solutions of (ds) and that J’ 
is the subgroup of J generated by the subgroups J(s). Then it follows from 
(c) and (9.3.3) that J’ is the direct sum of the groups J(s). 

Let now $b \neq 0$ be an element of J and g its genus in J. Then $(J, t \le |x|) < J$
and therefore $r((J, t \le |x|)) < r(J)$ for every genus $g < t$. Consequently
every $(J, t \le |x|)$ with $g < t$ is the direct sum of the groups J(s) with $t \le s$.
Since $(J, g < |x|)$ is the join of the groups $(J, t \le |x|)$ with $g < t$, $(J, g < |x|)$
is the direct sum of the groups J(s) with $g < s$ and $(J, g \le |x|)$ is therefore
the direct sum of the groups J(s) with $g \le s$, i.e., b is contained in J', i.e.,
J = J' is the direct sum of the groups J(s).

20 And there exist solutions of the equations $(d_s)$.

4. (d) implies (a), since the decomposition of J into the solutions of $(d_s)$ is a
partial reduction of J.

COROLLARY 9.8. Suppose that B(J) satisfies

(0) If s and t are two genera in B(J), then either $s < t$ or $s = t$ or $t < s$.

(I) If the chain condition is satisfied in B(J), the propositions (a) to (c) of
Theorem 9.3 are true in J if, and only if,

(I') for every genus s, $(J, s < |x|)$ is a direct summand of $(J, s \le |x|)$,
(I'') elements $b \neq 0$ are not contained in $(J, |b| < |x|)$.

(II) If r(J) is finite, the propositions (a) to (d) of Corollary 9.7 are true in J if,
and only if, for every genus s, $(J, s < |x|)$ is a direct summand of $(J, s \le |x|)$.

Proof. If J is partially reducible and satisfies (0), $b = \sum_{i=1}^{k} b_i$ is an element
$\neq 0$ of J and the elements $b_i \neq 0$ belong to different components of a smallest
partial reduction of J, then $|b_1| < \cdots < |b_k|$ and $|b| = |b_1|$, i.e., (I') and
(I'') are satisfied by J.

If conversely (0), (I') and (I'') are satisfied in J, then A(J) = B(J) = C(J),
$(J, s \le |x|) = (J, |x| \nless s)$, $(J, s < |x|) = (J, |x| \nleq s)$ and conditions
(b1) to (b4) of Theorem 9.3 are therefore satisfied by J.

If r(J) is finite and there exists for every genus s a solution J(s) of $(d_s)$ in
Corollary 9.7, the subgroup J' of J, generated by these groups J(s), is by (0)
their direct sum. If $b \neq 0$ is an element of genus g, then $(J, g < |x|)$ is the
join of the groups $(J, t \le |x|)$ with $g < t$. Then $r((J, t \le |x|)) < r(J)$ and
it can therefore be assumed that $(J, t \le |x|)$ is the direct sum of the groups
J(s) with $g < t \le s$, i.e., $(J, g < |x|)$ and consequently $(J, g \le |x|)$ are
subgroups of J', i.e., b is an element of J', i.e., J = J' is the direct sum of the
groups J(s).

Note that the examples 9.4 to 9.6 satisfy (0). 

COROLLARY 9.9. Suppose that the chain condition is satisfied in B(J). Then
J is partially reducible if, and only if,

(a) for every genus s, $(J, s < |x|)$ is a direct summand of $(J, s \le |x|)$;

(b) for every genus s, $(J, s < |x|)$ is the join of the groups $(J, t \le |x|)$ with
$s < t$ and t in B(J);

(c) for every genus s, $(J, s < |x|)$ is the intersection of $(J, s \le |x|)$ and
$(J, |x| \nleq s)$.

Proof. The necessity of the conditions (a) and (b) is a consequence of (2.6)
and the necessity of (c) a consequence of (2.7).

If conversely (a) to (c) and the chain condition in B(J) are satisfied, then
let, for every genus s, J(s) be a subgroup of J such that $(J, s \le |x|) = J(s) +
(J, s < |x|)$. The join J' of these groups is by (c) and (9.3.3) their
direct sum.










Let W be the set of all those genera s in B(J) such that $(J, s \le |x|) \nleq J'$.
If W is not vacuous, there exists by the chain condition a "greatest" genus
w in W. $(J, w < |x|)$ is by (b) the join of the groups $(J, t \le |x|)$ with $w < t$
and t in B(J) and since these genera t are not contained in W, it follows that
$(J, w < |x|) \le J'$ and consequently that $(J, w \le |x|) \le J'$. Thus W is
vacuous, i.e., $(J, s \le |x|) \le J'$, if s in B(J). If z is the genus of the number 1,
then $J = (J, z \le |x|) = J(z) + (J, z < |x|)$ and by (b) it follows that
$(J, z < |x|) \le J'$, i.e., that J = J' is the direct sum of the groups J(s).

THEOREM 9.10. If the subgroup S of the partially reducible group J satisfies

(a) the chain condition in B(S);

(b) for every genus s, $(S, s < |x|)$ is a direct summand of $(S, s \le |x|)$;

(c) (S, f(x)) is the intersection of S and (J, f(x)) for every $f(x) = (s \le |x|)$,
$(s < |x|)$, $(|x| \nless s)$, $(|x| \nleq s)$;
then S is partially reducible and every S(s)* is isomorphic with a subgroup of J(s)*.

Proof. J satisfies the conditions (a)—(c) of Corollary 9.9 since J is partially 
reducible. Thus conditions (a)—(c) of Theorem 9.10 imply that S satisfies the 
conditions of Corollary 9.9 and S is therefore partially reducible. 

COROLLARY 9.11. Suppose that S is a subgroup of the partially reducible
group J and that the chain condition is satisfied in B(J). Then S is a direct
summand of J if, and only if,

(a) for every genus s, $(S, s < |x|)$ is the intersection of S and $(J, s < |x|)$;

(b) for every genus s, the elements of $(S, s \le |x|)$ represent a direct summand
of J(s)*;

(c) S is partially reducible.

Proof. Suppose first that S is a direct summand of J. Then (a) and (b)
are satisfied, since (S, f(x)) is a direct summand of (J, f(x)) (for the discussed
functions f(x)), and since therefore the conditions (a)-(c) of Theorem 9.10
are satisfied in S and J, S is also partially reducible.

Suppose now that the subgroup S of J satisfies the conditions (a)-(c). Then
there exists a smallest partial reduction of S, $S = \sum_{s} S(s)$, and the subgroup
S(s) of S represents exactly the subgroup S(s)* of J(s)* which is represented
by elements of $(S, s \le |x|)$. By (b), J(s)* = S(s)* + T(s)*. There exists
furthermore a partial reduction of J and if J(s) satisfies $(J, s \le |x|) = J(s) +
(J, s < |x|)$, denote by T(s) the subgroup of J(s) which represents the classes
of T(s)*. Then

$$(J, s \le |x|) = S(s) + T(s) + (J, s < |x|),$$

and therefore by Theorem 9.3

$$J = \sum_{s} (S(s) + T(s)) = \sum_{s} S(s) + \sum_{s} T(s) = S' + T,$$

and since the chain condition is also satisfied in $B(S) \le B(J)$, it follows from
Theorem 9.3 that S = S' is a direct summand of J.













Note that it is possible to substitute for condition (c) of the Corollary 9.11 
the condition (c) of Theorem 9.10. 


10. Complete reducibility. It is the object of this section to combine the 
results of the two preceding sections. 

THEOREM 10.1. The group J is completely reducible if, and only if, 

(a) for every genus s, J(s)* is a direct sum of rational groups of genus s; 

(b) for every genus s, $(J, s < |x|)$ is the intersection of $(J, s \le |x|)$ and
$(J, |x| \nleq s)$;

(c) the set A(b) of all the genera s such that

$$b \equiv 0 \bmod (J, |x| \nless s), \qquad b \not\equiv 0 \bmod (J, |x| \nleq s)$$

is for every element $b \neq 0$ of J finite and not vacuous;

(d) J is the direct sum of groups $J_\nu$ such that the chain condition is satisfied in
every $B(J_\nu)$.

This is a consequence of Theorem 9.3 and Corollary 8.7, since the conditions 
(a)—(c) are satisfied in a direct sum if, and only if, they are satisfied in all the 
direct summands. 

COROLLARY 10.2. The group J of finite rank is completely reducible if, and
only if, for every genus s

(a) J(s)* is a direct sum of rational groups of genus s;

(b) r(J(s)*) = r(J(s)**/F(J(s)**)).

This is a consequence of Corollaries 9.7 and 8.7.

By Corollary 9.7 it is possible to substitute for (b) the condition

(b') $(J, s < |x|)$ is the intersection of $(J, s \le |x|)$ and $(J, |x| \nleq s)$.

THEOREM 10.3. If the subgroup S of the completely reducible group J satisfies

(a) the chain condition holds in B(S);

(b) (S, f(x)) is the intersection of S and (J, f(x)) for every $f(x) = (s \le |x|)$,
$(s < |x|)$, $(|x| \nless s)$, $(|x| \nleq s)$;
then S is completely reducible and isomorphic with a direct summand of J.

This is a consequence of Theorems 9.10 and 8.8. 

An obvious consequence of Theorem 10.3 is the 

COROLLARY 10.4. Every direct summand D of the completely reducible group J
such that the chain condition holds in B(D) is completely reducible.

COROLLARY 10.5. Suppose that S is a subgroup of the completely reducible
group J and that the chain condition holds in B(J). Then S is a direct summand
of J if, and only if,

(a) for every genus s, $(S, s < |x|)$ is the intersection of S and $(J, s < |x|)$;

(b) for every genus s, $(S, |x| \nleq s)$ is the intersection of S and $(J, |x| \nleq s)$;

(c) for every genus s the elements of $(S, s \le |x|)$ represent a direct summand
of J(s)*.

This is a consequence of Corollary 8.10, Theorem 9.3 and Corollary 9.9 (since
every condition used is satisfied in J).














COROLLARY 10.6. Let J be a completely reducible group. Then every subgroup
S of J, satisfying for every genus s the three conditions

(a) $(S, s < |x|)$ is the intersection of S and $(J, s < |x|)$;

(b) $(S, |x| \nleq s)$ is the intersection of S and $(J, |x| \nleq s)$;

(c) the elements of $(S, s \le |x|)$ represent a closed subgroup of J(s)*;
is a direct summand of J if, and only if,

(1) the chain condition is satisfied in B(J);

(2) for every genus s, J(s)* is complete or r(J(s)*) is finite.

Proof. The necessity of (1) may be derived from Example 9.5 and the 
necessity of (2) is a consequence of Corollary 3.6. If (1) and (2) are satisfied, 
every subgroup S, satisfying (a)-(c), by Corollary 3.6 satisfies the conditions of 
Corollary 10.5 and is therefore a direct summand of J. 





11. Separability. 

Lemma 11.1. The group J is separable if, and only if, 

(a) every finite subset of J is contained in a partially reducible direct summand 
of J; 

(b) $J(s)^* = (J, s \le |x|)/(J, s < |x|)$ is separable for every genus s.

Proof. Suppose first that J is separable. Then (a) is satisfied, since every
complete reduction of a group is also a partial reduction. If F* is a finite
subset of J(s)*, F a subset of J representing F*, then F is contained in a completely
reducible direct summand D of J. In particular, $(D, s \le |x|)$ is the direct
sum of $(D, s < |x|)$ and rational groups of genus s, $(D, s \le |x|)$ is a direct
summand of $(J, s \le |x|)$ and $(D, s < |x|)$ a direct summand of $(J, s < |x|)$.
The elements of $(D, s \le |x|)$ represent therefore a direct summand of J(s)*
which contains F* and is completely reducible, i.e., J(s)* is separable.

Assume now that the conditions (a) and (b) are satisfied by J. Then it
follows from (2.7) that every element $\neq 0$ of J(s)* has the genus s in J(s)*
and consequently every closed subgroup of finite rank of J(s)* is by Corollary
4.4 a direct summand of J(s)* and by Corollary 4.3 a direct sum of a finite
number of rational groups of genus s. If now F is a finite subset of J, then
F is contained in a partially reducible direct summand D of J. If $D = \sum_{s} D(s)$
is a smallest partial reduction of D, the components of elements of F in D(s)
form a finite subset F(s) of D(s). Since D(s) represents exactly a direct
summand of J(s)*, it follows from the above that every closed subgroup of finite
rank of D(s) is a completely reducible direct summand of D(s). The closed
subgroup $\bar{F}(s)$ of D(s), generated by F(s), is therefore a completely reducible
direct summand of D(s) and F is contained in a completely reducible direct
summand of J, i.e., J is separable.

A consequence of this proof is the 

COROLLARY 11.2. The group J is separable if, and only if,

(a) every finite subset of J is contained in a partially reducible direct summand
of J;

(b') every closed subgroup of finite rank of J(s)* is a direct summand of J(s)*
(and a direct sum of rational groups of genus s).

THEOREM 11.3. Suppose that the chain condition is satisfied in B(J). Then
the group J is separable if, and only if,

(a) for every genus s, $(J, s < |x|)$ is the intersection of $(J, s \le |x|)$ and
$(J, |x| \nleq s)$;

(b) for every genus s, J(s)* is a separable group and all the elements $\neq 0$ of
J(s)* have the genus s in J(s)*;

(c) for every genus s, every subgroup S satisfying

(c1) every element $\neq 0$ of S has the genus s in J,

(c2) r(S) is finite,

(c3) S is a direct summand of $(J, |x| \nless s)$,

is a direct summand of J;

(d) the set A(b) of the genera s such that

$$b \equiv 0 \bmod (J, |x| \nless s), \qquad b \not\equiv 0 \bmod (J, |x| \nleq s)$$

is for every element $b \neq 0$ in J finite and not vacuous.

Proof. 1. If J is separable, then J satisfies (a), (b) and (d) by Lemma 11.1
and (2.7). If furthermore the subgroup S of J satisfies (c1)-(c3), then S is by
Theorem 4.2 a direct summand of J.

2. If the chain condition is satisfied in B(J) and J fulfills (a)-(d), then the
same holds true for every direct summand of J. Thus it is sufficient to prove:
if J satisfies (a)-(d) and if the chain condition holds in B(J), every element of J
is contained in a completely reducible direct summand.

3. Suppose that J satisfies the conditions (a)-(c) and that F is a primitive
set in J (see Definition 5.1).

Assume first that F consists of exactly one element b and that s is the genus
of b. Denote by $\bar{b}$ the closed subgroup of J generated by b, and by $\bar{b}^*$ the
subgroup of J(s)* represented by elements of $\bar{b}$. Since b is a primitive element
of its genus s, $\bar{b}^*$ is a closed subgroup of rank one of J(s)* and by (b) and
Corollary 4.4 therefore a direct summand of J(s)*, i.e., $J(s)^* = \bar{b}^* + T^*$. If T
is the subgroup of $(J, s \le |x|)$ which contains $(J, s < |x|)$ and satisfies
$T^* = T/(J, s < |x|)$, then $(J, s \le |x|) = \bar{b} + T$, since the elements of $\bar{b}$ represent
exactly the classes of $\bar{b}^*$. If T' is the subgroup of $(J, |x| \nless s)$ generated by
T and $(J, |x| \nleq s)$, it follows from (a) and (2.4) that $(J, |x| \nless s) = \bar{b} + T'$,
and (c) implies therefore that $\bar{b}$ is a direct summand of J.

Suppose now that F contains k elements $b_1, \dots, b_k$, $1 < k$. Then there
exists an element $b_i$, say $b_k$, such that $|b_i| \nless |b_k|$. As proved above,
$J = \bar{b}_k + J'$. J' contains the elements $b_1, \dots, b_{k-1}$, since $(J, s \le |x|) =
(J', s \le |x|)$ for $s \nleq |b_k|$. Since J' satisfies (a)-(c) as a direct summand of J, and
since the elements $b_1, \dots, b_{k-1}$ form a primitive set of $k - 1 < k$ elements in
J', it can be assumed by complete induction that the elements $b_1, \dots, b_{k-1}$
form a basis of a direct summand of J' and consequently it has been proved
that F is a basis of a direct summand of J.



















4. Suppose that J satisfies the conditions (b) and (d) and that $b \neq 0$ is an
element of J. Then there exists corresponding to every genus s in A(b) by (2.4)
an element b'(s) such that

$$b \equiv b'(s) \bmod (J, |x| \nleq s), \qquad b'(s) \equiv 0 \bmod (J, s \le |x|),$$

and there exists by Corollary 8.2 a primitive element b(s) of genus s in J such
that $b'(s) \equiv b(s) \bmod (J, s < |x|)$. The elements b(s) with s in A(b) form a
primitive set in J. If b' is the sum of this primitive set, then every genus in
A(b - b') is a true multiple of at least one genus in A(b), since for a genus s in
A(b), $b - b' \equiv 0 \bmod (J, |x| \nleq s)$ holds and for a genus t such that $s \nless t$ for
every s in A(b), $b \equiv 0 \bmod (J, |x| \nleq t)$ holds, and therefore
$b - b' \equiv 0 \bmod (J, |x| \nleq t)$.

Since $A(b) \le A(J) \le B(J)$, it follows from this result that if the conditions
(b) and (d) are satisfied in J and if the chain condition holds in B(J), every
element $\neq 0$ of J is the sum of a primitive set in J.

5. If J satisfies (a)-(d) and if the chain condition holds in B(J), every element
of J is contained in a completely reducible direct summand of J, as has been
proved in 3 and 4. Thus it follows from a remark made in 2 that J is separable.

COROLLARY 11.4. If S is a direct summand of the separable group J and if the
chain condition holds in B(S), then S is separable.

For J satisfies as a separable group the conditions (a)—(d) of Theorem 11.3 
and consequently the direct summand S satisfies these conditions too and is 
therefore by Theorem 11.3 separable. 

THEOREM 11.5. If J is a separable group, if every group J(s)* belongs to a class
$\Gamma_i$, and if the chain condition holds in B(J), then J is completely reducible.

Proof. Every group J(s)* is by Corollary 8.7, Lemma 11.1 and (2.7) a 
direct sum of rational groups of genus s. J is therefore by (2.7) and Theorem 
10.1 completely reducible. 

Remark. Since it can be proved that the group J of Example 9.4 is separable, 
the chain condition cannot be omitted in Theorem 11.5. 

A proof of the separability of the mentioned group J runs as follows.

1. If $v \neq 0$ is an element of J, put z(v) = minimum of the $m(v_i < R_i)$ with
$v_i \neq 0$.

2. If $v \neq 0$, z(v) = 1, let k be the index such that $z(v) = m(v_k < R_k)$. Then
$v = v' + v''$ with $v'_i = v_i$, $v''_i = 0$ for $i < k$, and $v'_i = 0$, $v''_i = v_i$ for $k \le i$.
If $\bar{v}''$ is the closed subgroup of J generated by v'', then

$$J = \sum_{i=1}^{k-1} R_i + \bar{v}'' + (J, r_k < |x|),$$

and v is therefore contained in a direct summand of finite rank.

3. If $1 < z(v)$, then $v = v' + v'' + v'''$ with

$$v'_i = v_i, \quad v''_i = 0, \quad v'''_i = 0 \qquad (i < k),$$

$$v'_i = 0, \quad v''_i = z(v) z_i w_i, \quad v'''_i = z'_i w_i \qquad (k \le i),$$

where $w_i = m(v_i < R_i)^{-1} v_i$, if $v_i \neq 0$, $w_i = 0$, if $v_i = 0$, and

$$m(v_i < R_i) = z(v) z_i + z'_i, \quad 0 \le z'_i < z(v), \quad \text{if } k \le i,\ v_i \neq 0.$$

Then $z(v''') < z(v)$, for v''' an element of $(J, r_k < |x|)$ and

$$J = \sum_{i=1}^{k-1} R_i + \bar{v}'' + (J, r_k < |x|).$$

Since the same argument as on J can be applied on $(J, r_k < |x|)$, by complete
induction there exists a completely reducible subgroup V of finite rank and an
index i such that v is contained in V and $J = V + (J, r_i < |x|)$.

4. The separability of J follows now by complete induction with regard
to the number of elements contained in the given finite subset. Note furthermore
that there exists a continuum of different genera and that therefore
Theorem 11.5 is not a consequence of Theorem 4.7, even if the groups J(s)*
are all countable.

THEOREM 11.6. The class X of groups is the class of all the separable groups if,
and only if, X is the greatest class of groups such that

(a) the elements $\neq 0$ of a group J in X are sums of primitive sets in J;

(b) every primitive set in a group J in X is a basis of a direct summand of J;

(c) if J belongs to X and S is a direct summand of finite rank of J, then J/S
belongs to X.

Proof. The class of all the separable groups satisfies (a)—(c) by Lemma 4.6, 
Lemma 5.2 and Lemma 5.8. If on the other hand the class X satisfies (a)—(c), 
all the groups in X are separable, i.e., the class of all the separable groups is the
greatest class satisfying (a)—(c). 

If n is any finite or infinite cardinal number, C a complete group of rank n, 
then a group J is isomorphic with a subgroup of C if, and only if, $r(J) \le n$.
Therefore it is possible to transform Theorem 11.6 into the following criterion 
which does not use concepts conflicting with the antinomies of the theory of sets. 

COROLLARY 11.7. If n is any finite or infinite cardinal number, C a complete
group of rank n and X the greatest class of subgroups of C which satisfies

(a) the elements $\neq 0$ of a group J in X are sums of primitive sets in J;

(b) every primitive set in a group J in X is a basis of a direct summand of J;

(c) if J belongs to X and S is a direct summand of finite rank of J, every
subgroup of C (or J) which is isomorphic with J/S belongs to X;

a group J is isomorphic with a group in X if, and only if, J is separable and
$r(J) \le n$.


12. Vector-groups. 

DEFINITIONS 12.1. If $\phi = (G_1, \dots, G_v, \dots)$ is a set of (equal or different)
groups, a vector (in $\phi$) is a single-valued function (of the indices $v$) such that the
$v$-th coordinate $f_v$ of the vector $f$ is an element of the group $G_v$.

The sum $f + g$ of the two vectors $f$ and $g$ is defined by $(f + g)_v = f_v + g_v$.

Thus the set of all the vectors in $\phi$ becomes the additive group $V = V(\phi)$ of all
the vectors in $\phi$.
















The group of all the (countable) sequences of integers is an example of a 
vector-group. Another example is the group discussed as Example 9.4. 

If $V$ is a vector-group, then

$V'$ is the subgroup of all those vectors $f$ such that almost every coordinate of
$f$ is 0;

$V_v$ is the subgroup of all those vectors $f$ such that every coordinate but the $v$-th
coordinate is 0;

$V_v'$ is the subgroup of all those vectors $f$ such that the $v$-th coordinate is 0,

and $V = V_v + V_v'$ for every $v$, $V' = \sum_v V_v$.
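The definitions above can be made concrete in a small sketch. It is only an illustration under stated assumptions: every group $G_v$ is taken to be the additive group of integers, a vector is stored as a dictionary from indices to non-zero coordinates (so only vectors of $V'$, with almost every coordinate 0, are representable), and the names `Vector`, `add` and `split` are hypothetical, not notation of the paper.

```python
from typing import Dict, Tuple

# Assumption: every G_v is the additive group of integers.
# A vector is stored sparsely: index v -> coordinate f_v, missing keys mean 0.
Vector = Dict[int, int]


def add(f: Vector, g: Vector) -> Vector:
    """Coordinatewise sum (f + g)_v = f_v + g_v, dropping zero coordinates."""
    out = dict(f)
    for v, g_v in g.items():
        s = out.get(v, 0) + g_v
        if s:
            out[v] = s
        else:
            out.pop(v, None)
    return out


def split(f: Vector, v: int) -> Tuple[Vector, Vector]:
    """Write f as (component in V_v, component in V_v'), mirroring V = V_v + V_v'."""
    in_v = {v: f[v]} if v in f else {}
    in_v_prime = {k: c for k, c in f.items() if k != v}
    return in_v, in_v_prime


f = {0: 2, 3: -1}
g = {3: 1, 5: 4}
h = add(f, g)        # the coordinate with index 3 cancels
a, b = split(h, 5)   # a lies in V_5, b lies in V_5'
```

Adding the two components returned by `split` gives back the original vector, which is the content of $V = V_v + V_v'$ in this finite-support model.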

The following proposition is easily verified: 

(12.2) The group $A$ is isomorphic with a subgroup $B$ of the vector-group $V(\phi)$,
satisfying $V' \le B \le V$ if, and only if, there exist pairs of subgroups $A_v$, $A_v'$ of $A$
such that $A_v$ and $G_v$ are isomorphic; $A = A_v + A_v'$ for every $v$; $A_v \le A_u'$ for $v \neq u$;
the intersection of the groups $A_v'$ is 0.²¹

Finally $V' = V(\phi)$ if, and only if, $\phi$ is finite.²²

If n is a finite or infinite cardinal number, G any group, $\phi$ a set consisting of n
groups which are isomorphic with G, the notation $V(G, n) = V(\phi)$ will be used.
(12.3) Suppose that R is a rational group of genus s. 

(a) sis an invariant of V(R, n). 

(b) n is for every rational group R an invariant of V(R, n) if, and only if, n is
the only solution of the equation $2^x = 2^n$.²³

Proof. Let $s'$ be the genus of the infinite part of the numbers of genus $s$.
Then there exist elements of genus $t$ in V(R, n) if, and only if, $s' \le t \le s$ for
infinite n; $s = t$ for finite n. This proves (a).

If n is finite, then $r(V(R, n)) = n$ is an invariant of V(R, n).

If n is infinite, then $r(V(R, n)) = 2^n$ is an invariant of V(R, n) and n is
consequently an invariant of V(R, n), if n is the only solution of $2^x = 2^n$. If w is
another solution of this equation, then n and w are infinite. Let W be the
additive group of all the rational numbers. Then V(W, n) and V(W, w) are
both complete groups of the same rank and therefore isomorphic, i.e., n is not
an invariant.²⁴

21 This "approximate decomposition" of the group A into the groups $A_v$ is exactly the
dual of L. Pontrjagin's concept of direct decomposition of the character group of A. But
not every approximate decomposition can be realized by a direct decomposition of A. See
L. Pontrjagin, The theory of topological commutative groups, Ann. of Math., vol. 35 (1934),
pp. 361-388.

22 Though $V(\phi)$ is uniquely determined by the set $\phi$, $\phi$ is not always an invariant of the
group $V(\phi)$. If e.g., $\phi$ consists of 2 groups, isomorphic with the additive group of all the
real numbers, and $\psi$ consists of 3 groups, isomorphic with the additive group of all the
complex numbers, then $V(\phi)$ and $V(\psi)$ are both complete groups of rank $2^{\aleph_0}$ and therefore
isomorphic.

A group which is given as a vector-group is therefore only given by a "representation"
of the group and not by invariant properties. This applies also to the topology in the
group which is suggested by this representation, since such a topology is generally not a
natural topology even in the weak sense that all the proper automorphisms are continuous.
Natural topologies in the stronger sense that every proper or improper automorphism is
continuous are rather rare. For if in the abelian group A there exists a non-discrete,
natural topology in the stronger sense, then A is direct irreducible if A is connected.

23 n is the only solution of the equation $2^x = 2^n$, if the Cantor continuum hypothesis
$2^{\aleph_\nu} = \aleph_{\nu+1}$ is satisfied. Without the hypothesis this proposition has not yet been proved.
See W. Sierpinski, L'Hypothèse du Continu.

THEOREM 12.4. Suppose that R is a rational group of genus s. Then V(R, n) 
is completely reducible if, and only if, n is finite or R complete. 

Proof. If n is finite, then V(R, n) is a direct sum of n groups, isomorphic 
with R, i.e., completely reducible. 

If n is infinite and R complete, then V(R, n) is complete and therefore a
direct sum of $2^n$ groups, isomorphic with R, i.e., completely reducible.

Suppose now that n is infinite and R a true subgroup of the additive group
of all the rational numbers ($R \neq 0$). If, as may be assumed, the indices of the
coordinates are ordinal numbers, denote by W the subgroup of those vectors
whose coordinates with infinite index are 0. Then W is a direct summand of
V(R, n) and is a $V(R, \aleph_0)$.

Since R is not complete, there exists a prime number p such that $pR < R$.
Denote by $W_p$ the subgroup of all those vectors in $(W, s \le |x|)$ such that
for every integer $i$ ($\ge 0$) almost every coordinate is contained in $p^i R$. Then
$W_p$ is a closed subgroup of $(W, s \le |x|)$.

If $W'$ is the subgroup of all those vectors in W such that almost every
coordinate is 0, then $W'$ is a direct sum of $\aleph_0$ groups of genus s and thus $W'$ is
countable. $W' < W_p$.

If b is any element in $W_p$, there exists an element $b'$ in $W'$ such that $b \equiv b'$
mod $pW_p$. $W_p/pW_p$ is therefore countable. Since all the elements of $W_p$
have in $W_p$ genus s and since the p-values of their multiplicities are finite, and
since finally $W_p$ contains a continuum of elements, this implies that $W_p$ is not
completely reducible.

Since $(W, s < |x|) = 0$ and all the elements $\neq 0$ of $(W, s \le |x|)$ have genus
s in W, and since all the elements $\neq 0$ of $W_p$ have genus s in $W_p$, it follows
from Corollary 8.9 that $(W, s \le |x|)$ is not completely reducible.

Since all the elements of $(V(R, n), s \le |x|)$ have genus s in V(R, n) and
since $(W, s \le |x|)$ is a direct summand of $(V, s \le |x|)$, it follows that
$(V, s \le |x|)$ and consequently that V(R, n) is not completely reducible. Thus the
theorem and the following corollary have been proved.

COROLLARY 12.5. If n is infinite and the rational group R not complete, then
$(V(R, n), |R| \le |x|)$ is not completely reducible.

THEOREM 12.6. If R is a rational group, then $(V(R, n), |R| \le |x|)$ is
separable.

Proof. Denote by s the genus of R and by g a number of genus s. If f
is an element of V(R, n) and its v-th coordinate $\neq 0$, then
$m(f_v < R) = h(f, v)k(f, v)^{-1}g$, where h and k are relatively prime integers, both relatively
prime to the infinite part of g. If $f_v = 0$, put $h(f, v) = 0$, $k(f, v) = 1$. Since,
if $f \neq 0$, $m(f < V)$ is the g.c.d. of the numbers $m(f_v < R)$, f belongs to
$(V, s \le |x|)$ if, and only if, the l.c.m. $k(f)$ of all the numbers $k(f, v)$ is finite.

24 Whether the invariants $|R|$ and $2^n$ characterize V(R, n) is still unknown.

If f = 0 is an element of (V, s S | x}), i-e., if k(f) is an ordinary positive 
integer, put A(f, v) = A(f, v)k(f)k¢ f, v) ). ACS, v) is either 0 or an ordinary 
positive integer and satisfies h(f, v)k(f)" = ACf, v)k(f, v)™. 

If f ~ 0 is an element of (V, s S |x|), denote by h(f) the minimum of the 
numbers A(f, v) 0. 

1. If a. = 1, let f be the closed subgroup of V generated by f. Since mf < 


~ k(f)'g = m(f, < R) for a certain u, it follows that V = f + y.. 
. If l <1 n(f), then f = f’ +f”, where 
fe = 2 (v)h(fA(f, v) “fe , fo = 2’ W)h(S, ry fe, (f. = 9), 
and f, = f. = 0, if f, = 0, 
Af, v) = h(f)z"(v) + 20), 0S 2"(v) < h(f). 


That f. and f.’ exist in R is a consequence of the definition of k(f). 

Then 1 = A(f’) and therefore, as proved in 1, V = f’ + V1, where wu is a 
suitable index and f’ the closed subgroup of V generated by f’. Since f” is an 
element of Vi, since h(f”’) < h(f) and since V{ is isomorphic with V(R, n — 1), 
it follows by complete induction that there exist two subgroups A and B of V 
such that $V = A + B$, $f \equiv 0$ mod $A$, $B$ consisting of all those vectors $b$ in $V$
such that $b_{v_1} = \cdots = b_{v_q} = 0$ for a given finite set of indices $v_i$, and $A$ being
a direct sum of $q$ rational groups of genus s.

Since B is isomorphic with V(R, n − q), it follows by complete induction that
every finite subset of $(V, s \le |x|)$ is contained in a direct summand D of V
which is a direct sum of a finite number of rational groups of genus s.

Since this direct summand D is a subgroup of $(V, s \le |x|)$, this implies in
particular that $(V, s \le |x|)$ is separable.

By Corollary 4.4 the above result implies the 

COROLLARY 12.7. Every closed subgroup of finite rank of $(V(R, n), |R| \le |x|)$
is a direct summand of V(R, n).

Thus the groups $(V(R, n), |R| \le |x|)$ with infinite n and incomplete R
are examples of separable groups which are not completely reducible, though
all their elements $\neq 0$ have the same genus, and which therefore by Corollary 8.5
do not belong to a class T, 

COROLLARY 12.8. Every closed subgroup of $(V(R, n), |R| \le |x|)$ is separable
and every closed subgroup of (V(R, n), | R| S |x|) which belongs to a class T, 
is completely reducible. 

This is a consequence of Corollaries 12.7, 4.4 and 8.7. 

If J is a group and s a genus such that for every closed subgroup S of finite
rank all the non-zero elements of J/S have the genus s, it is undecided whether 
or not J is separable. 


THE INSTITUTE FOR ADVANCED STUDY.











SOLUTION OF A PROBLEM OF F. RIESZ ON THE HARMONIC 
MAJORANTS OF SUBHARMONIC FUNCTIONS 


By TIBOR RADÓ


Introduction. Let $u(x, y)$ be a subharmonic function¹ in a domain $G$. Consider
a domain $G'$ comprised in $G$ together with its boundary $B'$. If $H(x, y)$
is continuous in $G' + B'$ and harmonic in $G'$, and if $H \ge u$ on $B'$, then $H \ge u$
in $G'$ also, by the definition of a subharmonic function. If $u$ is continuous,
and if the Dirichlet problem is solvable for the region $G' + B'$, then the harmonic
function $h$ determined by the condition $h = u$ on $B'$ is clearly the one
which yields the best possible limitation for $u$ on the basis of the fundamental
property of subharmonic functions quoted above.

If however u is a general (and therefore possibly discontinuous) subharmonic 
function, then the situation is less clear. It can be shown (R 2, p. 358) that 
there exists in G’ a least harmonic majorant h* characterized by the following 
properties. (a) $h^* \ge u$ in $G'$, (b) if $H$ is harmonic in $G'$ and $H \ge u$ in $G'$, then
$H \ge h^*$ in $G'$. But this least harmonic majorant did not seem to be the best
one, as far as usefulness was concerned. At any rate, F. Riesz (R 1, p. 334) 
reserved the name of best harmonic majorant for a harmonic majorant defined 
in a different fashion, namely, in terms of the values of u on the boundary 
B’ of G’ (see 1.2), while the least harmonic majorant h* is defined in terms of 
the values of u in G’ alone. The best harmonic majorant, in the sense of F. 
Riesz, will be denoted by h and will be referred to by the letters B. H. M. 
The letters L. H. M. and the notation h* will refer to the least harmonic majorant 
described above. 

F. Riesz stated (R 1, footnote on p. 334) that he established the identity
of $h$ and $h^*$ in various special cases. Brelot² gave an explicit proof in the
case when the subdomain $G'$ is bounded by circles. It is the purpose of this
paper to prove the identity of $h$ and $h^*$ without any restrictions on $G'$, except for
the assumption, implied in the very definition of $h$, that the Dirichlet problem is
solvable for the region $G' + B'$.³

The proof of this result could be based on the general theorems of F. Riesz 


Received October 8, 1936; presented to the American Mathematical Society, December 
1936. 

1 See F. Riesz, Sur les fonctions subharmoniques et leur rapport à la théorie du potentiel,
parts I and II, Acta Mathematica, vol. 48 (1926), pp. 330-343 and vol. 54 (1930), pp. 322-360.
These papers will be referred to as R 1 and R 2.

2 M. Brelot, Étude des fonctions sousharmoniques au voisinage d'un point, Actualités
scientifiques et industrielles, vol. 139 (1934), p. 18.

3 In §5 of this paper, we shall give an interpretation of this result which seems to express 
more adequately its true meaning. 

















on the representation of subharmonic functions in terms of negative mass 
distributions. It seemed desirable, however, to present a proof based on much 
more elementary considerations. The idea of the proof is to push the use of 
the method of approximation by integral means⁴ a little farther than usual
and thus to avoid the use of improper integrals which seem to obscure some 
of the facts which are essential for our purposes. Specifically, our main tool 
is a trivial modification (formula 2 in 2.2) of the classical formula of Green. 
Roughly speaking, we apply the formula not to the function itself which is to 
be studied, but to its integral mean. It seems to the author that this modified 
formula might prove useful in various other problems, also, whenever it is de- 
sirable to avoid improper integrals. 

It should be noted that the least harmonic majorant, as defined above, ad- 
mits of an important interpretation in potential theory. The potential of a 
negative mass-distribution is a subharmonic function u. The sweeping-out 
process, applied in a domain G’, leads to a new potential u* (see G. C. Evans, 
Potentials of positive mass, part II, Transactions Amer. Math. Soc., vol. 38 
(1935), pp. 201-236), and it follows immediately that $u^* = h^*$ in $G'$, where $h^*$
is the least harmonic majorant of $u$ in $G'$. This remark suggests various problems,
similar to the one solved in this paper, which the author plans to discuss
elsewhere.


1. Preliminaries. 1.1. Let $G'$ be a bounded domain (connected open set).
Denote by $B'$ the boundary of $G'$. We shall say that $G' + B'$ is a Dirichlet
region if for every continuous function $\varphi$ given on $B'$ there exists in $G' + B'$
a continuous function $h$ which is harmonic in $G'$ and which reduces to $\varphi$ on $B'$.

1.2. Denote by $u$ a function which is subharmonic in a domain $G$. Consider
a Dirichlet region $G' + B'$ interior to $G$. As $u$ is upper semi-continuous
(R 1, p. 333), we have on $B'$ a sequence of continuous functions $\varphi_k$, $k = 1, 2, \dots$,
such that $\varphi_k \searrow u$ on $B'$ (the symbol $\varphi_k \searrow u$ indicates that $\varphi_k$ is a decreasing
sequence). Denote by $H_k$ the solution of the Dirichlet problem, for the region
$G' + B'$, with $\varphi_k$ as the prescribed boundary function. Then (R 1, p. 333)
$H_k \searrow h$ in $G'$, where $h$ is harmonic and $\ge u$ in $G'$. This harmonic function $h$
is the best harmonic majorant (B. H. M.) in the sense of F. Riesz⁵ of $u$ in $G'$. We
shall list presently some of its properties.

1.3. The B. H. M. depends only upon the values of $u$ on $B'$. If we use a
different sequence $\varphi_k \searrow u$ on $B'$, we obtain the same $h$. Also, if $u_1$, $u_2$ are
subharmonic in $G$ and if $u_1 = u_2$ on $B'$, then $u_1$ and $u_2$ have the same B. H. M.
in $G'$ (R 1, p. 334).


4 See R 2, pp. 343 and 345 for historical references concerning the use of this method in
the theory of harmonic and subharmonic functions.

5 Actually, F. Riesz required that $\varphi_k$ be continuous and $\varphi_k \searrow u$ in the whole domain $G$,
while we only require that these conditions be satisfied on $B'$. The equivalence of the
two definitions can be seen immediately by the same reasoning which F. Riesz used to
establish the facts stated in 1.3 above. The wording chosen in our text is due to Brelot,
loc. cit., p. 17.













1.4. If $H$ is continuous in $G' + B'$, harmonic in $G'$, and $H \ge u$ in $G' + B'$,
then the B. H. M. of $u$ in $G'$ is $\le H$ in $G'$ (R 1, p. 334).
1.5. Let $h$ be the B. H. M. of $u$ in $G'$, and define in $G$ a function $\bar u$ as follows:

$$\bar u = \begin{cases} u & \text{in } G - G', \\ h & \text{in } G'. \end{cases}$$

Then $\bar u$ is subharmonic in $G$. To see this, take a subregion $G'' + B''$ such that
$G' + B' \subset G''$, $G'' + B'' \subset G$. Since $u$ is upper semi-continuous in $G$, we have
in $G''$ a sequence of continuous functions $\varphi_k$ such that $\varphi_k \searrow u$ in $G''$. Define
in $G''$ the functions $\bar\varphi_k$ as follows:

$$\bar\varphi_k = \begin{cases} \varphi_k & \text{in } G'' - G', \\ H_k & \text{in } G', \end{cases}$$

where $H_k$ is the solution of the Dirichlet problem for $G' + B'$ with the boundary
condition $H_k = \varphi_k$ on $B'$. Then by 1.3 we have $\bar\varphi_k \searrow \bar u$ in $G''$. Thus $\bar u$ is the
limit of a decreasing sequence of continuous functions and hence it is upper
semi-continuous in $G''$. As $G''$ was an arbitrary subdomain, $\bar u$ is upper
semi-continuous in $G$. To prove that $\bar u$ is subharmonic it must be shown that

$$\bar u(x, y) \le \frac{1}{2\pi} \int_0^{2\pi} \bar u(x + r\cos\varphi, y + r\sin\varphi)\, d\varphi$$

for every point $(x, y)$ in $G$ and for every sufficiently small $r$. If $(x, y)$ is either
in $G - (G' + B')$ or in $G'$, then the inequality is obviously satisfied for small
values of $r$. If $(x, y)$ is on $B'$, we have

$$\bar u(x, y) = u(x, y) \le \frac{1}{2\pi} \int_0^{2\pi} u(x + r\cos\varphi, y + r\sin\varphi)\, d\varphi \le \frac{1}{2\pi} \int_0^{2\pi} \bar u(x + r\cos\varphi, y + r\sin\varphi)\, d\varphi,$$

since $u \le \bar u$ in $G$ (see 1.2).

1.6. If u is continuous on B’, then (see 1.3) the B. H. M. of u in G’ is simply 
the solution of the Dirichlet problem for G’ + B’ with the boundary condi- 
tion h = u on B’ (it is assumed that G’ + B’ is a Dirichlet region). 

1.7. In $G'$, defined as in 1.2, we have a (necessarily unique) least harmonic
majorant $h^*$ (L. H. M.) defined as follows: (a) $h^* \ge u$ in $G'$; (b) if $H$ is harmonic
and $\ge u$ in $G'$, then $H \ge h^*$ in $G'$ (see R 2, p. 358).

1.8. Clearly, $h^* \le h$. As stated in the introduction, we shall prove that
$h^* = h$. Let us observe that this is trivial if $u$ is continuous. Indeed, take
any point $(x_0, y_0)$ on $B'$ and a sequence $(x_n, y_n) \to (x_0, y_0)$ in $G'$. Since
$u \le h^* \le h$, and $u(x_n, y_n) \to u(x_0, y_0) = h(x_0, y_0) = \lim h(x_n, y_n)$, we have
$h^*(x_n, y_n) - h(x_n, y_n) \to 0$. That is, the harmonic function $h^* - h$ vanishes
continuously on the boundary of $G'$, and consequently $h^* = h$ in $G'$.










1.9. The subdomain $G'$ being given as in 1.2, define in $G$ a function $u^*$ as
follows:

$$u^* = \begin{cases} u & \text{in } G - G', \\ h^* & \text{in } G', \end{cases}$$

where $h^*$ is the least harmonic majorant of $u$ in $G'$. Then $u^*$ is subharmonic in $G$.

We first show that $u^*$ is upper semi-continuous in $G$. Clearly, it is sufficient
to verify this property for points on $B'$. Let $(x_0, y_0)$ be a point on $B'$. As
$u^* \le \bar u$ (see 1.5 for the definition of $\bar u$), we have, for $(x_n, y_n) \to (x_0, y_0)$,

$$\limsup u^*(x_n, y_n) \le \limsup \bar u(x_n, y_n) \le \bar u(x_0, y_0) = u(x_0, y_0) = u^*(x_0, y_0).$$

The relation

$$u^*(x, y) \le \frac{1}{2\pi} \int_0^{2\pi} u^*(x + r\cos\varphi, y + r\sin\varphi)\, d\varphi$$

is proved exactly as it was proved for $\bar u$ in 1.5.
1.10. For $r > 0$ define

$$A_r^{(1)}(x, y) = \frac{1}{\pi r^2} \iint_{\xi^2 + \eta^2 < r^2} u(x + \xi, y + \eta)\, d\xi\, d\eta$$

(see R 2, footnotes on p. 343 and p. 345 for historical references concerning the
use of this approximating function in the theory of harmonic and subharmonic
functions). Then $A_r^{(1)}(x, y)$ is continuous and subharmonic at those points
of $G$ whose distance from the boundary is greater than $r$. If we apply the
same process to $A_r^{(1)}(x, y)$ twice, using the same $r$, we obtain a function
$A_r^{(3)}(x, y)$ which has a number of important properties, some of which will be
listed presently (in this section, $u$ always stands for a subharmonic function).
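The areal mean of 1.10 can be illustrated numerically. The following is a minimal sketch, not part of the original argument: the sample functions $x^2 + y^2$ (subharmonic) and $x^2 - y^2$ (harmonic), the midpoint quadrature in polar coordinates, and the name `areal_mean` are all assumptions chosen for illustration.

```python
import math


def areal_mean(u, x, y, r, n_rho=200, n_phi=200):
    """Approximate (1 / (pi r^2)) times the integral of u over the disc of
    radius r about (x, y), using the midpoint rule in polar coordinates."""
    total = 0.0
    d_rho = r / n_rho
    d_phi = 2.0 * math.pi / n_phi
    for i in range(n_rho):
        rho = (i + 0.5) * d_rho
        for j in range(n_phi):
            phi = (j + 0.5) * d_phi
            # area element in polar coordinates is rho d_rho d_phi
            total += u(x + rho * math.cos(phi), y + rho * math.sin(phi)) * rho
    return total * d_rho * d_phi / (math.pi * r * r)


def u(x, y):
    # Laplacian is 4 > 0, so this sample function is subharmonic
    return x * x + y * y


def w(x, y):
    # Laplacian is 0, so this sample function is harmonic
    return x * x - y * y


point = (0.3, -0.2)
means = [areal_mean(u, *point, r) for r in (0.4, 0.2, 0.1)]
harmonic_mean = areal_mean(w, *point, 0.4)
```

For the subharmonic sample the means lie above $u$ at the centre and decrease as $r$ decreases, while for the harmonic sample the mean reproduces the value at the centre, in line with the mean-value property.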

1.11. Take any region $G' + B'$ interior to $G$. Then for $r$ sufficiently small,
$A_r^{(3)}$ is defined and subharmonic in a domain containing $G' + B'$, and has
continuous derivatives of the first and second order there. On $G' + B'$,
$A_r^{(3)} \searrow u$ for $r \searrow 0$. As a consequence, $\int A_r^{(3)} \to \int u$ for $r \to 0$, where $\int$ stands
for any simple or double integral taken over any measurable range in $G' + B'$
(R 2, pp. 342-345).

1.12. If $G' + B'$ is interior to $G$, then (R 2, p. 353)

$$\lim_{r \to 0} \iint_{G'+B'} \Delta A_r^{(3)}(x, y)\, dx\, dy < +\infty, \qquad \Delta = \frac{\partial^2}{\partial x^2} + \frac{\partial^2}{\partial y^2}.$$


1.13. If $G' + B'$ is interior to $G$, and if $u$ is harmonic in $G'$, then we have
$A_r^{(3)} = u$ at every point of $G'$ whose distance from $B'$ is larger than $3r$, by the
mean-value property of harmonic functions. Consequently for every closed
set $S$ in $G'$ we have a $\delta > 0$ such that $\Delta A_r^{(3)} = 0$ on $S$ for $r < \delta$.
1.14. Consider any bounded domain $G'$ with boundary $B'$. By using subdivisions
of the plane into smaller and smaller squares, we can obtain in a
familiar fashion an increasing sequence of nested domains $G_n'$ with smooth
boundaries $B_n'$, which approximate $G' + B'$ in the following sense⁶ (the properties
to be listed are clearly not independent of each other and we state them
explicitly only for easier reference).

(a) $G_n'$ contains a prescribed point $(x_0, y_0)$ in $G'$.

(b) $G_n' + B_n' \subset G_{n+1}'$.

(c) $G_n' + B_n' \subset G'$.

(d) Any point $(x, y)$ of $G'$ is contained in some $G_n'$.

(e) To every $\epsilon > 0$ there corresponds an $n_0 = n_0(\epsilon)$ with the following property.
Denote by $S_\epsilon$ the set of those points of $G'$ whose distance from $B'$ is less than $\epsilon$.
Then for $n > n_0$ the set $S_\epsilon$ contains $B_n'$.

(f) The measure of $G_n'$ converges to the measure of $G'$, and consequently the
measure of $G' - G_n'$ converges to zero.

(g) $B_n'$ consists of a finite number of non-intersecting simple closed curves as
smooth as desired. In particular, $G_n' + B_n'$ is a Dirichlet region.


1.15. We shall need the following well-known fact concerning the dependence
of the solution of the Dirichlet problem upon the boundary conditions. Suppose
the region $G' + B'$ of 1.14 is a Dirichlet region. Denote by $F$ a function
which is continuous on $G' + B'$. Let $h$ denote the solution of the Dirichlet
problem for $G' + B'$ with the boundary condition $h = F$ on $B'$, and let $h_n$ be
the solution of the Dirichlet problem for $G_n' + B_n'$ with the boundary condition
$h_n = F$ on $B_n'$, where $G_n' + B_n'$ is a sequence of approximating regions as
described in 1.14. Then $h_n \to h$ in the sense that to every $\eta > 0$ there corresponds
an $n_0 = n_0(\eta) > 0$ such that $|h_n - h| < \eta$ in $G_n' + B_n'$ for $n > n_0$. We
sketch the proof for the convenience of the reader. Given $\eta > 0$, we have an
$\epsilon = \epsilon(\eta) > 0$ such that

$$|h(x_2, y_2) - h(x_1, y_1)| < \frac{\eta}{2}, \quad |F(x_2, y_2) - F(x_1, y_1)| < \frac{\eta}{2} \quad \text{for } [(x_2 - x_1)^2 + (y_2 - y_1)^2]^{1/2} < \epsilon,$$

for every pair of points $(x_1, y_1)$, $(x_2, y_2)$ in $G' + B'$, since $F$ and $h$ are uniformly
continuous in $G' + B'$. Take then $n > n_0(\epsilon)$, where $n_0(\epsilon)$ is the quantity
defined in condition (e) of 1.14. Consider any point $(x, y)$ on $B_n'$. By condition
(e), we have some point $(\bar x, \bar y)$ on $B'$ such that $[(x - \bar x)^2 + (y - \bar y)^2]^{1/2} < \epsilon$.
Since $h(\bar x, \bar y) = F(\bar x, \bar y)$ and $h_n(x, y) = F(x, y)$, we obtain then

$$|h(x, y) - h_n(x, y)| \le |h(x, y) - h(\bar x, \bar y)| + |h(\bar x, \bar y) - h_n(x, y)| = |h(x, y) - h(\bar x, \bar y)| + |F(\bar x, \bar y) - F(x, y)| < \eta.$$

That is, $|h - h_n| < \eta$ on $B_n'$ for $n > n_0 = n_0(\epsilon(\eta))$. As $h - h_n$ is harmonic
in $G_n' + B_n'$, it follows that the same inequality holds in $G_n'$ also.

6 See for instance O. D. Kellogg, Foundations of Potential Theory, concerning the
familiar facts in 1.14 and 1.15, and in §2.
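The convergence statement of 1.15 can be followed in the simplest possible case. The sketch below is an illustration under assumptions that are not in the original: $G'$ is the unit disc, the approximating regions are the concentric discs of radius $1 - 1/n$ (rather than regions built from squares), and $F(x, y) = x^2 + y^2$, so that both Dirichlet solutions are constants that can be written down directly. The function names are hypothetical.

```python
def sup_difference(n):
    """sup of |h_n - h| on the closed disc of radius 1 - 1/n,
    for the boundary function F(x, y) = x^2 + y^2."""
    radius = 1.0 - 1.0 / n
    h = 1.0              # solution on the unit disc: F = 1 on the unit circle
    h_n = radius ** 2    # solution on the smaller disc: F = radius^2 on its boundary
    return abs(h - h_n)


def first_index_below(eta):
    """Smallest n >= 2 with sup|h_n - h| < eta.

    The difference equals 2/n - 1/n^2, which decreases in n, so the bound
    continues to hold for every larger n as well."""
    n = 2
    while sup_difference(n) >= eta:
        n += 1
    return n


threshold = first_index_below(0.01)
```

In this toy case the index depends on $\eta$ only through the distance between the two boundary circles, which mirrors the role of condition (e) of 1.14 in the proof.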

1.16. The continuity of F was actually used for points of G’ + B’ close to B’. 
Hence the conclusion in 1.15 remains valid if we only know that F is con- 
tinuous in G’ + B’ — S, where S is some closed set in G’. 


2. A remark on the formula of Green.⁶ 2.1. Suppose the functions $f$, $g$ have
continuous derivatives of the first and second orders in a domain $G'$, and that
these derivatives remain continuous on the boundary $B'$ of $G'$. If $B'$ consists
of a finite number of sufficiently smooth simple closed curves, then we have
the classical formula

$$\iint_{G'} (f \Delta g - g \Delta f)\, dx\, dy = -\int_{B'} \left( f \frac{\partial g}{\partial n_i} - g \frac{\partial f}{\partial n_i} \right) ds, \tag{1}$$

where $n_i$ refers to the interior normal with respect to $G'$.

2.2. Let there be given in a domain $G$ a function $v$ with continuous derivatives
of the first and second orders. Consider a region $G' + B'$, interior to $G$, such
that $B'$ consists of a finite number of simple closed smooth curves. In $G'$,
take a circle $C(x_0, y_0; r)$, with centre $(x_0, y_0)$ and radius $r$, comprised in $G'$
together with its interior. Put

$$v^{(r)}(x_0, y_0) = \frac{1}{2\pi} \int_0^{2\pi} v(x_0 + r\cos\varphi, y_0 + r\sin\varphi)\, d\varphi,$$

$$l(x, y) = -\log\,[(x - x_0)^2 + (y - y_0)^2]^{1/2}, \qquad (x, y) \neq (x_0, y_0),$$

$$l_r(x, y) = \begin{cases} l(x, y) & \text{for } [(x - x_0)^2 + (y - y_0)^2]^{1/2} \ge r, \\ -\log r & \text{for } [(x - x_0)^2 + (y - y_0)^2]^{1/2} \le r. \end{cases}$$

Denote by $h$ the solution of the Dirichlet problem for $G' + B'$ with the boundary
condition $h = v$ on $B'$. Then (see Kellogg, loc. cit., p. 237, footnote)

$$h(x_0, y_0) = \frac{1}{2\pi} \int_{B'} v \left( \frac{\partial l}{\partial n_i} - \frac{\partial H}{\partial n_i} \right) ds,$$

where $H$ denotes the solution of the Dirichlet problem for $G' + B'$ with the
boundary condition $H = l$ on $B'$ (that is, $l - H$ is Green's function for $G' + B'$
with pole at $(x_0, y_0)$). If $B'$ is smooth, then the necessary derivatives of $H$
will remain continuous on $B'$.⁷

2.3. We apply now formula (1) first to the region bounded by $B'$ and
$C(x_0, y_0; r)$ for $f = v$, $g = l$, second to the circular disc bounded by $C(x_0, y_0; r)$
for $f = v$, $g = -\log r$ and third to $G' + B'$ for $f = v$, $g = H$. Combining the
resulting equations with the expression for $h(x_0, y_0)$ in 2.2, we obtain the
formula

$$v^{(r)}(x_0, y_0) = -\frac{1}{2\pi} \iint_{G'} [l_r(x, y) - H(x, y)]\, \Delta v(x, y)\, dx\, dy + h(x_0, y_0). \tag{2}$$

7 The functions $l$, $l_r$, $H$ depend also upon the choice of $(x_0, y_0)$. However, this point
will be kept fixed, and there is no need for notations like $H(x, y; x_0, y_0)$ etc.
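Formula (2) can be checked in a case where every ingredient is explicit. The sketch below is only an illustration under assumptions not made in the text: $G'$ is the unit disc, the pole is the origin, and $v = x^2 + y^2$, so that $\Delta v = 4$, $H = 0$ (because $l = 0$ on the unit circle), $h = 1$, and the circular mean is $v^{(r)}(0, 0) = r^2$. The function names are hypothetical, and the double integral is reduced to a radial midpoint sum.

```python
import math


def l_r(rho, r):
    """Truncated logarithmic kernel l_r at distance rho from the pole."""
    return -math.log(max(rho, r))


def rhs_formula_2(r, n_rho=4000):
    """Right-hand side of (2) for v = x^2 + y^2 on the unit disc, pole at 0.

    Assumed closed forms: Delta v = 4, H = 0, h = 1."""
    d_rho = 1.0 / n_rho
    integral = 0.0
    for i in range(n_rho):
        rho = (i + 0.5) * d_rho
        # (l_r - H) * Delta v, integrated over the ring of area 2 pi rho d_rho
        integral += (l_r(rho, r) - 0.0) * 4.0 * 2.0 * math.pi * rho * d_rho
    return -integral / (2.0 * math.pi) + 1.0


def lhs_formula_2(r):
    """Circular mean v^(r)(0, 0) of v = x^2 + y^2 over the circle of radius r."""
    return r * r


gaps = [abs(rhs_formula_2(r) - lhs_formula_2(r)) for r in (0.5, 0.25, 0.1)]
```

Because $r$ is kept positive, the integrand is bounded and continuous, so an ordinary quadrature suffices; no improper integral appears.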


2.4. Actually, (2) can be obtained from the classical expression for v in terms 
of Green’s function by an integration under the integral sign. Conversely, 
that classical formula can be obtained from (2) by the passage to the limit
$r \to 0$. But our idea is to keep $r$ finite in (2), so that we have to deal only
with continuous functions. We shall need a slight extension of (2) which we 
shall consider presently. 

2.5. If $v$ has continuous derivatives of the first and second orders in $G$, then
formula (2) holds for every Dirichlet subregion $G' + B'$.

This may be seen as follows. Approximate $G' + B'$ by regions $G_n' + B_n'$
as described in 1.14. Observe first that the functions $h$, $H$, as defined in 2.2,
actually exist for every Dirichlet subregion $G' + B'$. Denote by $h_n$, $H_n$ the
functions corresponding to $G_n' + B_n'$. We have then by 2.2

$$v^{(r)}(x_0, y_0) = -\frac{1}{2\pi} \iint_{G_n'} [l_r(x, y) - H_n(x, y)]\, \Delta v(x, y)\, dx\, dy + h_n(x_0, y_0). \tag{3}$$

We have $h_n(x_0, y_0) \to h(x_0, y_0)$ by 1.15. Next we consider

$$\iint_{G'} l_r(x, y) \Delta v(x, y)\, dx\, dy = \iint_{G_n'} + \iint_{G' - G_n'}.$$

As $l_r$ and $\Delta v$ are continuous and therefore bounded in $G' + B'$, we have

$$\iint_{G' - G_n'} \to 0,$$

since the measure of $G' - G_n'$ converges to zero. Hence

$$\iint_{G_n'} l_r(x, y) \Delta v(x, y)\, dx\, dy \to \iint_{G'} l_r(x, y) \Delta v(x, y)\, dx\, dy.$$

Consider finally

$$\iint_{G'} H(x, y) \Delta v(x, y)\, dx\, dy = \iint_{G_n'} H_n(x, y) \Delta v(x, y)\, dx\, dy + \iint_{G_n'} [H(x, y) - H_n(x, y)]\, \Delta v(x, y)\, dx\, dy + \iint_{G' - G_n'} H(x, y) \Delta v(x, y)\, dx\, dy = I_n^{(1)} + I_n^{(2)} + I_n^{(3)}. \tag{4}$$

We have $I_n^{(3)} \to 0$ because the integrand is bounded and the measure of $G' - G_n'$
converges to zero. Take now any $\eta > 0$. By 1.16, we shall have for $n$ larger
than some $n_0 = n_0(\eta)$

$$|H - H_n| < \eta \quad \text{in } G_n' + B_n'.$$

Hence, for large $n$,

$$|I_n^{(2)}| < \eta \iint_{G_n'} |\Delta v(x, y)|\, dx\, dy \le \eta \iint_{G'} |\Delta v(x, y)|\, dx\, dy.$$

Thus $I_n^{(2)} \to 0$. It follows then from (4) that

$$\iint_{G_n'} H_n(x, y) \Delta v(x, y)\, dx\, dy \to \iint_{G'} H(x, y) \Delta v(x, y)\, dx\, dy.$$

Summing up, formula (3) yields, for $n \to \infty$, formula (2) for the most general
Dirichlet region $G' + B' \subset G$. The assumptions concerning the smoothness
of $v$ could be generalized in an obvious way, but this is immaterial for our
purposes.

2.6. If the function $v$ of 2.5 is subharmonic in $G$, then the function $h$ in
formula (2) is the B. H. M. of $v$ in $G'$ (observe that $v$ is continuous by assumption,
and compare 2.2 and 1.6).


3. A lemma. 3.1. Lemma. Let u be subharmonic in a domain G. Denote 
by G’ + B’ a Dirichlet region interior to G, and suppose that u is harmonic in G’. 
Then u = hin G’, where h is the B. H. M. of u in G’. 

3.2. To prove this lemma, consider the sequence of approximating functions

$$u_n(x, y) = A_{1/n}^{(3)}(x, y), \qquad n > N$$

(see 1.10), where $N$ is large enough so that for $n > N$ the function $u_n$ is defined
in some domain which contains $G' + B'$ in its interior. Denote by $h_n$ the
B. H. M. of $u_n$ in $G'$. Since $u_n \searrow u$, and since $u_n$ is continuous, we have (see
1.2, 1.3)

$$h_n \to h \quad \text{in } G', \tag{5}$$

where $h$ is the B. H. M. of $u$ in $G'$. Also (see 1.12)

$$0 \le \iint_{G'+B'} \Delta u_n(x, y)\, dx\, dy < M, \tag{6}$$

where $M$ is some finite constant. In $G'$, take a point $(x_0, y_0)$ and a circle
$C(x_0, y_0; r)$ with centre $(x_0, y_0)$ and radius $r$, such that $C(x_0, y_0; r)$ is comprised
in $G'$ together with its interior. On account of 2.5, 2.6 we have then (since $u_n$
has continuous derivatives of the first and second order)

$$u_n^{(r)}(x_0, y_0) = -\frac{1}{2\pi} \iint_{G'} [l_r(x, y) - H(x, y)]\, \Delta u_n(x, y)\, dx\, dy + h_n(x_0, y_0) = \frac{1}{2\pi} I_n + h_n(x_0, y_0). \tag{7}$$


3.3. We write

$$I_n = \iint_{G'} [H(x, y) - l_r(x, y)]\, \Delta u_n(x, y)\, dx\, dy = \iint_{S} + \iint_{G'-S}, \tag{8}$$

where $S$ is a closed set interior to $G'$. Observe that $l_r = l = H$ on $B'$ (see 2.2).
That is, $H - l_r = 0$ on $B'$. As $l_r$, $H$ are continuous in $G' + B'$, we have for
every $\epsilon > 0$ a $\delta > 0$ such that $|H(x, y) - l_r(x, y)| < \epsilon$ for every point $(x, y)$
in $G'$ whose distance from $B'$ is less than $\delta$. Choose the closed set $S$ of formula
(8) as the set of those points of $G'$ whose distances from $B'$ are $\ge \delta$. In formula
(8) we have then, by (6) in 3.2,

$$\left| \iint_{G'-S} \right| \le \epsilon \iint_{G'-S} \Delta u_n(x, y)\, dx\, dy \le \epsilon \iint_{G'+B'} \Delta u_n(x, y)\, dx\, dy < \epsilon M.^{8} \tag{9}$$

On the other hand, for $n$ large enough we have $\Delta u_n = 0$ on $S$ (see 1.13). Hence
in formula (8) we have $\iint_S = 0$ for large $n$. That is,

$$\limsup |I_n| \le \epsilon M.$$

As $\epsilon$ is arbitrary and $M$ is fixed, it follows that $I_n \to 0$.

3.4. By 1.11, 3.3 and by formula (5) it follows from formula (7) that
$u^{(r)}(x_0, y_0) = h(x_0, y_0)$. But $u$ is harmonic in $G'$ and therefore $u^{(r)}(x_0, y_0) =
u(x_0, y_0)$. Thus $u(x_0, y_0) = h(x_0, y_0)$, and the lemma is proved.


4. The theorem. THEOREM. Let $u$ be subharmonic in a domain $G$. Let
$G' + B'$ be a Dirichlet region interior to $G$. Denote by $h$ and $h^*$ the best harmonic
majorant and the least harmonic majorant of $u$ in $G'$ (see 1.2 and 1.7). Then
$h = h^*$.

Proof. Define

$$u^* = \begin{cases} u & \text{in } G - G', \\ h^* & \text{in } G'. \end{cases}$$

Then $u^*$ is subharmonic in $G$ (see 1.9). Denote by $\bar h^*$ the B. H. M. of $u^*$ in $G'$.
Since $u$ and $u^*$ are both subharmonic in $G$ and $u = u^*$ on $B'$, we have (see 1.3)
$\bar h^* = h$ in $G'$. Since $u^*$ is harmonic in $G'$, we have (see 3.1) $u^* = \bar h^*$ in $G'$.
But by definition $u^* = h^*$ in $G'$. Hence $h^* = h$, as stated in the theorem.
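The identity $h = h^*$ can be observed numerically in a simple case. The following sketch rests on assumptions that go beyond the text: $G'$ is the unit disc, $u = x^2$, and the least harmonic majorant is approached from inside through Poisson integrals over the concentric circles of radius $R < 1$ (a standard description of the least harmonic majorant, used here without proof). The name `poisson_extension` is hypothetical.

```python
import math


def poisson_extension(u, R, x, y, n=2000):
    """Value at (x, y) of the harmonic function in the disc of radius R about
    the origin whose boundary values are u on the circle of radius R.

    Uses the Poisson kernel with an equally spaced sum over the circle."""
    total = 0.0
    for j in range(n):
        t = 2.0 * math.pi * j / n
        bx, by = R * math.cos(t), R * math.sin(t)
        kernel = (R * R - x * x - y * y) / ((bx - x) ** 2 + (by - y) ** 2)
        total += kernel * u(bx, by)
    return total / n


def u(x, y):
    # Laplacian is 2 > 0, so this sample function is subharmonic
    return x * x


px, py = 0.3, 0.4
# Best harmonic majorant on the unit disc: boundary values of u on the unit circle.
h_value = poisson_extension(u, 1.0, px, py)
# Harmonic majorants on smaller discs; they increase with R.
inner_values = [poisson_extension(u, R, px, py) for R in (0.9, 0.99, 0.999)]
```

In closed form the extension over the circle of radius $R$ is $R^2/2 + (x^2 - y^2)/2$, so the inner values increase to $h$ as $R \to 1$, which is what the theorem predicts for this example.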


8 Observe that $u_n$ is subharmonic and therefore $\Delta u_n \ge 0$ (R 1, p. 335).
















5. Conclusion. 5.1. Let $u$ be subharmonic in a domain $G$. Denote by
$G' + B'$ a region interior to $G$, and by $h'$ a harmonic function in $G'$. Define
in $G$ a function $u'$ as follows:

$$u' = \begin{cases} u & \text{in } G - G', \\ h' & \text{in } G'. \end{cases}$$

If $u'$ is subharmonic in $G$, we shall say that the harmonic function $h'$ is admissible
for $u$ in $G'$.

5.2. We have then the following 

THEOREM. If $u$ is subharmonic in a domain $G$, and if $G' + B'$ is a Dirichlet
region interior to $G$, then there exists in $G'$ exactly one harmonic function which is
admissible in $G'$ for $u$ in the sense of 5.1.

Indeed, denote by $h$ the B. H. M. of $u$ in $G'$. Then $h$ is admissible for $u$
in $G'$, by 1.5. Let $h'$ be any harmonic function which is admissible for $u$ in $G'$,
and consider the corresponding function $u'$, as defined in 5.1. Let $\bar h'$ be the
B. H. M. of $u'$ in $G'$. Since $u$ and $u'$ are subharmonic in $G$ and $u = u'$ on $B'$,
we have $h = \bar h'$ in $G'$, by 1.3. Since $u'$ is harmonic in $G'$, we have $\bar h' = u'$ in $G'$
by 3.1. By definition $u' = h'$ in $G'$. Hence $h' = h$ in $G'$, and the theorem is
proved.

5.3. This theorem implies that the best harmonic majorant can be characterized
as follows: If G' + B' is a Dirichlet region interior to G, then, as is
obvious from 1.5 and 5.2, the best harmonic majorant of u in G' can be defined
as the unique harmonic function h with the property that the function equal to u
in G - G' and to h in G' is again subharmonic in G.


THE OHIO STATE UNIVERSITY.











SUMMABILITY OF DOUBLE FOURIER SERIES 
By J. J. GERGEN
Part I 


1.1. Introduction. We consider in Part I the M. Riesz and the Cesàro sums
of a simple series

(1.11) $\sum_{p=0}^{\infty} a_p,$

that is, the sums

$\sigma_\alpha(x) = \sum_{p<x} (x-p)^\alpha a_p, \qquad S_\alpha(m) = \sum_{p=0}^{m} A^{\alpha}_{m-p}\, a_p,$

where, in the Cesàro sums, $A^\alpha_p$ is the coefficient of $x^p$ in the expansion

$(1-x)^{-\alpha-1} = 1 + \sum_{p=1}^{\infty} A^\alpha_p x^p.$

Riesz's theorem that for $0 < \alpha$ the existence of either of the limits

$\lim_{m\to\infty} S_\alpha(m)/A^\alpha_m = L, \qquad \lim_{x\to\infty} \sigma_\alpha(x)/x^\alpha = L$

implies that of the other is of course well known.¹ In Theorems I and II below
we give some formulas expressing in a simple manner each of the sums in terms
of the other. On the basis of these theorems Riesz's result can readily be obtained.
The derivations do not in general follow Riesz's procedure; and as
regards conciseness there seems to be some advantage in following this second
method of approach. Theorems I and II can also be used to advantage to
simplify the extensions to two variables of Riesz's theorem given by Dr. S. B.
Littauer and the author.² In Part II of the present paper we shall apply these
theorems in connection with double Fourier series.
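
The two kinds of sums just defined can be computed directly for a concrete series. The following is a minimal sketch, not part of the paper: the function names and the test series 1 - 1 + 1 - ... are assumptions chosen for illustration. It evaluates both normalized means for order 1 and shows that they are close to the same value, in line with Riesz's theorem.

```python
def cesaro_coeffs(alpha, n):
    """Return [A^alpha_0, ..., A^alpha_n], the coefficients of (1 - x)^(-alpha-1)."""
    coeffs = [1.0]
    for p in range(1, n + 1):
        # Consecutive coefficients satisfy A_p = A_{p-1} * (p + alpha) / p.
        coeffs.append(coeffs[-1] * (p + alpha) / p)
    return coeffs


def cesaro_sum(a, alpha, m):
    """Cesaro sum S_alpha(m) = sum_{p=0}^{m} A^alpha_{m-p} a_p."""
    A = cesaro_coeffs(alpha, m)
    return sum(A[m - p] * a[p] for p in range(m + 1))


def riesz_sum(a, alpha, x):
    """Riesz sum sigma_alpha(x) = sum_{p<x} (x - p)^alpha a_p."""
    return sum((x - p) ** alpha * a[p] for p in range(len(a)) if p < x)


# Hypothetical test series 1 - 1 + 1 - ... (an assumption for illustration only).
a = [(-1) ** p for p in range(401)]
alpha, m, x = 1.0, 400, 400.5

cesaro_mean = cesaro_sum(a, alpha, m) / cesaro_coeffs(alpha, m)[m]
riesz_mean = riesz_sum(a, alpha, x) / x ** alpha
print(cesaro_mean, riesz_mean)
```

Both printed values lie near 1/2, the common generalized sum of this test series.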

THEOREM I. Let $0 < \alpha$. Let k be the integral part of $\alpha$. Let f(x) be continuous
with its derivatives $f', f'', \cdots, f^{(k+2)}$ for $0 \le x$, and satisfy

(1.12) $f(0) = f'(0) = \cdots = f^{(k+1)}(0) = 0, \qquad f(x) = \Gamma(\alpha + x)/\{\Gamma(x)\Gamma^2(\alpha + 1)\}$ for $1 \le x$.

Then the fractional derivative U(x),

$U(x) = f^{(\alpha+1)}(x) = 1/\Gamma(k + 1 - \alpha) \int_0^x (x - t)^{k-\alpha} f^{(k+2)}(t)\, dt,$

of order $\alpha + 1$ of f, is continuous for $0 \le x$, and we have

(1.13) $S_\alpha(m) = \int_0^{m+1} U(m + 1 - t)\, \sigma_\alpha(t)\, dt$ for $m = 0, 1, \cdots,$

(1.14) $U(x) = O(x^{-2}), \quad x \to \infty,$

(1.15) $U(x) = O(x^{k+1-\alpha}), \quad x \to +0.$

THEOREM II. Let $0 < \alpha$. Let

(1.16) $\varphi_0(x) = \Gamma(\alpha + 1) \sum_{p<x} A_p^{-\alpha-2}.$

Then the fractional integral V(x),

$V(x) = \varphi_\alpha(x) = 1/\Gamma(\alpha) \int_0^x (x - t)^{\alpha-1} \varphi_0(t)\, dt,$

of order $\alpha$ of $\varphi_0$, is continuous for $0 \le x$, and we have

(1.17) $\sigma_\alpha(x) = \sum_{p<x} V(x - p)\, S_\alpha(p)$ for $0 < x,$

(1.18) $V(x) = O(x^{-2}), \quad x \to \infty,$

(1.19) $V(x) = O(x^{\alpha}), \quad x \to +0.$

Received December 26, 1936; presented to the American Mathematical Society, April 11,
1936.

¹ For a presentation of Riesz's proof see Hobson, Theory of Functions of a Real Variable,
vol. 2, 1926, pp. 90-98.

² J. J. Gergen and S. B. Littauer, Continuity and summability for double Fourier series,
Transactions of the American Mathematical Society, vol. 38 (1935), pp. 401-435. This
paper will be denoted by A.


2.1. Lemma for Theorems I and II. Both Theorems I and II depend on
the following lemma. To simplify the writing we shall suppose in what follows
that x, y are positive numbers, and that m, n, p, q are integers, positive or 0.
We denote by M a number independent of m, n, p, q, x, y over the range

$0 < x, \quad 0 < y, \quad 0 \le m, \quad 0 \le n, \quad 0 \le p, \quad 0 \le q,$

or that part of this range specified. The symbols o, O refer to the behavior of
the function in question in the neighborhood of $\infty$ or $(\infty, \infty)$.

LEMMA 1. Let $0 < \delta < 1$. Let $F_0(x)$ be integrable over every finite interval
$0 \le x \le m$ and satisfy

$F_p(x) = x^{p-\delta-1}/\Gamma(p - \delta) + O(x^{p-\delta-2})$

for p = 0, 1, 2. Then

$F_\delta(x) = O(x^{-2}).$
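We have for each fixed b, 0 < b < 1,

$\Gamma(\delta) F_\delta(x) = x^{\delta} \left\{ \int_0^b + \int_b^1 \right\} (1 - t)^{\delta-1} F_0(tx)\, dt$

$\quad = (1 - b)^{\delta-1} x^{\delta-1} F_1(bx) + (\delta - 1)(1 - b)^{\delta-2} x^{\delta-2} F_2(bx)$

$\qquad + (\delta - 1)(\delta - 2)\, x^{\delta-2} \int_0^b (1 - t)^{\delta-3} F_2(tx)\, dt + x^{\delta} \int_b^1 (1 - t)^{\delta-1} F_0(tx)\, dt$

$\quad = E(b)\, x^{-1} + O(x^{-2}),$

where

$\Gamma(1 - \delta) E(b) = (1 - b)^{\delta-1} b^{-\delta} - (1 - b)^{\delta-2} b^{1-\delta} - (\delta - 2) \int_0^b (1 - t)^{\delta-3} t^{1-\delta}\, dt - \delta \int_b^1 (1 - t)^{\delta-1} t^{-\delta-1}\, dt.$

Now, differentiating, we see that E' vanishes for 0 < b < 1. Hence, using the
expansion

$\frac{1}{z - x} = \frac{1}{z} + \frac{x}{z(z+1)} + \frac{x(x+1)}{z(z+1)(z+2)} + \cdots$

with z = 1, $x = 1 - \delta$, we have

$\Gamma(1 - \delta) E(b) = \lim_{b \to +0} \left\{ (1 - b)^{\delta-1} b^{-\delta} - \delta \int_b^1 (1 - t)^{\delta-1} t^{-\delta-1}\, dt \right\}$

$\quad = 1 - \delta \int_0^1 \{ (1 - t)^{\delta-1} - 1 \}\, t^{-\delta-1}\, dt$

$\quad = 1 - \delta \{ 1 + (1 - \delta)/2! + (1 - \delta)(2 - \delta)/3! + \cdots \} = 0.$

The lemma follows.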
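
The last step of the proof rests on the series identity obtained from the expansion with z = 1, x = 1 - δ. As a numerical sanity check only (a sketch; the function name and the sample values of δ are assumptions for illustration), the partial sums of δ{1 + (1 - δ)/2! + (1 - δ)(2 - δ)/3! + ...} can be seen to approach 1 from below:

```python
def bracket_partial_sum(delta, terms):
    """Partial sum of 1 + (1-d)/2! + (1-d)(2-d)/3! + ... using `terms` terms."""
    total = 0.0
    term = 1.0  # the first term of the series
    for n in range(1, terms + 1):
        total += term
        # The next term is obtained from the current one by the factor (n - d)/(n + 1).
        term *= (n - delta) / (n + 1)
    return total


# Sample values of delta in (0, 1); these are illustrative choices.
for delta in (0.5, 0.9):
    value = delta * bracket_partial_sum(delta, 200000)
    print(delta, value)
```

The convergence is slow when δ is small, since the terms decay only like a power of the index.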


3.1. Proof of Theorem I. The continuity of $f^{(k+2)}$ implies the continuity of
$f^{(\alpha+1)}$ and the truth of (1.15). On the other hand, U satisfies

$f(x) = 1/\Gamma(\alpha + 1) \int_0^x (x - t)^{\alpha} U(t)\, dt$

because of the first half of (1.12). Hence

$\int_0^{m+1} \sigma_\alpha(t)\, U(m + 1 - t)\, dt = \sum_{p=0}^{m} a_p \int_0^{m+1-p} (m + 1 - p - u)^{\alpha} U(u)\, du$

$\quad = \Gamma(\alpha + 1) \sum_{p=0}^{m} a_p f(m + 1 - p) = S_\alpha(m).$

Accordingly, it is enough to prove that (1.14) holds.
If $\alpha$ is an integer, then f reduces to a polynomial of degree $\alpha$ for $1 \le x$. Hence

$U(x) = f^{(\alpha+1)}(x) = 0$ for $1 < x$.

This gives (1.14) in this case.








Suppose then that $\alpha$ is not an integer, that $\alpha = k + 1 - \delta$, where $0 < \delta < 1$.
From Stirling's formula³

$\log \Gamma(x) = (x - \tfrac12) \log x - x + \tfrac12 \log 2\pi + P(0)/x - \int_0^\infty P(t)/(t + x)^2\, dt,$

$P(t) = \sum_{n=1}^{\infty} \{\cos 2n\pi t\}/\{2n^2\pi^2\},$

it follows readily that

$\Gamma^2(\alpha + 1) f(x) = x^{\alpha} e^{\psi(x)},$

where

$D^p \psi(x) = O(x^{-p-1}).$

Hence,

$\Gamma(\alpha + 1) D^p f = x^{\alpha-p}/\Gamma(\alpha + 1 - p) + O(x^{\alpha-p-1}).$

Taking now in Lemma 1

$F_0(x) = \Gamma(\alpha + 1) f^{(k+2)}(x),$

we see that $F_\delta$ coincides with $\Gamma(\alpha + 1) U$ and that

$F_p(x) = \Gamma(\alpha + 1) D^{k+2-p} f(x) = x^{p-\delta-1}/\Gamma(p - \delta) + O(x^{p-\delta-2})$

for p = 0, 1, 2. Thus (1.14) holds, and this completes the proof.


4.1. Proof of Theorem II. We have

(4.11) $\varphi_\alpha(x) = \sum_{p<x} A_p^{-\alpha-2} (x - p)^{\alpha}.$

The continuity of V and the truth of (1.19) then follow. In addition,

$\sigma_\alpha(x) = \sum_{p<x} (x - p)^{\alpha} \sum_{q=0}^{p} A_{p-q}^{-\alpha-2} S_\alpha(q) = \sum_{p<x} V(x - p)\, S_\alpha(p).$

It remains then to consider (1.18).
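
Formula (1.17), with V given by (4.11), lends itself to a direct numerical check. The sketch below is illustrative only: the function names, the sample coefficients and the choices α = 0.5, x = 7.3 are assumptions, not taken from the paper. It computes the Riesz sum in two ways, once from its definition and once from the Cesàro sums through (1.17).

```python
def binomial_type_coeffs(beta, n):
    """Return [A^beta_0, ..., A^beta_n], the coefficients of (1 - x)^(-beta-1)."""
    coeffs = [1.0]
    for p in range(1, n + 1):
        coeffs.append(coeffs[-1] * (p + beta) / p)
    return coeffs


def cesaro_sum(a, alpha, m):
    """S_alpha(m) = sum_{p=0}^{m} A^alpha_{m-p} a_p."""
    A = binomial_type_coeffs(alpha, m)
    return sum(A[m - p] * a[p] for p in range(m + 1))


def riesz_sum(a, alpha, x):
    """sigma_alpha(x) = sum_{p<x} (x - p)^alpha a_p."""
    return sum((x - p) ** alpha * a[p] for p in range(len(a)) if p < x)


def V(alpha, x):
    """V(x) = sum_{p<x} A^{-alpha-2}_p (x - p)^alpha, as in (4.11), for x > 0."""
    n = int(x) + 1
    A = binomial_type_coeffs(-alpha - 2, n)
    return sum(A[p] * (x - p) ** alpha for p in range(n + 1) if p < x)


# Hypothetical sample data (assumptions for illustration).
a = [3.0, -1.0, 4.0, 1.0, -5.0, 9.0, 2.0, -6.0]
alpha, x = 0.5, 7.3

direct = riesz_sum(a, alpha, x)
via_cesaro = sum(V(alpha, x - p) * cesaro_sum(a, alpha, p)
                 for p in range(len(a)) if p < x)
print(direct, via_cesaro)
```

The two printed numbers agree up to rounding error, as (1.17) asserts.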
Let $\Delta_q x^n$ denote the q-th difference

(4.12) $\Delta_q x^n = \sum_{p=0}^{q} A_p^{-q-1} (x - p)^n.$

Then, for $1 \le q \le n$,


Hence 


³ See, for example, Bieberbach, Lehrbuch der Funktionentheorie, vol. 1, 1923, pp. 297-308.










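$\Delta_n x^n = n!, \qquad \Delta_{n+1} x^n = 0.$

Now if $\alpha$ is an integer, we have

$V(x) = \Delta_{\alpha+1} x^{\alpha} = 0$

for $\alpha + 1 < x$; and this is (1.18) in this case.
Suppose then that $\alpha$ is not an integer, that $\alpha = k + \delta$, where $0 < \delta < 1$.
Denoting by $\mu$ the largest integer less than x, we have

$\varphi_0(x)/\Gamma(\alpha + 1) = A_\mu^{-\alpha-1} = x^{-\alpha-1}/\Gamma(-\alpha) + O(x^{-\alpha-2}).$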


In addition, for each fixed positive n, 


n—l 
on(z)/T(a + 1) = Ante" + 1/n! > ASS Ae — wp + Q)" 


q=0 
n—l 
= 2" "*"/T(n —a) +0 a. oe os ree) 
q=0 
= 2" *"/T(n — a) + O(2"*”). 


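Taking

$F_0(x) = \varphi_k(x)/\Gamma(\alpha + 1),$

we see that $F_\delta$ coincides with $\varphi_\alpha/\Gamma(\alpha + 1)$, and that

$F_p(x) = \varphi_{k+p}(x)/\Gamma(\alpha + 1) = x^{p-\delta-1}/\Gamma(p - \delta) + O(x^{p-\delta-2})$

for p = 0, 1, 2. The lemma is then applicable and the theorem follows.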
Part II 


5.1. Introduction. We consider here a function f(u, v) integrable L over
$(0, 0; 2\pi, 2\pi)$ and with period $2\pi$ in each variable. We shall suppose in addition
that f is even in each variable, and shall consider only the Fourier series of f
at the origin, that is, the series

(5.11) $\sum_{m,n=0}^{\infty} a_{m,n},$

where

$a_{m,n} = \lambda_m \lambda_n \int_0^{\pi} \cos mu\, du \int_0^{\pi} \cos nv\, f(u, v)\, dv,$

$\lambda_0 = 1/\pi, \qquad \lambda_1 = \lambda_2 = \cdots = 2/\pi.$

These simplifications involve no loss in generality. Our object is to complete
some of the results on Cesàro and Riesz summability of the series (5.11) given
in A. We are interested in particular in the question whether the roles of summability
and continuity can be interchanged in the following Theorem VI of A.⁴

⁴ This question is raised on p. 415.











THEOREM VI. Suppose that

$0 \le \alpha < a - 2, \qquad 0 \le \beta < b - 2$

and that the series (5.11) is summable either $(C; \alpha, \beta)$ or $(R; \alpha, \beta)$ to sum s. Then,
if f is bounded (C; a, b) in the square $(+0, +0; \delta, \delta)$ for some $\delta > 0$, it follows
that f is continuous (C; a, b) with limit s.⁵

5.2. In connection with Theorem VI it might be noted here, incidentally, 
that it carries with it the following permanency theorem. 

THEOREM III. Suppose that a, b, $\alpha$, $\beta$ are non-negative numbers, that f is
continuous (C; a, b) with limit s, and that the series (5.11) is summable either
$(R; \alpha, \beta)$ or $(C; \alpha, \beta)$ to sum S; then

(5.21) $s = S.$

The proof is immediate. We set

$\xi = a + \alpha + 3, \qquad \eta = b + \beta + 3.$

Then, since f is continuous (C; a, b) with limit s, it is continuous $(C; \xi, \eta)$ with
limit s.⁶ On the other hand, since f is continuous $(C; \xi, \eta)$, it is bounded $(C; \xi, \eta)$
in some square $(+0, +0; \delta, \delta)$, $0 < \delta$. Thus, making use of our summability
hypothesis and Theorem VI, we see that f is continuous $(C; \xi, \eta)$ with limit S.
We conclude that (5.21) holds.

5.3. We turn now to the question on Theorem VI. The result that one
might expect would be that continuity (C; a, b) of f, plus ultimate boundedness,
either $(C; \alpha, \beta)$ or $(R; \alpha, \beta)$, of the series (5.11), implies summability of the
same type and order of the series for $\alpha$, $\beta$ sufficiently large. Peculiarly enough,
this is false. This we shall show in paragraphs 6.1 through 7.1, the result of
which we state in the form of

THEOREM IV. Corresponding to every set of non-negative numbers a, b, $\alpha$, $\beta$
there can be constructed a function f(u, v) integrable L over $(0, 0; \pi, \pi)$, even and
periodic with period $2\pi$ in each variable, and continuous (C; a, b), for which the
series (5.11) is ultimately bounded both $(R; \alpha, \beta)$ and $(C; \alpha, \beta)$, but is summable
neither $(R; \alpha, \beta)$ nor $(C; \alpha, \beta)$.

Actually we shall prove somewhat more than is stated. The function f we
construct will vanish on $(0, 0; \tfrac12\pi, \pi)$. In addition, the series (5.11) will not
only be ultimately bounded but bounded both $(R; \alpha, \beta)$ and $(C; \alpha, \beta)$.

5.4. The natural analogue of Theorem VI being false, one might ask if it is
possible to obtain any result of the same general character. In our final theorem,
the proof of which is in paragraphs 8.1 through 9.1, by requiring ultimate
boundedness of two different orders, we obtain a result of this kind.

THEOREM V. Suppose that

(5.41) $0 \le a < \xi, \quad 0 \le \alpha < \xi - 1, \quad 0 \le b < \eta, \quad 0 \le \beta < \eta - 1$

and that f is continuous (C; a, b) with limit s. Then, if the series (5.11) is ultimately
bounded both $(R; \alpha, \eta)$ and $(R; \xi, \beta)$, it is summable $(R; \xi, \eta)$ to sum s.
In addition, the corresponding result with C in place of R holds.

⁵ The definitions for R and C summability can be found on pp. 401-402 of A, those concerning
the continuity and boundedness of f, on p. 413.
⁶ This follows directly from Theorem III, p. 413 of A.


6.1. Lemmas for Theorem IV.
LEMMA 2. For every m there exists an even, bounded and measurable function
$I_m(u)$, with period $2\pi$, which vanishes on $(0, \tfrac12\pi)$, and whose Fourier series,

$I_m(u) \sim \sum_{p=0}^{\infty} c_{p,m} \cos pu,$

satisfies

$c_{0,m} = c_{1,m} = \cdots = c_{m,m} = 0, \qquad c_{m+1,m} \ne 0.$

To prove the lemma it is enough to show that, m being fixed, we can choose
constants $b_0, b_1, \cdots, b_{m+1}$, so that

(6.11) $\sum_{q=0}^{m+1} A_{p,q}\, b_q = 0 \text{ or } 1,$

according as $p \le m$ or $p = m + 1$, where

$A_{p,q} = \int_{\pi/2}^{\pi} \cos pu \cos qu\, du.$

For, with the b's so chosen, the even, periodic function $I_m$ which vanishes on
$(0, \tfrac12\pi)$ and coincides with

$\sum_{q=0}^{m+1} b_q \cos qu$

for $\tfrac12\pi \le u \le \pi$ has the desired properties.

Now a short calculation shows that $A_{p,q}$ is rational for $p \ne q$ and has the
value $1/(2\lambda_p)$ for p = q. Thus the determinant $\Delta$ of the A's can be written
in the form

$\Delta = \pi^{m+2}/2^{2m+3} + r_1 \pi^{m+1} + \cdots + r_{m+1} \pi + r_{m+2},$

where the r's are rational. Since $\pi$ is not algebraic, it follows that $\Delta$ is not 0.
Accordingly the equations (6.11) admit a solution, and the lemma follows.
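
The construction in the proof of Lemma 2 can be carried out numerically for a small m. The following is a sketch under stated assumptions: the helper names, the choice m = 2, the plain Gaussian elimination and the midpoint-rule quadrature are all illustrative choices, not the author's procedure. It solves the system (6.11) and then checks by independent quadrature that the first m + 1 cosine coefficients of the resulting function vanish while the next one does not.

```python
from math import pi, cos


def sin_half_pi(k):
    """Exact value of sin(k*pi/2) for an integer k."""
    return (0, 1, 0, -1)[k % 4]


def A(p, q):
    """A_{p,q}: integral over (pi/2, pi) of cos(pu) cos(qu) du, in closed form."""
    if p == q:
        return pi / 2 if p == 0 else pi / 4
    # The antiderivative sin((p-q)u)/(2(p-q)) + sin((p+q)u)/(2(p+q)) vanishes at u = pi.
    return -(sin_half_pi(p - q) / (2 * (p - q)) + sin_half_pi(p + q) / (2 * (p + q)))


def solve(matrix, rhs):
    """Gaussian elimination with partial pivoting for a small dense system."""
    n = len(rhs)
    M = [row[:] + [rhs[i]] for i, row in enumerate(matrix)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            factor = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= factor * M[col][c]
    x = [0.0] * n
    for i in reversed(range(n)):
        x[i] = (M[i][n] - sum(M[i][j] * x[j] for j in range(i + 1, n))) / M[i][i]
    return x


def lemma2_constants(m):
    """Constants b_0, ..., b_{m+1} solving the system (6.11)."""
    size = m + 2
    matrix = [[A(p, q) for q in range(size)] for p in range(size)]
    return solve(matrix, [0.0] * (m + 1) + [1.0])


def cosine_coefficient(b, p, steps=20000):
    """lambda_p times the integral of cos(pu) I_m(u) over (0, pi), by the midpoint rule.

    I_m vanishes on (0, pi/2), so only (pi/2, pi) contributes.
    """
    lam = 1 / pi if p == 0 else 2 / pi
    h = (pi / 2) / steps
    total = 0.0
    for i in range(steps):
        u = pi / 2 + (i + 0.5) * h
        total += cos(p * u) * sum(bq * cos(q * u) for q, bq in enumerate(b))
    return lam * total * h


m = 2  # illustrative choice
b = lemma2_constants(m)
coefficients = [cosine_coefficient(b, p) for p in range(m + 2)]
print(coefficients)
```

Up to quadrature error, the first m + 1 entries are 0 and the last equals $\lambda_{m+1} = 2/\pi$.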
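6.2. LEMMA 3. Let

(6.21) $0 < \cdots < \epsilon_2 < \epsilon_1 < \epsilon_0 = \pi; \qquad \epsilon_p/\epsilon_{p+1} \to \infty, \quad p \to \infty.$

Let $\varphi_p(v)$ be the even, periodic function which coincides with $v^{-\beta}$ for $\epsilon_{p+1} \le v < \epsilon_p$
and equals 0 elsewhere on $(0, \pi)$. Let $R_\beta(y; \varphi_p)$ be the Riesz mean,

$R_\beta(y; \varphi_p) = \sum_{n<y} (1 - n/y)^{\beta} \lambda_n \int_0^{\pi} \cos nv\, \varphi_p(v)\, dv,$

of order $\beta$, $0 \le \beta$, of the Fourier series of $\varphi_p$ at the origin. Then

(6.22) $|R_\beta(y; \varphi_p)| < M/\epsilon_{p+1}^{\beta},$

and, for every positive $\xi$,⁷

(6.23) $\epsilon_{p+1}^{\beta} R_\beta(\xi/\epsilon_{p+1}; \varphi_p) \to \frac{2\xi^{\beta}}{\pi} \int_{\xi}^{\infty} \gamma_{\beta+1}(v)/v^{\beta}\, dv, \quad p \to \infty.$

Suppose first that $0 < \beta$. We have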


(6.24) Rely; ep) = 2y/m [ vai (yv) ¢,(v) dv 


2u/| [ + | ross en de 


Fy(y) + F,(y), 


say. 
Applying the second mean value theorem, we get 
” 
| PF, | = /|2 y/(r e+ 1) va+1(yv) dv (€p41 aes ep) 
ep+l 


i <) 
' i 
< M/ebss | | ys41(v) dv < M., 6544 
0 
In addition, since e,/é@p11 — ©, we have 
fep/ep+i : 20 
eas Filé/epy1) = 2 t/m / yes(v)/v' dv > 2 2°/x / yasi(v)/v* dv. 
g 
On the other hand, for 1 < y, 


Fz |< My / vasi(yv) | gp(v) dv 
<M(y’+y") / (gr 49) gp(v) dv < M. 


We conclude that (6.22) holds for $1 \le y$, and that (6.23) holds. Since $R_\beta(y; \varphi_p)
= R_\beta(1; \varphi_p)$ for y < 1, the lemma follows for $0 < \beta$.
The proof for the case $\beta = 0$ is similar and can be omitted. In this case the
formula


e 


Roly; ?p) = 2 n, a yi(nv) v'dv 


ep+1 


ep . 
+ vx [ [sin nv feot v/2 — 2/v} + cos nv]/v’ dv, 


pti 
where n is the largest integer less than y, takes the place of (6.24). 


⁷ For the definition, properties and references concerning $\gamma_{\beta+1}$, see A, p. 416.




7.1. Proof of Theorem IV. Turning now to the proof of the theorem, we first
select an infinite sequence of functions $\{\Phi_n(u)\}$, n = 0, 1, ..., on the basis of
Lemma 2.

In the first step we set m = 0 and choose % so that &) has the properties 
specified for J, and satisfies , ; 


| | < 1, to = max | R,(x; o) | < 1. 
0<z 


In the second step, noting that 0 < & and that 
R(x; ®o) — 0, Im ow, 
since ) vanishes for |u| < 32, we choose an integer mp < m so that 


| Ra(x; Bo) |/to < 3 


form +12 2. We then choose 4, so that 6, has the properties specified for /,, 
and satisfies 


| @ | <1, t, = max | R(x; :) | < $b. 
0<z 
Continuing this process, in the (p + 1)-th step we select n,1 < mn, so that 
p—l 
(7.11) D> | Ra(x; Bz) |/te <2” for np +1 Sz. 
q=0 
We then choose ®, so as to have the properties specified for J,,, and satisfy 
|| <1, tp = max | R,(xz; ,) | < t,/2”. 
0<z 
The function f(u, v) we now define as 
f(u,v) = 22 @,(u) ¢p(0), 
p=0 


where the g’s are the functions of Lemma 3 with 
2 2 
€o = fF, a = hb, @=t, 


Since 0 < ty4, < t,/2”"’ < 4, these e’s satisfy (6.21). On the other hand, 
®,, gp are even, periodic and integrable, and 


Blu) = 0, D1 4,(u)dy(0) | < 0% 


the former for 0 S u S }z, the latter on (0,0; 2,7). Hence f has all the proper- 


ties asserted save perhaps those concerning summability. 
We consider the Riesz mean 








R.,3(z, y) = p » (1 — m/z)* Zz. (1 — n/y)* am, n- 
m<z. n<y 


We shall show that R,,, is bounded for 0 < 2,0 < y and that 
(7.12) lim | Ras! > 0. 


(2, y) > (®%, ©) 


From these two facts the theorem follows. For first, by Theorem II of A, 
boundedness (R; a, 8) implies both boundedness (C; a, 8) and the equivalence 
of summability (2; a, 8) with sum s to summability (C; a, 8) with sum s. And, 
secondly, by Theorem III, since f is continuous (C; 0, 0) with limit 0, the series 
(5.11) cannot be summable at all unless it is summable to 0. 

Now 


Ra, a(2, y) = : ss R.(z; ®,) Rsly; Pq): 


q=0 
In addition, for z S np41 + 1, p S 4, 
R(x; ®,) -_ 0; 


and accordingly, for z S ny, + 1, 
P 
Ras = Lo Ra(x; &,) Rly; ¢0)- 
q=0 
Now, forn + 1S n,+1 S 2,0 < y, we have 


p-l p-l 
DL R(x; &,) Ra(y; ve) | < M > | Ra(a; @,) |/te < M/2? 
4 


q=0 


by (6.22) and (7.11). On the other hand, for 0 < p,0 < 2,0 < y, 
| Ra(x; Py) Rs(y; ep) | < tp>M/tp = M. 


We conclude that R,,3 is bounded for 0 < zx, 0 < y. In addition, choosing z, 
so that 


i, = R.(z> ; ®,), 
we have 
Np +1 Sz S Ny + 1. 


Hence, using (6.23) and choosing ~£ so that 
/ vex (v)/v' dv ¥ 0, 
t 
we have 


lim | Ra,a(z,y)| = lim | Ra(x,; ,) RalE/t5; gp) | 
) pe 


(zt, y) (2, @ 


= 22/5 [ yssi(v)/v® dv | ~ 0. 
g 


This is (7.12) and completes the proof. 
















8.1. Lemmas for Theorem \V. 

Lemma 4. Let0 <a <a-—1. Let g(u) be integrable over (0, x), even, and 
with period 2x. Let (1.11) be the Fourier series of ¢ at the origin, and let it be 
bounded (C, a) or (R,a). (The two are equivalent.) Let mo be an arbitrary posi- 
tive integer. Then we can write 


galt) = Do gyolx) ap + gmo(2), 


p<mo 
where the g’s are measurable for 0 < x, go, +++ ; Gmo-1 are independent of y, and 
| gp(z) | < M* x" forp < mo, | gm (x) | S M* x* Lub. | oa(u)/u*|, 
mosu 
M* being independent of ¢ as well as x and p. In addition, an expansion of the 
same type is valid with 
| Gm (2) | S M*x* 1.u.b. | S.(m)/m* |. 
mosm 


Denoting by ¥. the number and H(u) the function of paragraph 9.1 of A, 
and applying Lemma 11 of A, we have 


galt) = pat™*** % H (xu) oa(u) du 


bs {vest a H(xu)(u — p)* au) ap + ea H(xujo.(u) du 


p<mo 


D Gp(x) ay + GYmg(2), 


p<mo 


say. Now by Lemma 10 of A, 


[ | H(xu) (u — p)*| du < a2" [ u*| H(u) | du < M*/z2*"", 
Pp 0 


[ '| H(zu) oa(u) | du < 27° Lub. | ca(u)/ut |. 
mo mosu 
Thus, since H(xu)(u — p)*, H(xu)o.(u) are superficially measurable, the first 
part of the theorem holds. 

To obtain the second part we use Theorem II. We have 


gaz) = Do {veo / , H(xu)V(u — p) au} Sa(p) 


p<mo 


re vaste f H(ru) >> V(u — p)S.(p) du; 


mosp<u 


and the second part follows from the properties of V and the definition of S,(p). 
8.2. Lemma 5. If0 S a < £,0 S B < n, and if the series (5.11) ts ultimately 













bounded both (R; a, n) and (R; &, 8), it is ultimately bounded (R; =, n). In addi- 
tion, the corresponding result with C in place of R holds.* 
Consider the Riesz sum 


o:. (2, y) = Do (x — p)* DL (y— 9)" ape. 


p<z q<y 
Under our hypotheses there is a positive mp such that 
|oa.g(t, y)| < Mr*y’, | oz.a(z, y)| < Ma‘y’, — for mo S z, m & y, 


Now, for mp S x, mp S y we have 


Ta + 1)T(E — a)o:,,/T(E + 1) = [ iy . iatin Ta, y(t, y) dt 


ym ‘ (x — t)**" (t — p)* dt } # (y — q)" ap, ¢ 
(8.21) p<mo JP a<y 


+ [ j (x — t)**" oa, a(t, y) dt 


ae +6, 
say, and 
(8.22) lo? |< May’, 
lo” |< Monel + lo” | 
(323) <M 2d e (y— gr?" t — gat D(x = p)* | ape | + Maty’ 
< F(z) y’, 
where F is independent of y. But 


mo 1 = 
ety t= > et | (x — t)**" (¢ — p)* at) 12 (i- a/v)" a». 
Pp 4 q<y 


p<mo 


is a sum of the form considered by Agnew in his lemma on double series,’ and 
rs — 1 
| (x — t)**" (t — p)* dt = o(1). 
P 


We conclude from the lemma and (8.23) that o” satisfies (8.22). The proof 
is then complete for Riesz sums. 

The proof for Cesaro sums is analogous, the formula taking the place of (8.21) 
being 

® The fact that (5.11) is a Fourier series is of no importance. 

® For a statement of and references to Agnew’s lemma see A, pp. 402 and 409. As stated 
in A, Agnew considered the case in which z, y are integral variables. The immediate 
extension to continuous variables is given on p. 409 of A. 



















Sz, (m,n) = >, Ab" S,.,, (p, n). 


p=0 


9.1. Proof of Theorem V. Since continuity (C; a, b) implies continuity 
(C; a’, b’) for a < a’, b < b’, we can suppose that 


OSa<a-1<t-1S5h, O58 <b-1<7-1S8k, 


where h is the integral part of a, k the integral part of b. Also, for the usual 
reasons, we can assume that s is 0. 
Consider then o;,,. Let 0 < «€ be arbitrary. Let 0 < 6 be chosen so that 


| fa,o(x, y) | < ex*y’ forz S b,y S 6. 


Let ¥, va, ¥ be the numbers and H(u), K(v) the functions of paragraph 7.2 of 
A with £ in place of a and 7 in place of 8. Then we have, by Lemma 9 of A, 


oe = pate yrtht I. seen wy H (eu) K(yr) fo,v(u, 0) au, ») 


| Ss ee ee 
(0, 0; 8, 5) (8, 8; @, 2) (8, 0; ©, «) (0, 8; 2, 2) 


patter? yt! HK f, 4 d(u, v) 


4 
a Dr; 


i=1 


say. Now by Lemma 6 of A, 
(9.11) |n| < Naty’ [ | H (u) | u* du [ | K(v) | v° dv < Na‘y’"e, 
0 0 


where N is independent of ¢, z, y. Next, since 1 < a, 1 < b, we have 
| fav(a, y) | < Mz*y’ ford S 2,5 Sy, 


and thus 
(9.12) lre|< may’ [ ee au | vw" dv < Mz’y’. 
3 


It is enough then to show in the case of R summability that rs = o(z’y"), rs. = 
o(z*y"). Moreover, since the situation is symmetrical we can confine ourselves 
to T3. 

We first apply Lemma 5, choosing a positive integer mp» so that 


| o:,4(z, y)| < Mr'y’ for mp S x, m S y. 


Then we note that 









{lonel+tin|/+ rl} +r] 

0 Ea + oa | K(yr) {z (x er p)* A» 
5 


p<z 


(9.13) [ cos pu fo, o(u, v) in| dv | 


Osx°y’ + mye | | K(yo) | aw [ | fo. o(u, v) | au 
\ t) 0 


O (a**"y’). 


F 
IIA 


II 


Next, we write 


ry = per'***" i H (xu) {r (y — q)’ | cos qu fa, o(u, v) du) du. 
5 0 J 


qa<yu 
Setting 
g(u,y) = D (y—@)"r [ cos qu f(u, v) dv, 
a<y 
we have, for a fixed y, 
eal 0) = Zy~a"e [cos 9 fou) de 


In addition, ¢ is an even periodic function of u. Its Fourier series at the origin is 


ps Xp {x (y — q)" Aq Gp, ‘, 


p=0 a<y 


which, for m S y, is bounded (R, a). Accordingly, using Lemma 4, we can 


write 
ga(u,y) = DS go(u) DU (y — 4)" Acta + 9(u,y) form < y, 


p<mo a<y 
where the g’s are measurable for 0 < u, go, «++ , Gm —1 are independent of y, and 
gp(x) | < Mx* for p < m, | g(x, y) | < Mz’*y" for m S&S y. 


Thus, 


ra— Dy ver [ H(xu) gp(u) du »» (y — q)" Ag, ¢ 


p<mo 
4 


O<z*t* [ g(u, y) H(axe du \\ 
\ J 


O (2*y’) = o(2°y’"). 


But the sum 













> {var"" [ H(xu)gp(u) au) ‘z (1 — q, ‘y)’ AgAp, : 


(9.14) p<mo 
= rx *y" + o(1) 


is again one to which Agnew’s lemma is applicable. Since 
Yer” [ H(xu)g,(u) du = O(x**) = o(1), 
3 


we conclude, on applying (9.13) and the lemma, that the sum is o(1). From 
(9.14) we then see that r; = o(zx*y"), and this completes the proof for Riesz 
sums. 

The proof for Cesaro sums is of the same type but slightly more complicated. 
We first choose f, f so as to satisfy the conditions imposed upon f in Theorem I 
with a replaced by £, ». Then we have 


m+1 n+1 
S:z,,(m,n) = [ U(m+1-—- ads | U(n + 1 — y) o,,(z, y) dy, 
0 0 


where U, U are the fractional derivatives f“*”, 7°*”. Letting «, 4, r; have the 
same significance as above, we write 


4 1 
S..= Um +1 ~ adr f 
0 


t=1 


n+ 


1 4 
On+1—yrndy= LS, 


m+ 
0 i=1 
say. 

We see readily by (9.11) that 


m+1 n+l 
[8] < Nom + Dnt De [ ite) | de [ | U@) | dy 


< N(m + 1)§(n + 1)" «, 
where N is independent of m,n, «. In addition, we have, by (9.12), 
| S2| < M(m + 1)* (n + 1)’ = o(m‘n’). 


Accordingly, because of the symmetry, it is enough to show that Ss; = o(m‘n”). 
We first apply Lemma 5, choosing a positive integer mo so that 


n. 


IIA 


| Seg) < Mmin" for mo < m, m 
Then we note that 
Inj] ¢$M(24+ 0)" yy" [ | K(yv) | aw [ | fo.o(u, v) | du 


< M(x + 1)" y’, 
and, accordingly, that 







vA 
a 
Il 


m+1 
O11 Se. +18) +514 [ | U(m + 1 — x) | dx 


‘4 n+l 
(9.15) / |Un+1—y)n| dy) if 
0 


| 
} be: 
O}m** n}, 
' 


Next, we write 
n+l oo 
[ U(n +1 —y)rsdy = vate [ H (xu) 
0 t) 


> Ai~dte i cos qv fa, o(u, v) av} du; 





q=0 ; 
and reasoning as with 7; we see that 4 
Cn+l a] n j 
tT? f+a+l1 * ” : 
i Orsdy — Lo var i H(xu)gp(u) du 2) AX «re dp, | 
0 p<mo q=0 





= vation [ H(xu) g*(u, n) du, 
é 


where the g*’s have properties analogous to the g’s. Thus, for m < n, 


m+l1 ) 
S; — po iv. [ U(m + 1 — z)a****? dz | H(au)g*(u) in} 
0 F 


pP<mo 
( n 
\2 Aj-aXq Gp, 7 


q=0 


m+l1 2 
vf U(m + 1 — 2)a**e* az [ H(xu)g*(u, n) du 
0 3 


m+l1 c) 
ofn | |U | 2z* az | ” di au} O(m*n"). ; 
0 3 ih 


Making use of (9.15) and applying Agnew’s lemma to the sum 


m+1 7 n 
Zz {mr [ U oite* dz / Hg* au) {2 y -., = ‘, 
pP<mo 0 q=0 


we conclude that S; = o(m‘n"), and this completes the proof. 


UNIVERSITY OF ROCHESTER AND DUKE UNIVERSITY.
















STRUCTURES AND GROUP THEORY. I 
By OYSTEIN ORE


In a recent paper on the foundations of abstract algebra¹ I have shown that
the principal results on algebraic domains are not primarily to be considered
as properties of the elements of the domain itself but as properties of certain
systems of distinguished subsets, like systems of subgroups, ideals, submoduli,
etc. These systems of subsets have the common characteristic property that
they form a structure, i.e., a system in which union and cross-cut of two elements
are defined. The theorems on algebraic domains are shown to be theorems on
structures. This explains the well-known similarity of several algebraic theories
and makes possible a unified structural theory applicable to all systems.

After this common foundation for the algebraic theories has been established, 
it is, however, not difficult to see that the various algebraic domains like fields, 
rings and groups have peculiar structural properties of their own. In certain 
cases it is even possible to characterize the domains by these properties. 

In the following we shall apply the principles of the theory of structures to the 
foundation of the theory of groups, that is, we base the theory as far as possible 
directly upon the properties of subgroups and eliminate the elements from 
theorems and proofs. This entails a certain simplification. More important, 
however, is the fact that this method, even in the elementary theory of groups 
which we consider in this paper, leads to new systematic points of view and 
interesting new results. 

In Chapter I we discuss the structure formed by all subgroups of a given 
group and indicate the general principle of duality. Furthermore, in the theory 
of structures we have constructed a quotient structure for any structure, while 
quotient systems A/B in groups have been defined only when B is normal in A. 
Hence we are led to the introduction of quotient systems for all subgroups. 
The algebraic system A/B is then a multi-group differing from ordinary groups 
only in the property that the product is not unique. 

In Chapter II we consider the law of isomorphism. When the assumption of 
isomorphism is weakened to co-set correspondence we are led to permutable 
groups. Such groups have the structural property expressed in Theorem 6. 
When structure isomorphism is required, one is led to the important type of 
subgroups which I have called quasi-normal subgroups. 

Some of the principal new results are to be found in Chapter III. The 


Received February 10, 1937. 

¹ Oystein Ore, On the foundation of abstract algebra I, Annals of Math., vol. 36 (1935),
pp. 406-437; II, ibid., vol. 37 (1936), pp. 265-292. These two papers will be cited as
Foundations I and II.




theorems of Jordan-Hölder and Schreier-Zassenhaus are analysed and several
extensions of these theorems are obtained. When only index relations are
wanted, certain weak permutability conditions on the chains are required.
When structure isomorphism is wanted we have to consider chains in which
every term is quasi-normal in the preceding. For such chains there exists a
complete analogue of the theorem of Jordan-Hölder.

The last chapter deals with the Dedekind structure formed by all normal 
subgroups. Here some of the results may be taken over directly from the theory 
of structures. The concept of direct similarity is shown to be equivalent to 
central isomorphism. The well-known duality between the center and the
anti-center, i.e., the quotient group with respect to the commutator group, is
easily explained by the general principle of duality. Among the further results
of this chapter I shall only mention the interesting self-dual Theorem 4. 


Chapter 1. Quotient systems 


1. Group structures.² The system of all subgroups of a given group G forms
a structure $\Sigma$. To any two given subgroups A and B of G there exists a cross-cut
(A, B) consisting of the common elements of A and B, and a union [A, B], which
is the subgroup generated by A and B. Since this structure has the property
that any finite or infinite set of subgroups has a cross-cut and union, we shall
say the structure $\Sigma$ of all subgroups of a given group G is closed.

The cross-cut and union satisfy the ordinary axioms for these operations: 


(A, B) = (B, A), [A, B] = [B, A], 

(A, A) = A, [A, A] = A, 

(A, (B, C)) = ((A, B), C), [A, [B, C]] = [[A, B], C], 
(A, [B, A]) = A, [A, (B, A)] = A. 
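
These axioms can be checked mechanically on a small example. The sketch below is an illustration only: the choice of the symmetric group on three letters and all function names are assumptions. It lists the subgroups, takes the cross-cut as the set of common elements and the union as the generated subgroup, and tests the eight identities over all triples of subgroups.

```python
from itertools import permutations, combinations

IDENTITY = (0, 1, 2)
G = list(permutations(range(3)))  # the symmetric group on three letters


def compose(s, t):
    """Product of permutations stored as tuples: (s*t)(i) = s(t(i))."""
    return tuple(s[i] for i in t)


def generated(elements):
    """Subgroup generated by the given elements (closure under products; G is finite)."""
    group = {IDENTITY} | set(elements)
    while True:
        new = {compose(x, y) for x in group for y in group} - group
        if not new:
            return frozenset(group)
        group |= new


# Every subgroup of this small group is generated by at most two elements.
subgroups = {generated([g]) for g in G} | {generated(pair) for pair in combinations(G, 2)}


def cross_cut(A, B):
    """(A, B): the common elements of A and B."""
    return A & B


def union(A, B):
    """[A, B]: the subgroup generated by A and B."""
    return generated(A | B)


axioms_hold = all(
    cross_cut(A, B) == cross_cut(B, A) and union(A, B) == union(B, A)
    and cross_cut(A, A) == A and union(A, A) == A
    and cross_cut(A, cross_cut(B, C)) == cross_cut(cross_cut(A, B), C)
    and union(A, union(B, C)) == union(union(A, B), C)
    and cross_cut(A, union(B, A)) == A and union(A, cross_cut(B, A)) == A
    for A in subgroups for B in subgroups for C in subgroups
)
print(len(subgroups), axioms_hold)
```

Interchanging the two functions `cross_cut` and `union` in the test leaves the list of identities unchanged, which is the duality noted in the text.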


It is obvious that the two operations correspond dualistically. This simple 
remark enables us to express an important principle which we shall repeatedly 
apply. 

PRINCIPLE OF DUALITY. To any structure theorem on groups corresponds a
dual obtained by interchanging the two concepts union and cross-cut.

In the following we shall consider various substructures of the structure $\Sigma$ of
all subgroups. Two structures $\Sigma_1$ and $\Sigma_2$ shall be said to be structure isomorphic,
$\Sigma_1 \sim \Sigma_2$, if there exists a one-to-one correspondence between the subgroups of the
two structures such that if

$A_1 \leftrightarrow A_2, \qquad B_1 \leftrightarrow B_2,$

then

$(A_1, B_1) \leftrightarrow (A_2, B_2), \qquad [A_1, B_1] \leftrightarrow [A_2, B_2].$

If $A \supset B$ are two subgroups of G, we shall denote the index of B in A by
{A:B}. The two structures $\Sigma_1$ and $\Sigma_2$ shall be said to be strongly structure
isomorphic if for any two subgroups $A_1 \supset B_1$ and the corresponding $A_2 \supset B_2$
we also have the same index

$\{A_1 : B_1\} = \{A_2 : B_2\}.$

In this case we shall write $\Sigma_1 \sim \Sigma_2$.

² Compare Foundation I, Chapter 1.


2. Quotient systems.³ In an arbitrary structure $\Sigma$ any two elements $A \supset B$
define a substructure consisting of all elements H between A and B

$A \supseteq H \supseteq B.$

This structure shall be called the quotient structure of A and B and we shall
denote it by $\mathfrak{A} = A/B$.

In the case where B is a normal subgroup of A, we ordinarily associate a 
quotient group with the symbol A/B. We shall now show that in the non- 
normal case it is also possible to associate with the quotient A/B an algebraic 
system which in the case of a normal subgroup reduces to the quotient group. 

We represent the group A by means of its left-hand co-sets with respect to B:

(1) $A = \{a_1 B, a_2 B, \cdots\} = a_1 B + a_2 B + \cdots,$

where the $a_i$ are suitably chosen elements of A. It should be observed that
it is sufficient to consider only one set of co-sets (1), since the right-hand co-set
representation may be obtained from (1) by the inverse automorphism $a \to a^{-1}$
in A:

$A = \{B a_1^{-1}, B a_2^{-1}, \cdots\} = B a_1^{-1} + B a_2^{-1} + \cdots.$

The totality of left-hand co-sets (1) shall be called the quotient system of A
with respect to B and we shall write

(2) $A/B = \{B_1, B_2, \cdots\},$

where the $B_i = a_i B$ denote the co-sets in some order. This quotient set will
now be made into an algebraic system through the definition of a multiplication.
This multiplication differs, however, from the ordinary multiplication in algebraic
systems by the property that two elements do not define a unique product
element, but a product set consisting of several elements. Let $B_\alpha$ and $B_\beta$ be
two co-sets. The elements of A contained in the complex

(3) $B_\alpha \cdot B_\beta = a_\alpha \cdot B \cdot a_\beta \cdot B$

will then give a certain subset of co-sets in (2)

$S = \{B_{\gamma_1}, B_{\gamma_2}, \cdots\},$

where each $B_\gamma$ is of the form

$B_\gamma = a_\alpha \cdot b \cdot a_\beta \cdot B,$

and b is an element of B. We now define the product of two co-sets as

(4) $B_\alpha \cdot B_\beta = \{B_{\gamma_1}, B_{\gamma_2}, \cdots\}.$

It is natural to extend this definition of the product of co-sets to the product of
arbitrary subsets

$S = \{B_{\alpha_1}, \cdots\}, \qquad T = \{B_{\beta_1}, \cdots\}$

of A/B by saying that $S \cdot T$ is the set of co-sets containing all products $B_{\alpha_i} \cdot B_{\beta_j}$.
This multiplication is obviously associative.

³ Foundation I, Chapter 3.

Let us now mention a few of the properties of the new multiplication. The 
product B,-Bs contains the co-set a,-ag-B. In the multiplication, the group 
B plays the réle of a right-hand unit element with the properties 


Ba-B = Ba, B-B. > Ba : 





for all co-sets B,. One may define an inverse of a co-set B, as a co-set Bg such 
that 


(5) B_\beta \cdot B_\alpha \supset B.
This relation is equivalent to the existence of elements b_1 and b_2 in B such that
(6) a_\beta \cdot b_1 \cdot a_\alpha = b_2.
This relation may always be satisfied by taking b_1 equal to the unit element
e and a_\beta = a_\alpha^{-1}. By taking the inverse of the relation (6), one finds that (5)
also implies
B_\alpha \cdot B_\beta \supset B.

Hence we have shown 

THEOREM 1. Every element of the quotient system A/B has at least one inverse,
and if B_\beta is an inverse of B_\alpha, then B_\alpha is an inverse of B_\beta.
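The multivalued product and Theorem 1 can be tried out on a small case. The following Python sketch is only an illustration under our own assumptions: A is taken to be the symmetric group on three letters, B the subgroup of order 2 generated by one transposition, and the helper names `mul`, `coset`, `product` and `inverses` are ours, not the author's.

```python
from itertools import permutations

def mul(p, q):
    """Product p*q of permutations stored as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

A = list(permutations(range(3)))      # assumed example: the symmetric group on 3 letters
e = (0, 1, 2)
B = frozenset({e, (1, 0, 2)})         # subgroup generated by the transposition of 0 and 1

def coset(a):
    """Left-hand co-set a*B as a frozenset of elements of A."""
    return frozenset(mul(a, b) for b in B)

cosets = {coset(a) for a in A}        # the quotient system A/B

def product(X, Y):
    """Multivalued product: all co-sets meeting the complex X*Y."""
    complex_xy = {mul(x, y) for x in X for y in Y}
    return {Z for Z in cosets if Z & complex_xy}

def inverses(X):
    """All co-sets Y whose product Y*X contains the unit co-set B."""
    return {Y for Y in cosets if B in product(Y, X)}

for X in cosets:
    print(sorted(X), "has", len(inverses(X)), "inverse co-set(s)")
```

Since the complex X*Y is a union of left-hand co-sets, a co-set that meets it lies entirely inside it, which is why the intersection test in `product` suffices.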

Let us determine when a product is unique. If a_\alpha \cdot B and a_\beta \cdot B have a unique
product, we must have

a_\alpha \cdot B \cdot a_\beta \cdot B = a_\alpha \cdot a_\beta \cdot B,

and hence for every b_1 in B there must exist another b_2 such that

a_\alpha \cdot b_1 \cdot a_\beta = a_\alpha \cdot a_\beta \cdot b_2

or

a_\beta^{-1} \cdot b_1 \cdot a_\beta = b_2.

This shows
THEOREM 2. A product B_\alpha \cdot B_\beta is unique if and only if B_\beta belongs to the
normaliser group of B in A.

Similarly one proves 
















STRUCTURES AND GROUP THEORY. I 153 


THEOREM 3. The normaliser group of B with respect to A consists of all co-sets 
with a unique inverse. 
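Theorems 2 and 3 lend themselves to a brute-force check. The sketch below assumes, purely for illustration, that A is the symmetric group on four letters and B the cyclic subgroup generated by a 4-cycle; all function names are ours. It collects the co-sets lying in the normaliser, the co-sets that give a unique product as right-hand factor, and the co-sets with a unique inverse, so that the three sets can be compared.

```python
from itertools import permutations

def mul(p, q):
    """Product p*q of permutations stored as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inv(p):
    """Inverse permutation."""
    r = [0] * len(p)
    for i, image in enumerate(p):
        r[image] = i
    return tuple(r)

A = list(permutations(range(4)))                 # assumed example: symmetric group on 4 letters
e = tuple(range(4))
c = (1, 2, 3, 0)                                 # the 4-cycle 0 -> 1 -> 2 -> 3 -> 0
B = frozenset({e, c, mul(c, c), mul(c, mul(c, c))})

# Elements a of A with a*B*a^-1 = B.
normaliser = {a for a in A if {mul(mul(a, b), inv(a)) for b in B} == set(B)}

def coset(a):
    return frozenset(mul(a, b) for b in B)

cosets = {coset(a) for a in A}

def product(X, Y):
    """Multivalued product: all co-sets meeting the complex X*Y."""
    complex_xy = {mul(x, y) for x in X for y in Y}
    return {Z for Z in cosets if Z & complex_xy}

def inverses(X):
    return {Y for Y in cosets if B in product(Y, X)}

in_normaliser = {X for X in cosets if X <= normaliser}
unique_right_factor = {Y for Y in cosets
                       if all(len(product(X, Y)) == 1 for X in cosets)}
unique_inverse = {X for X in cosets if len(inverses(X)) == 1}

print(in_normaliser == unique_right_factor == unique_inverse)
```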


3. Multigroups. The properties of a quotient system naturally lead to the 
definition of a new abstract system which may be called a multigroup. It may 
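be defined as a system \mathfrak{M} of elements satisfying the following axioms:

I. The product of two elements B_\alpha and B_\beta is a subset of \mathfrak{M}:

B_\alpha \cdot B_\beta = \{B_{\gamma_1}, B_{\gamma_2}, \dots\}.
More generally, the product of two sets of elements
S = \{B_{\alpha_1}, B_{\alpha_2}, \dots\}, T = \{B_{\beta_1}, B_{\beta_2}, \dots\}

is the set defined by all products B_{\alpha_i} \cdot B_{\beta_j}.
II. The multiplication is associative.
III. There exists a unique right-hand unit element B_\epsilon such that for all B_\alpha

B_\alpha \cdot B_\epsilon = B_\alpha, B_\epsilon \cdot B_\alpha \supset B_\alpha.
IV. To each B_\alpha there exists at least one inverse B_{\alpha'} such that
B_\alpha \cdot B_{\alpha'} \supset B_\epsilon, B_{\alpha'} \cdot B_\alpha \supset B_\epsilon.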


This definition shows that a multigroup differs from an ordinary group only
in the property that the product is not unique. Obviously any quotient system
A/B is a multigroup. In the case of a multigroup arising from co-sets in a group
there are further axiomatic conditions satisfied which have not been enumerated
above. Although these multigroups have several interesting properties, the
study of these must be reserved for a later occasion. I shall only mention
here some recent investigations by Marty⁴ on the subject. The multigroups
are closely related to the “Mischgruppen” studied by Loewy.⁵ It seems to me
that the representation of the quotient systems by means of multigroups is preferable
to the representation by means of “Mischgruppen”, since one avoids
the double system of operators which the latter theory introduces.

We shall complete these remarks by proving one theorem which is of importance
for the sequel. Let A \supset B be two subgroups of G and A_1 any subgroup
of A containing B. The group A_1 consists of certain co-sets of A/B and
these co-sets are seen to form a submultigroup of A/B. We shall show conversely
that every submultigroup A_1 of A/B constitutes a subgroup of A containing
B when considered as a set of elements in A. According to the definition


4 F. Marty, Sur les groupes et hypergroupes attachés à une fonction rationelle, Annales de
l'École Normale Supérieure, (3), vol. 53 (1936), pp. 83-123. A list of further publications
on this subject by the same author is given in the introduction. After the completion of
this manuscript appeared a paper by H. S. Wall, Hypergroups, Am. Journal of Math., vol.
59 (1937), pp. 77-98.

5 A. Loewy, Über abstrakt definierte Transmutationssysteme oder Mischgruppen, Journal
für Mathematik, vol. 157 (1927), pp. 239-254. See also R. Baer, Sitzungsberichte, Heidelberg,
1928.











A_1 contains the product of any of its co-sets. Hence according to a preceding
remark it contains the product of any of its elements. To prove that A_1 contains
the inverse a_1^{-1} of any of its elements a_1, we recall that A_1 contains an
inverse co-set a_2 \cdot B to a_1 \cdot B. As in (6), we then find a_2 \cdot b_1 \cdot a_1 = b_2, where b_1 and
b_2 belong to B. This shows that a_1^{-1} belongs to the co-sets B \cdot a_2 \cdot B which are
contained in A_1. Hence we have

THEOREM 4. The submultigroups of a quotient system A/B when considered
as a set of elements in A are identical with the subgroups of A containing B.


4. Concepts in group structures. We shall now define various concepts 
which are of importance for the theory of group structures. We consider as 
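before the structure \Sigma of all subgroups of a given group G. A unit quotient in \Sigma
is any quotient


\mathfrak{E} = A/A.
The product of quotients may be introduced in the following manner: If we have
\mathfrak{A} = A/B, \mathfrak{B} = B/C, \mathfrak{C} = A/C,
we shall write
\mathfrak{C} = \mathfrak{A} \times \mathfrak{B}, \mathfrak{A}^{-1} \times \mathfrak{C} = \mathfrak{B}, \mathfrak{C} \times \mathfrak{B}^{-1} = \mathfrak{A},


and say that \mathfrak{A} is a left-hand and \mathfrak{B} a right-hand factor of \mathfrak{C}. It is often
convenient to consider any subgroup A or even the group G itself as a quotient
with respect to the unit element
\mathfrak{A} = A/E, \mathfrak{G} = G/E.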

The system of all quotients may be made into a structure, the quotient structure
of \Sigma. When
\mathfrak{A}_1 = A_1/B_1, \mathfrak{A}_2 = A_2/B_2
we define
(\mathfrak{A}_1, \mathfrak{A}_2) = (A_1, A_2)/(B_1, B_2), [\mathfrak{A}_1, \mathfrak{A}_2] = [A_1, A_2]/[B_1, B_2].

We shall apply these concepts mainly in the case where \mathfrak{A}_1 and \mathfrak{A}_2 have the same
denominator


(7) \mathfrak{A}_1 = A_1/B, \mathfrak{A}_2 = A_2/B,
and hence
(8) (\mathfrak{A}_1, \mathfrak{A}_2) = (A_1, A_2)/B, [\mathfrak{A}_1, \mathfrak{A}_2] = [A_1, A_2]/B.


The two quotients \mathfrak{A}_1 and \mathfrak{A}_2 will be said to be relatively prime when their cross-cut
is a unit quotient, i.e., when


(A_1, A_2) = B.


6 Foundations I, Chapter 3.

























We may also apply the principle of duality and define the duals of these 
concepts. Corresponding to (7) we consider quotients with the same numerator 


(9) \mathfrak{B}_1 = A/B_1, \mathfrak{B}_2 = A/B_2.
We define their left-hand union and cross-cut respectively as
[\mathfrak{B}_1, \mathfrak{B}_2]_l = A/(B_1, B_2), (\mathfrak{B}_1, \mathfrak{B}_2)_l = A/[B_1, B_2].


The two quotients (9) are l.h. relatively prime if their l.h. cross-cut is a unit
quotient, i.e., if
[B_1, B_2] = A.
Next it is convenient to introduce the notion of transformation. Let
(10) \mathfrak{A} = A/B, \mathfrak{C} = C/B
be two quotients with the same denominator. The quotient
(11) \mathfrak{A}' = [\mathfrak{A}, \mathfrak{C}] \times \mathfrak{C}^{-1} = \mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1} = [A, C]/C


is then said to have been obtained from \mathfrak{A} through (right-hand) transformation
with \mathfrak{C}.

The transformation has a series of simple properties which we shall now 
enumerate. The proofs follow directly from the definition (11). 


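Lemma 1. [\mathfrak{A}, \mathfrak{C}] = \mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1} \times \mathfrak{C}.
Lemma 2. (\mathfrak{B} \times \mathfrak{C})\mathfrak{A}(\mathfrak{B} \times \mathfrak{C})^{-1} = \mathfrak{B}(\mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1})\mathfrak{B}^{-1}.
Lemma 3. \mathfrak{C}[\mathfrak{A}, \mathfrak{B}]\mathfrak{C}^{-1} = [\mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1}, \mathfrak{C}\mathfrak{B}\mathfrak{C}^{-1}].
Lemma 4. (\mathfrak{C} \times \mathfrak{B})(\mathfrak{A} \times \mathfrak{B})(\mathfrak{C} \times \mathfrak{B})^{-1} = \mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1}.
Lemma 5. \mathfrak{C}(\mathfrak{A} \times \mathfrak{B})\mathfrak{C}^{-1} = \mathfrak{C}_1\mathfrak{A}\mathfrak{C}_1^{-1} \times \mathfrak{C}\mathfrak{B}\mathfrak{C}^{-1},
where
\mathfrak{C}_1 = \mathfrak{B}\mathfrak{C}\mathfrak{B}^{-1}.


This rule for the transformation of a product may be extended to an arbitrary
number of factors.


Lemma 6. [\mathfrak{A}_1 \times \mathfrak{A}, \mathfrak{B}_1 \times \mathfrak{B}] \times [\mathfrak{A}, \mathfrak{B}]^{-1} = [\mathfrak{A}_2\mathfrak{A}_1\mathfrak{A}_2^{-1}, \mathfrak{B}_2\mathfrak{B}_1\mathfrak{B}_2^{-1}],
where
\mathfrak{A}_2 = \mathfrak{A}\mathfrak{B}\mathfrak{A}^{-1}, \mathfrak{B}_2 = \mathfrak{B}\mathfrak{A}\mathfrak{B}^{-1}.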
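The transformation rules can be tested in a structure where unions and cross-cuts are easy to compute. The following sketch assumes, as an illustration of our own, the subgroups of the cyclic group Z_60: the subgroup dZ_60 is represented by the divisor d, the union of two subgroups corresponds to the greatest common divisor, and dZ contains eZ exactly when d divides e. A quotient A/B is stored as the pair (A, B), and the names `times`, `transform` and `lemma5_holds` are hypothetical helpers. The function checks the rule for the transformation of a product (Lemma 5) over all admissible choices.

```python
from math import gcd

N = 60
DIVISORS = [d for d in range(1, N + 1) if N % d == 0]

def union(d, e):
    """[dZ, eZ]: the subgroup generated by both is gcd(d, e)Z."""
    return gcd(d, e)

def times(q1, q2):
    """Product of quotients: A/B x B/C = A/C."""
    assert q1[1] == q2[0], "denominator of the first must be numerator of the second"
    return (q1[0], q2[1])

def transform(c, a):
    """Right-hand transform [A, C]/C of the quotient a = A/B by c = C/B."""
    assert a[1] == c[1], "both quotients need the same denominator"
    return (union(a[0], c[0]), c[0])

def lemma5_holds(A, B, C, D):
    """Compare the transform of A/B x B/D by C/D with the product of transforms."""
    qa, qb, qc = (A, B), (B, D), (C, D)
    left = transform(qc, times(qa, qb))
    c1 = transform(qb, qc)                      # the quotient [B, C]/B
    right = times(transform(c1, qa), transform(qc, qb))
    return left == right

# Chains A >= B >= D and C >= D, expressed through divisibility of the divisors.
checks = [lemma5_holds(A, B, C, D)
          for D in DIVISORS for B in DIVISORS if D % B == 0
          for A in DIVISORS if B % A == 0
          for C in DIVISORS if D % C == 0]
print(all(checks))
```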


The following theorem is of considerable importance. 

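THEOREM 5. If a product \mathfrak{B} \times \mathfrak{C} has a r.h. factor \mathfrak{A}, then \mathfrak{B} has the r.h. factor
\mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1}.

Proof. Since \mathfrak{B} \times \mathfrak{C} has the two factors \mathfrak{A} and \mathfrak{C}, we can write


\mathfrak{B} \times \mathfrak{C} = \mathfrak{B}_1 \times [\mathfrak{A}, \mathfrak{C}],

and division by \mathfrak{C} gives
\mathfrak{B} = \mathfrak{B}_1 \times \mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1}.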


Dually corresponding to right-hand transformation we also have left-hand 
transformation, which may be defined as follows. Let 


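(12) \mathfrak{A} = A/B, \mathfrak{D} = A/D
be two quotients with the same numerator. The quotient
(13) \mathfrak{A}'' = \mathfrak{D}^{-1} \times [\mathfrak{A}, \mathfrak{D}]_l = \mathfrak{D}^{-1}\mathfrak{A}\mathfrak{D} = D/(B, D)


is then the left-hand transform of \mathfrak{A} by \mathfrak{D}.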

In the special case where the quotients \mathfrak{A} and \mathfrak{C} in (10) are relatively prime,
i.e., (A, C) = B, we shall say that \mathfrak{A}' in (11) has been obtained from \mathfrak{A} by a
r.h. similarity transformation. In the same manner \mathfrak{A}'' in (13) is obtained from
\mathfrak{A} in (12) by a l.h. similarity transformation if \mathfrak{A} and \mathfrak{D} are l.h. relatively prime.
The l.h. and r.h. similarity transformations are inverse in the sense that the l.h.
similarity transformation of \mathfrak{A}' in (11) by [A, C]/A gives \mathfrak{A}, while the r.h. similarity
transformation of \mathfrak{A}'' in (13) by B/(B, D) also gives \mathfrak{A}.

Let us finally mention 

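THEOREM 6. Any right-hand union of two r.h. relatively prime quotients with
the same denominator


\mathfrak{M} = [\mathfrak{A}, \mathfrak{B}], (\mathfrak{A}, \mathfrak{B}) = \mathfrak{E}
is also the l.h. union
\mathfrak{M} = [\mathfrak{A}', \mathfrak{B}']_l, (\mathfrak{A}', \mathfrak{B}')_l = \mathfrak{E}
of two l.h. relatively prime quotients
\mathfrak{A}' = \mathfrak{B}\mathfrak{A}\mathfrak{B}^{-1}, \mathfrak{B}' = \mathfrak{A}\mathfrak{B}\mathfrak{A}^{-1}


obtained from \mathfrak{A} and \mathfrak{B} by similarity transformation.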

The proof is immediate. The dual of Theorem 6 is obviously also true. 
Furthermore it may be extended to the union of an arbitrary number of quotients 
each relatively prime to the union of the rest. 


Chapter 2. The law of isomorphism 


1. Correspondences. A fundamental theorem in group theory is the ordinary
law of isomorphism:

Let A and B be two subgroups such that A is normal in [A, B]. Then (A, B) is
normal in B and there exists an (element) isomorphism between the two quotient
groups

[A, B]/A \cong B/(A, B).

We shall now consider what remains of this law when we drop the condition 
of normality and consider two arbitrary subgroups A and B of G. We denote 
their union and cross-cut by 


























M = [A, B], D = (A, B), 
and we wish to compare the two quotient systems 
\mathfrak{A} = M/A, \mathfrak{B} = B/D.
The quotient system \mathfrak{B} is made up by certain co-sets
(1) \mathfrak{B} = \{\dots, b_i \cdot D, \dots\}.


Those co-sets of \mathfrak{A} which have the same multipliers as in (1) we shall call the
co-sets of \mathfrak{A} corresponding to \mathfrak{B}. They form a subset


(2) \mathfrak{A}' = \{\dots, b_i \cdot A, \dots\}


of \mathfrak{A}. These co-sets are all different, and hence we have the well-known
THEOREM 1. For any two groups A and B the index relation
\{[A, B] : A\} \geq \{B : (A, B)\}
holds.

Obviously the corresponding co-sets (2) form a generating system for the
quotient system \mathfrak{A} in the sense that one obtains \mathfrak{A} by taking all finite products
of them. Let us now determine when they actually constitute the whole group
[A, B]. In this case each element of [A, B] must belong to some co-set b \cdot A and
hence each product a_1 \cdot b_1 also has a representation b \cdot a. We shall say that two
groups A and B are permutable if

[A, B] = A \cdot B = B \cdot A,
i.e., if to any element a_1 in A and b_1 in B there exist other elements such that
a_1 \cdot b_1 = b_2 \cdot a_2. If A and B are permutable, there exists a one-to-one correspondence
b \cdot A \leftrightarrow b \cdot D between \mathfrak{A} and \mathfrak{B}. We shall call this the regular co-set
correspondence and write \mathfrak{A} \leftrightarrow \mathfrak{B}.

THEOREM 2. The necessary and sufficient condition that there exist a regular
correspondence between the quotients


\mathfrak{A} = [A, B]/A, \mathfrak{B} = B/(A, B)


is that A and B be permutable.

A consequence is

THEOREM 3. Let the index \{B : (A, B)\} be finite. The necessary and sufficient
condition that


(4) \{[A, B] : A\} = \{B : (A, B)\}


is that A and B be permutable.
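A small computation illustrates Theorems 1 to 3. The sketch below is an illustration under our own assumptions: the groups are subgroups of the symmetric group on three letters, and `generate`, `permutable` and `indices` are hypothetical helper names. It returns the two indices of Theorem 1 so that they can be compared with the permutability test.

```python
def mul(p, q):
    """Product p*q of permutations stored as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def generate(gens, n):
    """Subgroup of the symmetric group on n letters generated by gens."""
    identity = tuple(range(n))
    group = {identity}
    frontier = [identity]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = mul(x, g)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return frozenset(group)

def permutable(A, B):
    """A and B are permutable when the complexes A*B and B*A coincide."""
    return {mul(a, b) for a in A for b in B} == {mul(b, a) for a in A for b in B}

def indices(A, B, n):
    """The pair ({[A, B] : A}, {B : (A, B)})."""
    union = generate(list(A | B), n)     # [A, B]
    cross_cut = A & B                    # (A, B)
    return len(union) // len(A), len(B) // len(cross_cut)

A1 = generate([(1, 0, 2)], 3)            # generated by the transposition of 0 and 1
B1 = generate([(2, 1, 0)], 3)            # generated by the transposition of 0 and 2
B2 = generate([(1, 2, 0)], 3)            # generated by a cycle of order 3

print(indices(A1, B1, 3), permutable(A1, B1))
print(indices(A1, B2, 3), permutable(A1, B2))
```

In the first pair the two transposition groups are not permutable and the first index is strictly larger; in the second pair the groups are permutable and the indices agree.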

One may now ask when there exists a structure isomorphism between the two
quotient systems \mathfrak{A} and \mathfrak{B}. One can always obtain a correspondence between
the subgroups of \mathfrak{A} and \mathfrak{B} in the following manner: Let us denote by \bar{A} and \bar{B}
arbitrary subgroups in \mathfrak{A} and \mathfrak{B}


(5) [A, B] \supseteq \bar{A} \supseteq A, B \supseteq \bar{B} \supseteq (A, B).
The correspondence
(6) \bar{A} \to (\bar{A}, B), \bar{B} \to [A, \bar{B}]


will then be called the regular structure correspondence between \mathfrak{A} and \mathfrak{B}. We
can then prove

THEOREM 4. The necessary and sufficient condition for the regular structure
correspondence to establish a structure isomorphism

(7) [A, B]/A \sim B/(A, B)

is that for every \bar{A} and \bar{B} defined by (5) we have

(8) \bar{A} = [A, (B, \bar{A})], \bar{B} = (B, [A, \bar{B}]).


Proof.⁷ The relations (8) are obviously necessary and sufficient in order
that the regular structure correspondence give a one-to-one correspondence
between the subgroups of \mathfrak{A} and \mathfrak{B}. Furthermore, when they are satisfied we
find for \bar{B}_1 and \bar{B}_2 with the images \bar{A}_1 and \bar{A}_2


[\bar{B}_1, \bar{B}_2] \to [A, \bar{B}_1, \bar{B}_2] = [\bar{A}_1, \bar{A}_2],
(\bar{B}_1, \bar{B}_2) = (B, \bar{A}_1, \bar{A}_2) \to [A, (B, \bar{A}_1, \bar{A}_2)] = (\bar{A}_1, \bar{A}_2).


The conditions (8) are seen to hold if A has the following property: For any
C and D in M = [A, B] we have


(9) (C, [A, D]) = [D, (A, C)], when C \supseteq D,
    (C, [A, D]) = [A, (C, D)], when C \supseteq A.
In this case we shall say that A is structure normal in M.

Let us finally suppose that A and B are permutable so that the regular co-set
correspondence gives a one-to-one correspondence between the co-sets of \mathfrak{A} and
\mathfrak{B}. We shall now make the stronger assumption that to every a in A and b in B
there exists an exponent n_{a,b} such that


(10) a \cdot b = b^{n_{a,b}} \cdot a',
where a' belongs to A. This condition is equivalent to saying that A shall
be permutable with all subgroups of B.

We can then prove 

THEOREM 5. Let the group A be permutable with all subgroups of B. Then
there exists a strong structure isomorphism


(11) [A, B]/A \simeq B/(A, B),
and the regular co-set correspondence between the two quotients also gives the regular
structure correspondence.


7 This result has been derived for any structure in the paper by O. Ore, On the theorem of
Jordan-Hölder, Transactions Amer. Math. Soc., vol. 41 (1937), pp. 266-275.













Proof. Any subgroup \bar{B} is made up by co-sets b_1 D, b_2 D, \dots, and to these
correspond the co-sets b_1 A, b_2 A, \dots. These co-sets must again form a group,
since all elements in a product b_1 A \cdot b_2 A are contained in co-sets of the form
b_1 b_2^n \cdot A = b_3 \cdot A. Hence we have \bar{B} \to [A, \bar{B}], and it is seen immediately that
the first relation (8) is satisfied. In the same manner any group \bar{A} is made
up by co-sets bA where the b form a group \bar{B} = (\bar{A}, B) and also the second relation
(8) is satisfied. Hence we have shown that the regular co-set correspondence
gives a structure isomorphism identical with (5). The fact that (11) is a
strong structure isomorphism follows from Theorem 2 or 3, since A is permutable
with every \bar{B}.

It is also easily seen that in this case A is structure normal in [A, B]. 


2. Permutable groups. The preceding results indicate the necessity of a
study of the permutable groups.⁸ We shall now give some of their simplest
properties.

THEOREM 6. If A and B are permutable groups, then
(12) (C, [A, B]) = [A, (C, B)], C \supseteq A,
     (D, [A, B]) = [B, (A, D)], D \supseteq B.


Proof. To prove, for instance, the first of these relations, we observe that the
right-hand side is always contained in the left-hand side for any groups. If now
A and B are permutable, every element of [A, B] has the form a \cdot b. If this
element is to be at the same time an element c of C, we must have c = a \cdot b,
where obviously b belongs to C, since C \supseteq A.
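The first relation (12) can be confirmed by exhaustive search in a small group. The sketch below is an illustration under our own assumptions: the ambient group is the symmetric group on four letters, whose subgroups all need at most two generators, and the names `generate`, `union`, `permutable` and `dedekind_holds` are hypothetical helpers.

```python
from functools import lru_cache
from itertools import permutations

N = 4
ELEMENTS = list(permutations(range(N)))
IDENTITY = tuple(range(N))

def mul(p, q):
    """Product p*q of permutations stored as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(N))

@lru_cache(maxsize=None)
def generate(gens):
    """Subgroup generated by a frozenset of permutations."""
    group = {IDENTITY}
    frontier = [IDENTITY]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = mul(x, g)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return frozenset(group)

# All subgroups, obtained from all pairs of generators.
SUBGROUPS = {generate(frozenset({g, h})) for g in ELEMENTS for h in ELEMENTS}

def union(A, B):
    """[A, B]: the subgroup generated by A and B."""
    return generate(A | B)

def permutable(A, B):
    return {mul(a, b) for a in A for b in B} == {mul(b, a) for a in A for b in B}

def dedekind_holds(A, B, C):
    """First relation of (12): (C, [A, B]) = [A, (C, B)], where C contains A."""
    return C & union(A, B) == union(A, C & B)

violations = [(A, B, C)
              for A in SUBGROUPS for B in SUBGROUPS if permutable(A, B)
              for C in SUBGROUPS if A <= C and not dedekind_holds(A, B, C)]
print(len(SUBGROUPS), len(violations))
```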

Furthermore let us mention 

Lemma 1. If A is permutable with B and C, then A is permutable with [B, C]. 

Lemma 2. A group is permutable with all its subgroups. 

Lemma 3. If B \supseteq B' and B' is permutable with A, then B' is permutable
with (A, B).

Proof. If d is an element of (A, B), then b' \cdot d = c \cdot b'_1, where c belongs to A
and also to B.

Lemma 4. If A \supseteq A' and B \supseteq B' and A' and B' are permutable, (A', B) and
(A, B') are permutable.

The proof follows by two applications of the preceding lemma.

THEOREM 7. Let A \supseteq A' and B \supseteq B'. Furthermore let A' be permutable
with (A, B) and (A, B'), while B' is permutable with (B, A) and (B, A'). Then
there exists a one-to-one correspondence


(13) \mathfrak{A} = [A', (A, B)]/[A', (A, B')] \leftrightarrow [B', (A, B)]/[B', (B, A')] = \mathfrak{B}
between the co-sets of these two quotients.


8 Various properties of permutable groups have been given by E. Maillet, Sur les groupes 
échangeables et les groupes décomposables, Bulletin de la Société Math. de France, vol. 28 
(1900), pp. 7-16. 













Proof. We write
M = [A', (A, B')], N = (A, B).
From Lemmas 1 and 2 it follows that M and N are permutable, and from Theorem
6 we obtain
[M, N] = [A', (A, B)],
(M, N) = (A, B, [A', (A, B')]) = [(A, B'), (A', B)].
The application of Theorem 2 gives us the correspondence


(14) \mathfrak{A} \leftrightarrow (A, B)/[(A, B'), (B, A')],


and the same correspondence is found for \mathfrak{B}.

Permutability of two groups is not a self-dual concept, since the structural 
relations (12) do not imply their duals. There exists, however, a theorem which 
may take the place of the dual of Theorem 7: 

THEOREM 8. Let A \supseteq A' and B \supseteq B' and let A and B both be permutable
with [A', B']. Then there exists a correspondence


(15) \mathfrak{A}_1 = (A, [B, A'])/(A, [B', A']) \leftrightarrow (B, [A, B'])/(B, [A', B']) = \mathfrak{B}_1.
Proof. In this case we write
M_1 = (A, [B, A']), N_1 = [A', B'],
and M_1 and N_1 are permutable according to Lemma 3. Theorem 6 gives
(M_1, N_1) = (A, [B', A']),
[M_1, N_1] = [A', B', (A, [B, A'])] = ([B, A'], [A, B']),
and from Theorem 2 one obtains
(16) \mathfrak{A}_1 \leftrightarrow ([B, A'], [A, B'])/[A', B'].


THEOREM 9. Let A \supseteq A' and B \supseteq B', where A' is permutable with B and B'
and B' is permutable with A and A'. Then the correspondences (13) and (15) hold,
and we have \mathfrak{A} = \mathfrak{A}_1 and \mathfrak{B} = \mathfrak{B}_1.

Proof. It follows from Lemmas 1, 2 and 3 that in this case the conditions for
Theorems 7 and 8 are satisfied. The equality of the quotients is obtained from
Theorem 6. From the correspondences (14) and (16) we deduce the following
self-dual relation:

THEOREM 10. Let A \supseteq A' and B \supseteq B', where A is permutable with B and B',
while B' is permutable with A and A'. Then


(17) (A, B)/(A, B, [A', B']) \leftrightarrow [A', B', (A, B)]/[A', B'].


One finds, namely, from Theorem 6,

[(A, B'), (B, A')] = (A, B, [A', B']),
([A, B'], [B, A']) = [A', B', (A, B)].


There are various other interesting problems connected with permutable
groups. One of the main problems is the determination of all permutable
groups. It may be formulated as follows. Let A and B be given groups. Find
all groups M = [\bar{A}, \bar{B}], where \bar{A} and \bar{B} are permutable groups isomorphic to A
and B respectively.

Another problem is the relation of the permutable groups to their structural 
properties. This problem may be formulated as a converse of Theorem 6: 
when do the relations (12) imply that A and B are permutable groups? 

We shall next determine when the correspondences we have derived may be 
replaced by strong structure isomorphisms. According to Theorem 5 we shall 
then have to consider groups which are permutable with all subgroups of other 
groups. For such groups we have 

Lemma 5. If B and C are permutable with all subgroups of A, then [B, C] has 
the same property. 

Lemma 6. If A \supseteq A', then A is permutable with all subgroups of A'.

Lemma 7. If A \supseteq A' and B is permutable with all subgroups of A', then (A, B)
has the same property.

Proof. Let a' be an element in A' and d an element in (A, B). Then d \cdot a' =
a'^n \cdot b, where b belongs both to A and B.

From these remarks we obtain 

THEOREM 11. Let A \supseteq A' and B \supseteq B', where A' and B' are permutable with
all subgroups of (A, B). Then
[A', (A, B)]/[A', (A, B')] \simeq [B', (A, B)]/[B', (B, A')].
Proof. In this case the conditions of Theorem 7 are satisfied. Furthermore,
M is permutable with all subgroups of N according to Lemmas 5 and 7. Hence
we conclude from Theorem 5 the strong structure isomorphism


\mathfrak{A} \simeq (A, B)/[(A, B'), (B, A')],


and similarly for \mathfrak{B}. Corresponding to Theorem 8 we have
THEOREM 12. Let A \supseteq A' and B \supseteq B', where [A', B'] is permutable with
every subgroup of A and B. Then


(A, [B, A'])/(A, [B', A']) \simeq (B, [A, B'])/(B, [A', B']).


Corresponding to Theorem 10 we have
THEOREM 13. Let A \supseteq A' and B \supseteq B', where A' and B' are permutable with
every subgroup of A and B. Then
(A, B)/(A, B, [A', B']) \simeq [A', B', (A, B)]/[A', B'].


Let us observe finally that if A and B are groups such that A is permutable
with every subgroup of B and B permutable with every subgroup of A, the relations
(12) obviously must hold. In this case it is seen, however, that also the
dual relations

(18) [C, (A, B)] = (A, [B, C]), C \subseteq A,
     [D, (A, B)] = (B, [A, D]), D \subseteq B

are fulfilled. Again the interesting problem arises whether the existence of the
relations (12) and (18) is sufficient to conclude that A is permutable with every
subgroup of B and B permutes with every subgroup of A.


3. Quasi-normal subgroups. We shall now introduce a new concept. 

A subgroup A of G is said to be quasi-normal when it is permutable with every 
subgroup of G. 

This condition may also be stated as follows. For each g in G and a in A
there exists an exponent n_{a,g} such that


(19) a \cdot g = g^{n_{a,g}} \cdot a'.
The quasi-normal subgroups obviously generalize the ordinary normal subgroups.
In that case we have n_{a,g} = 1 for all a and g. The condition (19)
may also be stated in the more symmetric form

a \cdot g^{n} = g^{m} \cdot a'.
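That quasi-normality is genuinely weaker than normality can be seen on a group of order 16. The example below is our own assumption and does not come from the text: the group generated by a and b with a^8 = b^2 = 1 and b a b^{-1} = a^5, whose elements a^i b^j are stored as pairs (i, j). The function names are hypothetical. The test relies on the remark that a subgroup permuting with every cyclic subgroup permutes with every subgroup.

```python
from itertools import product

# Assumed example group: pairs (i, j) stand for a^i * b^j, i mod 8, j mod 2.
G = list(product(range(8), range(2)))
E = (0, 0)

def mul(x, y):
    """(a^i b^j)(a^k b^l) = a^(i + k*5^j) b^(j + l), using b a b^-1 = a^5."""
    (i, j), (k, l) = x, y
    return ((i + k * 5 ** j) % 8, (j + l) % 2)

def cyclic(g):
    """Cyclic subgroup generated by g."""
    powers, x = {E}, g
    while x != E:
        powers.add(x)
        x = mul(x, g)
    return powers

def set_product(X, Y):
    return {mul(x, y) for x in X for y in Y}

def is_quasi_normal(H):
    """H permutes with every cyclic subgroup, hence with every subgroup."""
    return all(set_product(H, cyclic(g)) == set_product(cyclic(g), H) for g in G)

def is_normal(H):
    """g*H = H*g for every element g."""
    return all(set_product({g}, H) == set_product(H, {g}) for g in G)

H = cyclic((0, 1))      # the subgroup {1, b}
print(is_quasi_normal(H), is_normal(H))
```

Here the subgroup generated by b is permutable with every subgroup although it is not transformed into itself by a.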


The quasi-normal subgroups have several properties in common with ordinary 
normal subgroups. Let us mention first that from Theorem 6 follows 

THEOREM 14. If A is a quasi-normal subgroup and B and C arbitrary subgroups,
then


(B, [A, C]) = [C, (A, B)] when B \supseteq C,
(B, [A, C]) = [A, (B, C)] when B \supseteq A.
It is obvious that when A and B are quasi-normal in G, [A, B] has the same
property.

THEOREM 15. If A is quasi-normal in G and B is any subgroup of G, then
(A, B) is quasi-normal in B.

Proof. Let d be any element of (A, B) and b any element of B. Then d \cdot b =
b^n \cdot a, where a must belong both to A and B.

From Theorem 5 follows immediately
THEOREM 16. When A is quasi-normal in [A, B], then (A, B) is quasi-normal
in B, and we have the strong structure isomorphism
(20) [A, B]/A \simeq B/(A, B).


To prove the next theorem we shall need
Lemma 8. Let A be quasi-normal in G. When g is any element in G and a_0
some given element in A, we can always write


g^n = (g a_0)^m \cdot a.

Proof. We write g = g a_0 \cdot a_0^{-1} and apply (19) to the various factors in the
product g^n.

THEOREM 17. In the structure isomorphism (20) any quasi-normal group \bar{A} in
[A, B] containing A corresponds to a quasi-normal group \bar{B} in B containing (A, B),
and conversely.

Proof. The first part of the theorem follows from Theorem 15, since \bar{B} =
(\bar{A}, B). To prove the converse we consider the relation


(21) \bar{b} \cdot b = b^n \cdot \bar{b}_1,


which holds for all elements in \bar{B} and B. The elements of [A, B] have the form
g = b \cdot a, while the elements of \bar{A} are of the form a \cdot \bar{b}. By means of Lemma 8
we obtain from (21) \bar{b} \cdot g = g^m \cdot \bar{d}_1, and when this relation is multiplied on the
left by an element in A we obtain \bar{d} \cdot g = g^m \cdot \bar{d}_2 for arbitrary \bar{d} in \bar{A}.

Theorem 17 gives us the ordinary lemma that if A and B are maximal quasi- 
normal groups in G, then (A, B) is maximal quasi-normal in A and B. 

THEOREM 18. When A' is quasi-normal in A and B' quasi-normal in B, then
[A', (A, B')] is quasi-normal in [A', (A, B)] and similarly [B', (A', B)] is quasi-normal
in [B', (A, B)], and there exists the strong structure isomorphism


(22) [A', (A, B)]/[A', (A, B')] \simeq [B', (A, B)]/[B', (B, A')].


Proof. The isomorphism (22) is a consequence of Theorem 11. Let l = d \cdot a'
be an element of [(A, B), A'] and l' = d' \cdot a'_1 be an element in [(A, B'), A'].
Since (A, B') is quasi-normal in (A, B), we find


l' \cdot l = d' \cdot a'_1 \cdot d \cdot a' = d' \cdot d^n \cdot a'_2 = d^m \cdot l'_1.


According to Lemma 8 we can write d^m = l^k \cdot a'_3, so that we finally obtain
l' \cdot l = l^k \cdot l'_2.

Theorem 18 represents the complete analogue of the lemma of Zassenhaus for
ordinary normal subgroups.

Let A' be normal in A and B' normal in B. Then [A', (A, B')] is normal in
[A', (A, B)] and [B', (A', B)] is normal in [B', (A, B)], and there exists the ordinary
isomorphism between the quotient groups


[A', (A, B)]/[A', (A, B')] \cong [B', (A, B)]/[B', (A', B)].


4. Quasi-normal subgroups of the symmetric and alternating groups. In 
connection with the quasi-normal groups one may construct groups which in 
many ways are analogous to those ordinarily defined in connection with normal 
subgroups. Among these we shall mention the quasi-center. Let c be an element
of G with the property that the cyclic group it generates is permutable
with all subgroups of G. This is equivalent to saying that for any element
g of G and any pair of exponents n and m we have c^n \cdot g^m = g^{m'} \cdot c^{n'}. The group
generated by these elements c we shall call the quasi-center of G. The quasi-center
contains the ordinary center and also the nucleus introduced by Baer.⁹
From the definition of the quasi-center follows

THEOREM 19. The quasi-center is a characteristic subgroup. 

A simple consideration shows that the symmetric group \Sigma_n has a quasi-center
C_n equal to the unit element except in the case n = 2, where \Sigma_2 = C_2, and n = 3,
where C_3 = A_3 is the alternating group. The quasi-center of the alternating
group is the unit element except for n = 3.

It may be of some interest to determine the quasi-normal subgroups of the
symmetric group \Sigma_n and the alternating group A_n. The case of the symmetric
group may be solved directly by the following

Lemma 9. Let H be quasi-normal in G. If h is an element of H and t an element
of order 2 in G, then H contains the transform t h t^{-1} and the commutator
(t, h) = t h t^{-1} h^{-1}.

Proof. Since we can suppose that t does not belong to H, we have t h = h' \cdot t.
Since every substitution is the product of transpositions this implies

THEOREM 20. Any quasi-normal subgroup of the symmetric group is normal. 
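Theorem 20 can be checked directly for a small symmetric group. The following sketch assumes, for illustration only, the symmetric group on four letters; it lists all subgroups from pairs of generators and compares the quasi-normal subgroups with the normal ones. The function names are ours.

```python
from itertools import permutations

N = 4
ELEMENTS = list(permutations(range(N)))
IDENTITY = tuple(range(N))

def mul(p, q):
    """Product p*q of permutations stored as tuples: apply q first, then p."""
    return tuple(p[q[i]] for i in range(N))

def generate(gens):
    """Subgroup generated by the given permutations."""
    group = {IDENTITY}
    frontier = [IDENTITY]
    while frontier:
        x = frontier.pop()
        for g in gens:
            y = mul(x, g)
            if y not in group:
                group.add(y)
                frontier.append(y)
    return frozenset(group)

# Every subgroup of this group needs at most two generators.
SUBGROUPS = {generate((g, h)) for g in ELEMENTS for h in ELEMENTS}

def set_product(X, Y):
    return {mul(x, y) for x in X for y in Y}

def is_quasi_normal(H):
    """H is permutable with every subgroup."""
    return all(set_product(H, K) == set_product(K, H) for K in SUBGROUPS)

def is_normal(H):
    return all(set_product({g}, H) == set_product(H, {g}) for g in ELEMENTS)

quasi_normal = {H for H in SUBGROUPS if is_quasi_normal(H)}
normal = {H for H in SUBGROUPS if is_normal(H)}
print(sorted(len(H) for H in quasi_normal), quasi_normal == normal)
```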

The proof of the theorem that the alternating group contains only the trivial
quasi-normal groups A_n and E is necessarily more complicated, since it implies
the fact that A_n is simple.

We first show

Lemma 10. If a quasi-normal subgroup H of A_n contains a cycle with three
elements, then H = A_n.

Proof. We may suppose n \geq 4. If H contains the cycle h = (1, 2, 3), according
to Lemma 9 it contains


a h a^{-1} = (\alpha, 3, 2), a = (1, \alpha)(2, 3), \alpha > 3,


and hence H contains all cycles of order 3 with two fixed elements. We shall
also need

Lemma 11. Let n > 4 and let H be a quasi-normal subgroup of A_n. If H
contains a substitution which is the product of two transpositions without common
elements, then H = A_n.

Proof. Let h = (1, 2)(3, 4) belong to H. The commutator


(a, h) = (3, 5, 4), a = (1, 2)(3, 5)


then also belongs to H according to Lemma 9.

We shall now proceed to the actual determination of the quasi-normal subgroups
of A_n. We suppose n > 4 and write the substitutions of H as the
product of cycles without common letters. We consider the cases

I. H contains a substitution h = (1, 2, \dots, \alpha) with at least one cycle with
\alpha > 4 letters. We put a = (1, 3)(2, 4) and find the commutator (a, h) =
(1, 3, 5).


* R. Baer, Der Kern, eine charakteristische Untergruppe, Compositio Mathematica, vol. 1 
(1934), pp. 254-283. 













II. H contains a substitution h = (1, 2, 3, 4) \cdots with at least one cycle of
order 4. Then we put a = (1, 4)(2, 3) and find (a, h) = (1, 3)(2, 4).

III. H contains substitutions h with a cycle of order 3. If h contains only
one such cycle while the other cycles are transpositions, then h^2 is a cycle of
order 3. Hence we may suppose h = (1, 2, 3)(4, 5, 6) \cdots and for a = (1, 4)(2, 5)
we find (a, h) = (1, 4)(3, 6).

IV. All substitutions in H are the product of transpositions. According to
Lemma 11 we can suppose that h contains at least three transpositions h =
(1, 2)(3, 4)(5, 6) \cdots and for a = (1, 3)(2, 5) we find (a, h) = (1, 6, 3)(2, 4, 5),
against assumption.
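The commutators used in Lemma 11 and in the four cases are easy to recompute. The sketch below is illustrative only: products are read from right to left, the commutator is taken as (a, h) = a h a^{-1} h^{-1}, and the helper names are ours. For case IV the element a = (1, 3)(2, 5) is an assumption on our part; with this choice the computation yields a product of two cycles of order 3 beginning (1, 6, 3)(2, 4, ...).

```python
def perm(cycles, n):
    """Permutation of the letters 1..n (stored 0-based) given by disjoint cycles."""
    p = list(range(n))
    for cyc in cycles:
        for x, y in zip(cyc, cyc[1:] + cyc[:1]):
            p[x - 1] = y - 1
    return tuple(p)

def mul(p, q):
    """Product p*q: apply q first, then p."""
    return tuple(p[q[i]] for i in range(len(q)))

def inv(p):
    r = [0] * len(p)
    for i, image in enumerate(p):
        r[image] = i
    return tuple(r)

def commutator(a, h):
    """(a, h) = a h a^-1 h^-1."""
    return mul(mul(a, h), mul(inv(a), inv(h)))

def cycles_of(p):
    """Nontrivial cycles of p, each starting from its smallest letter."""
    seen, out = set(), []
    for start in range(len(p)):
        if start in seen or p[start] == start:
            continue
        cyc, x = [], start
        while x not in seen:
            seen.add(x)
            cyc.append(x + 1)
            x = p[x]
        out.append(tuple(cyc))
    return out

cases = {
    "Lemma 11": (perm([(1, 2), (3, 5)], 5), perm([(1, 2), (3, 4)], 5)),
    "I": (perm([(1, 3), (2, 4)], 5), perm([(1, 2, 3, 4, 5)], 5)),
    "II": (perm([(1, 4), (2, 3)], 4), perm([(1, 2, 3, 4)], 4)),
    "III": (perm([(1, 4), (2, 5)], 6), perm([(1, 2, 3), (4, 5, 6)], 6)),
    "IV": (perm([(1, 3), (2, 5)], 6), perm([(1, 2), (3, 4), (5, 6)], 6)),
}
results = {name: cycles_of(commutator(a, h)) for name, (a, h) in cases.items()}
for name, cycles in results.items():
    print(name, cycles)
```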

THEOREM 21. The alternating group contains no quasi-normal subgroup for
n \neq 4.

For n = 4 we have the well-known exception. 




Chapter 3. Extensions of the theorem of Jordan-Hölder


1. Refinements of chains. One of the main applications of the preceding
theory is to obtain extensions of the theorem of Jordan-Hölder and its generalization
by Schreier-Zassenhaus.¹⁰

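In the following we shall consider two fixed groups A \supset B in the given group
G. Let


(1) A = B_0 \supset B_1 \supset \cdots \supset B_r = B,
(2) A = C_0 \supset C_1 \supset \cdots \supset C_s = B


be two chains of arbitrary subgroups between A and B. We shall denote these
chains by \{B_i\} and \{C_j\} respectively. The existence of the chains (1) and (2)
may also be considered as a factorization of the corresponding quotient \mathfrak{A} =
A/B in the sense defined in Chapter 1,


(3) \mathfrak{A} = \mathfrak{B}_1 \times \mathfrak{B}_2 \times \cdots \times \mathfrak{B}_r = \mathfrak{C}_1 \times \mathfrak{C}_2 \times \cdots \times \mathfrak{C}_s,
where
(4) \mathfrak{B}_i = B_{i-1}/B_i, \mathfrak{C}_j = C_{j-1}/C_j.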

Any new chains obtained from (1) and (2) by intercalating new terms will
be called a refinement of the given chains. The two refinements consisting
both of r \cdot s subgroups
(5) B_{i,j} = [B_i, (C_j, B_{i-1})], C_{j,i} = [C_j, (B_i, C_{j-1})]
we shall call the (lower) refinement of \{B_i\} with respect to \{C_j\} and of \{C_j\} with
respect to \{B_i\}, respectively. It may also be considered as a further factorization
of (3):


10 O. Schreier, Über den Jordan-Hölderschen Satz, Abh. Math. Sem. Hamburg, vol. 6
(1928), pp. 300-302; H. Zassenhaus, Zum Satz von Jordan-Hölder-Schreier, Abh. Math. Sem.
Hamburg, vol. 10 (1934), pp. 106-108.













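(6) \mathfrak{B}_i = \mathfrak{B}_{i,1} \times \cdots \times \mathfrak{B}_{i,s}, \mathfrak{C}_j = \mathfrak{C}_{j,1} \times \cdots \times \mathfrak{C}_{j,r},
where
(7) \mathfrak{B}_{i,j} = [B_i, (C_{j-1}, B_{i-1})]/[B_i, (C_j, B_{i-1})],
    \mathfrak{C}_{j,i} = [C_j, (C_{j-1}, B_{i-1})]/[C_j, (C_{j-1}, B_i)].
There also exists a dual refinement to (5):
(8) \bar{B}_{i,j} = (B_{i-1}, [B_i, C_j]), \bar{C}_{j,i} = (C_{j-1}, [B_i, C_j]),
which we shall call the upper refinement of \{B_i\} with respect to \{C_j\} and of
\{C_j\} with respect to \{B_i\}.
Obviously we always have


\bar{B}_{i,j} \supseteq B_{i,j}, \bar{C}_{j,i} \supseteq C_{j,i}.
Corresponding to (6) we have the factorizations
(9) \mathfrak{B}_i = \bar{\mathfrak{B}}_{i,1} \times \cdots \times \bar{\mathfrak{B}}_{i,s}, \mathfrak{C}_j = \bar{\mathfrak{C}}_{j,1} \times \cdots \times \bar{\mathfrak{C}}_{j,r},
where


(10) \bar{\mathfrak{B}}_{i,j} = (B_{i-1}, [B_i, C_{j-1}])/(B_{i-1}, [B_i, C_j]),
     \bar{\mathfrak{C}}_{j,i} = (C_{j-1}, [C_j, B_{i-1}])/(C_{j-1}, [C_j, B_i]).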

2. Cross-cut permutable chains. We shall now say that two chains (1) and
(2) are cross-cut permutable if they have the following property:

For each i the group B_i shall be permutable with all groups


(11) (B_{i-1}, C_j) (j = 0, 1, \dots, s),
and for each j the group C_j shall be permutable with
(12) (C_{j-1}, B_i) (i = 0, 1, \dots, r).


Let us observe that cross-cut permutability refers only to certain properties
of a group B_i with respect to certain subgroups of the preceding B_{i-1} and
similarly for C_j with respect to C_{j-1}.

The main theorem on cross-cut permutable chains will now be proved. 

THEOREM 1. Let \{B_i\} and \{C_j\} be two arbitrary cross-cut permutable chains
connecting two groups A and B. These chains can then be refined into new cross-cut
permutable chains with quotients corresponding in such a manner that for corresponding
quotients there is a one-to-one correspondence of their co-sets.

Proof. We refine the two chains by the lower refinements (5). Both the
refined chains contain r \cdot s terms, and it follows from Theorem 7, Chapter 2,
that \mathfrak{B}_{i,j} \leftrightarrow \mathfrak{C}_{j,i}.
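The lower refinement and the correspondence of its factors can be followed numerically in a cyclic group, where all subgroups are permutable. The sketch below is an illustration under our own assumptions: a subgroup dZ_n of Z_n is represented by the divisor d, the union of two subgroups corresponds to the greatest common divisor and the cross-cut to the least common multiple, and the index of eZ in dZ is e/d. The function name `lower_refinement_indices` is hypothetical. It returns the index of each factor of the refined chain so that the factor with subscripts (i, j) of one chain can be compared with the factor (j, i) of the other.

```python
from math import gcd

def lcm(x, y):
    return x * y // gcd(x, y)

def lower_refinement_indices(b, c):
    """Indices of the factors B_{i,j-1}/B_{i,j} of the lower refinement.

    b and c list the divisors d of n describing chains of subgroups dZ_n,
    starting with 1 (the whole group) and ending with n (the zero subgroup).
    """
    r, s = len(b) - 1, len(c) - 1

    def term(i, j):
        # B_{i,j} = [B_i, (C_j, B_{i-1})]: union is gcd, cross-cut is lcm.
        return gcd(b[i], lcm(c[j], b[i - 1]))

    return {(i, j): term(i, j) // term(i, j - 1)
            for i in range(1, r + 1) for j in range(1, s + 1)}

# Two chains in Z_12: Z_12 > 2Z_12 > 0 and Z_12 > 3Z_12 > 0.
b = [1, 2, 12]
c = [1, 3, 12]
fb = lower_refinement_indices(b, c)
fc = lower_refinement_indices(c, b)
for (i, j), index in sorted(fb.items()):
    print((i, j), index, fc[(j, i)])
```

Each printed line shows the index of a factor of the first refined chain next to the index of the corresponding factor of the second.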

The proof of the fact that the new chains are cross-cut permutable is more 
complicated. It may be done directly by calculating the groups corresponding 
to (11) for the new chains. We shall, however, apply another method which 
yields more information about cross-cut permutable chains. We first deduce 
































Lemma 1. The chains \{B_{i,j}\} and \{C_k\} are cross-cut permutable.
Proof. We shall have to show that B_{i,j} is permutable with every (B_{i,j-1}, C_k).
Since B_{i-1} contains B_{i,j-1} we can write


(13) (B_{i,j-1}, C_k) = D_{i,j,k} = (C_k, B_{i-1}, [B_i, (C_{j-1}, B_{i-1})]).

If now k \geq j, then (C_k, B_{i-1}) is contained in (C_{j-1}, B_{i-1}), and we find

(14) D_{i,j,k} = (C_k, B_{i-1}) (k \geq j).
When k \leq j - 1, we apply Theorem 6, Chapter 2, to (13) and obtain

(15) D_{i,j,k} = [(C_{j-1}, B_{i-1}), (B_i, C_k)] (k \leq j - 1).


To show that B_{i,j} is permutable with D_{i,j,k}, it is sufficient to show according
to Lemma 1, Chapter 2, that B_i and (B_{i-1}, C_j) separately are permutable with D_{i,j,k}.
It is obvious from the given conditions that B_i is permutable both with (14)
and (15). Furthermore, (B_{i-1}, C_j) is permutable with (14) according to Lemma
2 and with (15) according to Lemmas 2 and 3, Chapter 2.

Finally one proves in a similar manner that C_k is permutable with the cross-cuts
D'_{i,j,k} = (B_{i,j}, C_{k-1}).

Lemma 2. The chain \{B_{i,j}\} is unchanged when refined with respect to \{C_k\}.

Proof. The terms of the second refinement are B_{i,j,k} = [B_{i,j}, (C_k, B_{i,j-1})].
When the expressions (14) and (15) are substituted here, one obtains respectively

B_{i,j,k} = [B_i, (B_{i-1}, C_j), (B_{i-1}, C_k)] = [B_i, (B_{i-1}, C_j)] = B_{i,j},
B_{i,j,k} = [B_i, (B_{i-1}, C_j), (C_{j-1}, B_{i-1}), (B_i, C_k)] = B_{i,j-1}.


Lemma 3. The refinement of \{C_k\} with respect to \{B_i\} is equal to its refinement
with respect to \{B_{i,j}\}.
Proof. We find C_{k,i,j} = [C_k, (C_{k-1}, B_{i,j})], and again the substitution of (14)
and (15) gives
C_{k,i,j} = C_{k,i-1} (k \geq j),
C_{k,i,j} = C_{k,i} (k \leq j - 1).


From this lemma, together with Lemma 1, it follows that the chains \{B_{i,j}\} and
\{C_{k,i}\} are cross-cut permutable and the proof of Theorem 1 is completed.

The preceding results also show that repeated refinements of cross-cut per- 
mutable chains give no new chains. 

Theorem 1 is obviously a generalization of the theorem of Schreier-Zassenhaus.
Through specialization one can also obtain extensions of the theorem of
Jordan-Hölder. We shall say that $\{B_i\}$ and $\{C_j\}$ are maximal cross-cut
permutable chains when there exists no group between $B_{i-1}$ and $B_i$ which is
permutable with all cross-cuts (11), and similarly for $C_{j-1}$ and $C_j$. We then have

THEOREM 2. If there exist two maximal cross-cut permutable chains between
the two groups A and B, both chains have the same number of terms and the quotients 











168 OYSTEIN ORE 


of the chains correspond in such a manner that for corresponding quotients there is a
one-to-one correspondence between their co-sets. 

When the index of B in A is finite, Theorem 2 shows that the indices of the 
two maximal cross-cut permutable chains are the same in some order. 
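
As a small illustration of the statement about indices, the following sketch treats only the special case of a finite cyclic group $Z_n$, where every subgroup is normal, so all chains are permutable. That restriction, the representation of a subgroup by its order, and the function names are assumptions made for the example; it is not the general argument of the text.

```python
from collections import Counter

def prime_factors(n):
    """Prime factors of n with multiplicity, in increasing order."""
    factors, p = [], 2
    while p * p <= n:
        while n % p == 0:
            factors.append(p)
            n //= p
        p += 1
    if n > 1:
        factors.append(n)
    return factors

def maximal_chains(n):
    """Yield every maximal chain of subgroups of Z_n, from Z_n down to 0.

    The subgroup of order d is represented by d itself, so a chain is a list
    of orders n = d_0 > d_1 > ... > d_m = 1 in which each d_{i-1}/d_i is prime.
    """
    def extend(chain):
        d = chain[-1]
        if d == 1:
            yield chain
            return
        for p in sorted(set(prime_factors(d))):
            yield from extend(chain + [d // p])
    yield from extend([n])

def indices(chain):
    """Multiset of the indices of consecutive terms of a chain."""
    return Counter(a // b for a, b in zip(chain, chain[1:]))

chains = list(maximal_chains(360))
# All maximal chains have the same length and the same indices in some order.
assert all(len(c) == len(chains[0]) for c in chains)
assert all(indices(c) == indices(chains[0]) for c in chains)
```

Comparing multisets with `Counter` is what expresses "the same in some order" here.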

There exists for this theory a dual theory which one obtains by considering
the upper refinements (8). We shall say that two chains $\{B_i\}$ and $\{C_j\}$ are
union permutable when they have the following property:

For all $i$ the group $B_{i-1}$ is permutable with

$[B_i, C_j]$  $(j = 0, 1, \cdots, s)$,

and for all $j$ the group $C_{j-1}$ is permutable with

$[C_j, B_i]$  $(i = 0, 1, \cdots, r)$.

For such union permutable chains all the theorems derived for cross-cut
permutable chains will hold when the lower refinements $\{B_{i,j}\}$ and $\{C_{j,i}\}$ are
replaced by the corresponding upper refinements (8). In Theorem 2 we must
replace maximal chains by minimal chains.


3. Permutable chains. We have already observed that Theorems 1 and 2
correspond to the theorems of Schreier-Zassenhaus and Jordan-Hölder. More
specifically they correspond to the case of composition series where one considers
chains in which every term is a normal subgroup of the preceding, because
cross-cut permutability refers only to properties of a group $B_i$ with respect to
certain subgroups of the preceding $B_{i-1}$. Let us now show how one can obtain
analogues of the theorems on principal series where one deals with normal
subgroups of the full group $A$.

We shall say that the two chains $\{B_i\}$ and $\{C_j\}$ are permutable if every $B_i$
is permutable with every $C_j$. We can then prove

THEOREM 3. Let $\{B_i\}$ and $\{C_j\}$ be two permutable chains connecting $A$ and $B$.
These chains may be refined into new permutable chains with their quotients
corresponding in such a manner that for corresponding quotients there is a one-to-one
correspondence between their co-sets.

Proof. It follows from Theorem 9, Chapter 2, that in this case the upper 
and the lower refinements (5) and (8) are identical. Furthermore, we have 
Bi; = €;,. The fact that the refined chains are again permutable may be 
derived from Lemmas 1 to 4 in Chapter 2. 

Through specialization of Theorem 3 we can again obtain an analogue to the
Jordan-Hölder theorem. We shall say that two permutable chains are maximal
when no further term may be intercalated such that the resulting chains are
still permutable.

THEOREM 4. Any two maximal permutable chains between two groups $A \supset B$
will have the same number of terms and their quotients correspond in such a manner
that for corresponding quotients there exists a one-to-one correspondence between
their co-sets.







For finite groups this theorem, in the slightly weaker form when the indices
of the two chains are the same in some order, is due to Maillet.¹¹


4. Quasi-normal chains. The next natural step is to seek structure
isomorphism for the corresponding quotients in the two chains. We shall say that
a chain is quasi-normal when each term is quasi-normal in the preceding. The
main theorem is then

THEOREM 5. Let $\{B_i\}$ and $\{C_j\}$ be two quasi-normal chains connecting $A$ and
$B$. These chains may be refined into new quasi-normal chains in such a way that
there exists a correspondence between their quotients, with corresponding quotients
having strongly structure isomorphic quotient systems.

Proof. We apply the lower refinements (5) and all statements of Theorem 5 
are consequences of Theorem 18, Chapter 2. 

Theorem 5 is a complete analogue of the theorem of Schreier-Zassenhaus. 
The analogue of the theorem of Jordan-Hölder may be expressed as follows:

THEOREM 6. If there exist two maximal quasi-normal chains between A and B, 
both contain the same number of terms and the quotient systems are strongly structure 
isomorphic in some order. 

A maximal quasi-normal chain is of course a quasi-normal chain in which 
no further quasi-normal terms may be intercalated. 

One might finally want to obtain an analogue to the theorem on principal 
chains by considering chains in which every term is quasi-normal in the full 
group $A$. This is, however, the only point at which our theory differs from the
ordinary normal theory. In this case we cannot prove that the refined chains
are again formed by quasi-normal groups in $A$. This is due to the fact that the
quasi-normal groups in $A$ do not form a structure, since the cross-cut of two
quasi-normal groups need not be quasi-normal in $A$.


Chapter 4. Normal subgroups 


1. Similarity. Let us now turn to the properties of normal subgroups. The 
normal subgroups of a group G obviously form a structure. Since any two 
normal subgroups are permutable we also have: 

Any three normal subgroups $A \supset B$ and $C$ satisfy the Dedekind axiom


(1) $(A, [B, C]) = [B, (A, C)]$.
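
The axiom (1) can be checked by brute force in a small case. The sketch below is an illustration under stated assumptions, not part of the theory above: it uses the cyclic group $Z_n$, in which the subgroup generated by a divisor $d$ of $n$ is represented by $d$, the cross-cut corresponds to the least common multiple, the union to the greatest common divisor, and $A \supset B$ means that $a$ divides $b$. The function names are hypothetical.

```python
from math import gcd

def lcm(a, b):
    return a * b // gcd(a, b)

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def cross_cut(a, b):
    """(A, B): the intersection of the subgroups generated by a and b in Z_n."""
    return lcm(a, b)

def union(a, b):
    """[A, B]: the subgroup generated by both a and b in Z_n."""
    return gcd(a, b)

def dedekind_axiom_holds(n):
    """Check (A, [B, C]) = [B, (A, C)] for all subgroups of Z_n with A containing B."""
    ds = divisors(n)
    for a in ds:
        for b in ds:
            if b % a != 0:          # A contains B exactly when a divides b
                continue
            for c in ds:
                if cross_cut(a, union(b, c)) != union(b, cross_cut(a, c)):
                    return False
    return True

assert dedekind_axiom_holds(360)
```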


A structure satisfying the Dedekind axiom I have called a Dedekind structure.¹²
The general theory of Dedekind structures may now be applied to the normal 
subgroups. Since the Dedekind axiom (1) is found to be a self-dual condition, 


11 E. Maillet, Sur de nouvelles analogies entre la théorie des groupes de substitutions et celles
des groupes finis continus de transformations de Lie, première note: Sur des suites
remarquables de sous-groupes d'un groupe de substitutions, Journal de Math., (5), vol. 7 (1901),
pp. 13-82.

12 Foundations I, Chapter 1.













our principle of duality holds to its full extent also for normal subgroups. One
may generalize the theory of normal subgroups in the ordinary way by considering
groups with operators. An operator $T$ in $G$ has the property of making
correspond to each element $a$ an element $a^T$ such that

$(a \cdot b)^T = a^T \cdot b^T$.

A subgroup $A$ is invariant with respect to $T$ if

$A^T \subseteq A$.

All groups which are invariant under a certain system of operators

$\mathfrak{T} = T_1, T_2, T_3, \cdots$

also form a structure, and this structure is a Dedekind structure in case the
operators include the set of all inner automorphisms of $G$.
In the structure of normal subgroups we introduce quotients and these are
as usual associated with a quotient group. Such quotients can be transformed
according to the rules given in Chapter 1. Let

(2) $\mathfrak{A} = A/B, \quad \mathfrak{C} = C/B$

be two quotients with the same denominator $B$, whereby we assume as always
in the following that the subgroups are considered as normal in $G$. We suppose
that $\mathfrak{A}$ and $\mathfrak{C}$ are relatively prime, i.e., $B = (A, C)$. According to the ordinary
law of isomorphism the right-hand similarity transformation

(3) $\mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1} = [A, C]/C$

then gives a new quotient group which is isomorphic to $\mathfrak{A}$ in the usual sense.
In the more general case where $\mathfrak{A}$ and $\mathfrak{C}$ have a greatest common factor $\mathfrak{D}$,

$\mathfrak{A} = \mathfrak{A}_1 \times \mathfrak{D}, \quad \mathfrak{C} = \mathfrak{C}_1 \times \mathfrak{D}$,

we have according to Lemma 4, Chapter 1,

$\mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1} = \mathfrak{C}_1\mathfrak{A}_1\mathfrak{C}_1^{-1}$,

so that the transform is isomorphic to the left-hand factor $\mathfrak{A}_1$ of $\mathfrak{A}$.

In the same manner, if

(4) $\mathfrak{A} = A/B, \quad \mathfrak{D} = A/D, \quad A = [B, D]$

are two left-hand relatively prime quotients with the same numerator, the
left-hand similarity transformation

(5) $\mathfrak{D}^{-1}\mathfrak{A}\mathfrak{D} = D/(B, D)$

gives a quotient group isomorphic with $\mathfrak{A}$. Again if $\mathfrak{A}$ and $\mathfrak{D}$ in (4) have a
common l.h. factor, the transform (5) is isomorphic to a r.h. factor of $\mathfrak{A}$.


Two quotients $\mathfrak{A}$ and $\mathfrak{B}$ shall be said to be similar if one can be obtained from
the other through a series of r.h. and l.h. similarity transformations. Similar




quotients are isomorphic. Similarity is a special form of isomorphism and it
follows from the general theory of Dedekind structures that it is the form of
isomorphism which occurs in the formulation of the decomposition theorems for
algebraic systems.

We shall say that when $\mathfrak{A}$ and $\mathfrak{C}$ are r.h. relatively prime quotients with the
same denominator defined by (2), then

(6) $\mathfrak{M} = [\mathfrak{A}, \mathfrak{C}]$

is the direct union of $\mathfrak{A}$ and $\mathfrak{C}$. This corresponds of course to the ordinary
direct product. In the same manner one defines the left-hand direct union.
It follows from Theorem 6, Chapter 1, that to any r.h. direct union (6)
corresponds a l.h. direct union $\mathfrak{M} = [\mathfrak{A}', \mathfrak{C}']_l$, where $\mathfrak{A}'$ and $\mathfrak{C}'$ are similar to $\mathfrak{A}$ and
$\mathfrak{C}$. This result may be extended to an arbitrary number of quotients.


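2. Direct similarity. We shall now study a special type of similarity. Let
$\mathfrak{A}$ and $\mathfrak{B}$ be quotients with the same denominator. If there exists a third
quotient $\mathfrak{C}$ with the same denominator and relatively prime to both $\mathfrak{A}$ and $\mathfrak{B}$ such
that

(7) $\mathfrak{M} = [\mathfrak{A}, \mathfrak{C}] = [\mathfrak{B}, \mathfrak{C}], \quad (\mathfrak{A}, \mathfrak{C}) = (\mathfrak{B}, \mathfrak{C}) = \mathfrak{E}$,

we shall say that $\mathfrak{A}$ and $\mathfrak{B}$ are directly similar in $\mathfrak{M}$. The relation (7) may also
be written

$\mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1} = \mathfrak{C}\mathfrak{B}\mathfrak{C}^{-1}$.

When $\mathfrak{A}$ and $\mathfrak{B}$ are directly similar in some $\mathfrak{M}$, they are also directly similar in
$[\mathfrak{A}, \mathfrak{B}]$. One obtains therefore by taking the cross-cut of (7) with $[\mathfrak{A}, \mathfrak{B}]$

(8) $[\mathfrak{A}, \mathfrak{C}'] = [\mathfrak{B}, \mathfrak{C}'] = [\mathfrak{A}, \mathfrak{B}], \quad (\mathfrak{A}, \mathfrak{C}') = (\mathfrak{B}, \mathfrak{C}') = \mathfrak{E}$,

where, according to the Dedekind axiom,

$\mathfrak{C}' = (\mathfrak{C}, [\mathfrak{A}, \mathfrak{B}])$.

The isomorphism between $\mathfrak{A}$ and $\mathfrak{B}$ defined by (8) is such that to any $a$ in $\mathfrak{A}$
there corresponds a unique $b$ in $\mathfrak{B}$ with the property that

(9) $a \cdot c_1 = b \cdot c_2$,

where $c_1$ and $c_2$ belong to $\mathfrak{C}'$. The direct decompositions (8) show, however,
that the elements of $\mathfrak{C}'$ are permutable with the elements of both $\mathfrak{A}$ and $\mathfrak{B}$; hence
$\mathfrak{C}'$ belongs to the center of $[\mathfrak{A}, \mathfrak{B}]$. The correspondence (9) may then be written
$a = b \cdot c$ where $c$ belongs to the center, i.e., we have a central isomorphism between
$\mathfrak{A}$ and $\mathfrak{B}$ in the ordinary sense in which this term is used in group theory.

THEOREM 1. If two quotients $\mathfrak{A}$ and $\mathfrak{B}$ are directly similar, they are centrally
isomorphic in the union $[\mathfrak{A}, \mathfrak{B}]$.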

The following theorem is also a simple consequence of the preceding remarks: 













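THEOREM 2. Let $\mathfrak{A}$, $\mathfrak{B}$ and $\mathfrak{C}$ be quotient groups with the same denominator.
The relations

(10) $\mathfrak{M} = [\mathfrak{A}, \mathfrak{B}] = [\mathfrak{A}, \mathfrak{C}] = [\mathfrak{B}, \mathfrak{C}], \quad (\mathfrak{A}, \mathfrak{C}) = (\mathfrak{B}, \mathfrak{C}) = \mathfrak{E}$

imply that $\mathfrak{C}$ is an Abelian group belonging to the center of $\mathfrak{M}$. The relations

(11) $\mathfrak{M} = [\mathfrak{A}, \mathfrak{C}] = [\mathfrak{B}, \mathfrak{C}], \quad (\mathfrak{A}, \mathfrak{B}) = (\mathfrak{B}, \mathfrak{C}) = (\mathfrak{A}, \mathfrak{C}) = \mathfrak{E}$

imply that $\mathfrak{A}$ and $\mathfrak{B}$ are Abelian groups. Finally, the self-dual set of relations

(12) $\mathfrak{M} = [\mathfrak{A}, \mathfrak{B}] = [\mathfrak{A}, \mathfrak{C}] = [\mathfrak{B}, \mathfrak{C}], \quad (\mathfrak{A}, \mathfrak{B}) = (\mathfrak{A}, \mathfrak{C}) = (\mathfrak{B}, \mathfrak{C}) = \mathfrak{E}$

implies that $\mathfrak{M}$ is an Abelian group.

Let us also observe that any relation (7) implies that $\mathfrak{A}$ and $\mathfrak{B}$ have Abelian
left-hand factors which are directly similar. This follows from (7) by division
with $(\mathfrak{A}, \mathfrak{B})$ and application of (12).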

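The dual of Theorem 1 is of considerable interest. Let $\mathfrak{A}_1$, $\mathfrak{B}_1$ and $\mathfrak{C}_1$ be
quotients with the same numerator such that

(13) $\mathfrak{M} = [\mathfrak{A}_1, \mathfrak{C}_1]_l = [\mathfrak{B}_1, \mathfrak{C}_1]_l, \quad (\mathfrak{A}_1, \mathfrak{C}_1)_l = (\mathfrak{B}_1, \mathfrak{C}_1)_l = \mathfrak{E}$.

We shall then say that $\mathfrak{A}_1$ and $\mathfrak{B}_1$ are left-hand directly similar. Again it is no
limitation to assume that in addition to (13) also the relation

(14) $\mathfrak{M} = [\mathfrak{A}_1, \mathfrak{B}_1]_l$

holds. In order to analyze the content of these relations, we write

$\mathfrak{M} = M/D, \quad \mathfrak{A}_1 = M/A, \quad \mathfrak{B}_1 = M/B, \quad \mathfrak{C}_1 = M/C$,

and find that the conditions (13) and (14) are equivalent to

$M = [A, C] = [B, C], \quad (A, B) = (A, C) = (B, C) = D$.

These relations can, however, be written as

(15) $[\mathfrak{A}, \mathfrak{C}] = [\mathfrak{B}, \mathfrak{C}], \quad (\mathfrak{A}, \mathfrak{B}) = (\mathfrak{A}, \mathfrak{C}) = (\mathfrak{B}, \mathfrak{C}) = \mathfrak{E}$,

where we have introduced the quotients

$\mathfrak{A} = A/D, \quad \mathfrak{B} = B/D, \quad \mathfrak{C} = C/D$.

According to Theorem 2 the relations (15) imply that $\mathfrak{A}$ and $\mathfrak{B}$ are Abelian
groups. But since

$\mathfrak{C}\mathfrak{A}\mathfrak{C}^{-1} = \mathfrak{C}\mathfrak{B}\mathfrak{C}^{-1} = \mathfrak{C}_1$,

we also obtain that $\mathfrak{C}_1$ is Abelian. This in turn means that since

$\mathfrak{M} = \mathfrak{C}_1 \times \mathfrak{C}$,

the group $\mathfrak{C}$ contains the commutator group of $\mathfrak{M}$. To obtain a completely
dualistic formulation of these results let us define the anti-center of a group
as the quotient group with respect to the commutator group. We then have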







THEOREM 3. Let $\mathfrak{A}_1$ and $\mathfrak{B}_1$ be left-hand directly similar quotient groups.
Then we have the relations

$\mathfrak{M} = [\mathfrak{A}_1, \mathfrak{B}_1]_l = [\mathfrak{A}_1, \mathfrak{C}_1]_l, \quad (\mathfrak{A}_1, \mathfrak{C}_1)_l = (\mathfrak{B}_1, \mathfrak{C}_1)_l = \mathfrak{E}$,

where $\mathfrak{C}_1$ is an Abelian group which is a left-hand factor of the anti-center of $\mathfrak{M}$.

This theorem is the dual of Theorem 1. It shows the duality between the
center and the anti-center of a group. The existence of such a duality was
already observed by Speiser¹³ in connection with the direct product
decompositions of a group. Here it is explained as an instance of our general principle
of duality.


3. Properties of three normal subgroups. Since the Dedekind axiom is 
satisfied for normal subgroups, any three normal subgroups will generate by 
cross-cuts and unions a special Dedekind structure containing in general 28 
normal subgroups. The discussion of this structure leads us to a set of relations 
between three normal subgroups which we have analyzed in Foundations I, 
and which we shall not repeat here. We shall only make one application to 
obtain a particular theorem for groups. 

Let $A$, $B$ and $C$ be the given groups. To abbreviate we shall write

$R = [(B, C), (C, A), (A, B)]$,
$\bar R = ([B, C], [C, A], [A, B])$,

and

$T_A = [(A, [B, C]), (B, C)] = ([A, (B, C)], [B, C])$,

while $T_B$ and $T_C$ are obtained by permutation of letters. We can now prove
for any Dedekind structure

$\bar R = [T_A, T_B] = [T_B, T_C] = [T_C, T_A]$,
$R = (T_A, T_B) = (T_B, T_C) = (T_C, T_A)$.

This means that if we put

$\mathfrak{A} = T_A/R, \quad \mathfrak{B} = T_B/R, \quad \mathfrak{C} = T_C/R$,

the conditions (12) in Theorem 2 are satisfied, and hence we have the following
interesting

THEOREM 4. For any three normal subgroups $A$, $B$ and $C$ the quotient group

$\mathfrak{M} = ([A, B], [B, C], [C, A])/[(A, B), (B, C), (C, A)]$

is Abelian.
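
The structure identities used for Theorem 4 can be tested mechanically in a small Dedekind structure. The sketch below is only an illustration under stated assumptions: it takes the subgroups of the elementary Abelian group $F_2^3$ (all normal, since the group is Abelian), represents each subgroup as a set of bit vectors, and checks the relations for $\bar R$, $R$, $T_A$, $T_B$, $T_C$ on every triple. The names `span`, `union`, `cross_cut` and `check_triple` are hypothetical. Because the ambient group is Abelian, the Abelian conclusion is trivial here; only the identities are being exercised.

```python
from itertools import combinations

N = 3  # vectors of F_2^N are the integers 0 .. 2**N - 1, addition is XOR

def span(vectors):
    """Subgroup of F_2^N generated by the given vectors."""
    space = {0}
    for v in vectors:
        space |= {v ^ w for w in space}
    return frozenset(space)

def union(*subs):
    """[A, B, ...]: the subgroup generated by all arguments."""
    return span([v for s in subs for v in s])

def cross_cut(*subs):
    """(A, B, ...): the intersection of all arguments."""
    out = set(subs[0])
    for s in subs[1:]:
        out &= s
    return frozenset(out)

def all_subgroups():
    found = {span([])}
    for k in range(1, N + 1):
        for gens in combinations(range(1, 2 ** N), k):
            found.add(span(gens))
    return sorted(found, key=lambda s: (len(s), sorted(s)))

def T(a, b, c):
    """T_A = [(A, [B, C]), (B, C)]; permute the arguments for T_B and T_C."""
    return union(cross_cut(a, union(b, c)), cross_cut(b, c))

def check_triple(a, b, c):
    """Verify the identities for one triple and return the order of R-bar / R."""
    r_low = union(cross_cut(b, c), cross_cut(c, a), cross_cut(a, b))
    r_up = cross_cut(union(b, c), union(c, a), union(a, b))
    ta, tb, tc = T(a, b, c), T(b, c, a), T(c, a, b)
    assert ta == cross_cut(union(a, cross_cut(b, c)), union(b, c))
    assert r_up == union(ta, tb) == union(tb, tc) == union(tc, ta)
    assert r_low == cross_cut(ta, tb) == cross_cut(tb, tc) == cross_cut(tc, ta)
    return len(r_up) // len(r_low)

subgroups = all_subgroups()
for a in subgroups:
    for b in subgroups:
        for c in subgroups:
            check_triple(a, b, c)
```

For the three subgroups of order 2 in a four-group, `check_triple` returns 4: the whole four-group over the identity.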

A study of all possible similarity relations for quotients formed by consecutive 
elements in the structure generated by A, B and C reveals that these quotient 
groups fall into seven different classes of similarity. We shall state the results 
briefly. They may all be proved by application of the law of isomorphism. 


13 A. Speiser, Theorie der Gruppen von endlicher Ordnung, 2. Auflage, Berlin, 1927, p. 136.













THEOREM 5. For any three normal subgroups $A$, $B$ and $C$ there exist the
similarity relations

(16) $[A, B, C]/[A, B] = [B, C]/([A, B], [B, C]) = [C, (A, B)]/T_C = C/(C, [A, B])$,


and two others obtained by permutation. Three dual sets of similarities are
obtained by interchanging cross-cuts and unions in (16). Finally, there exists the
self-dual set of similarities

(17)
$([A, B], [A, C])/[A, (B, C)] = ([B, A], [B, C])/[B, (A, C)] = ([C, A], [C, B])/[C, (A, B)]$
$= (A, [B, C])/[(A, B), (A, C)] = (B, [A, C])/[(B, A), (B, C)] = (C, [A, B])/[(C, A), (C, B)]$
$= \bar R/T_A = \bar R/T_B = \bar R/T_C$
$= T_A/R = T_B/R = T_C/R$,

and all the quotient groups (17) are Abelian.

The last remark follows from the proof of Theorem 4. 

The similarity relations of Theorem 5 actually go back to Dedekind.¹⁴ They
have been restated in part by Remak¹⁵ and Garrett Birkhoff.¹⁶

Another finite substructure of a general Dedekind structure may be defined
by any four normal subgroups $A \supset a$ and $B \supset b$. In general the corresponding
structure contains 18 elements and the consecutive quotients fall into 7 similarity
classes. Since most of these similarity relations are trivial, we shall only mention

THEOREM 6. For any four normal subgroups $A \supset a$ and $B \supset b$ there exist the
self-dualistic relations

$[a, b]/[a, (b, A)] = [b, (A, B)]/(A, B)$

and

$([a, B], [b, A])/[a, b] = (A, B)/[(a, B), (b, A)] = [a, (A, B)]/[a, (b, A)] = [b, (A, B)]/[b, (a, B)]$.


The last relations contain the lemma of Zassenhaus for normal subgroups. 

It may be observed finally that the preceding relations can in some cases be 
extended to quasi-normal or permutable groups by using structure isomorphism 
or correspondences instead of similarity. In Chapter 2 we have already done 
this in the case of the lemma of Zassenhaus. 


YALE UNIVERSITY. 


14 R. Dedekind, Über die von drei Moduln erzeugte Dualgruppe, Math. Ann., vol. 53 (1900);
Werke, vol. 2, pp. 371-403.

15 R. Remak, Über Untergruppen direkter Produkte von drei Faktoren, Journ. für Math.,
vol. 166 (1932), pp. 65-100.

16 G. Birkhoff, On the combination of subalgebras, Proceedings Cambridge Phil. Soc.,
vol. 29 (1933), pp. 441-464.










A CLASS OF LINEAR GROUPS WITH INTEGRAL COEFFICIENTS 
By ARTHUR B. COBLE


Introduction. In the study of certain groups of Cremona transformations
in $S_k$ which I have called "regular groups" (¹ II, §§4, 5), the determination of the
types of transformations in the Cremona group was accomplished by the use
of a related linear group with integral coefficients. In later papers² it appeared
that the class of Cremona groups with such related linear groups is probably
quite extensive. It is the purpose of the present paper to discuss linear groups
of this character independently of their association with Cremona groups.
They are generated by a finite number of involutorial elements of a particular
type, derived in §1, and characterized more completely in §2. These groups
divide into two classes according to the values of a certain constant, $e = \pm 1$,
and they have an additional integer parameter $\varepsilon$. The linear groups first
mentioned occur when $\varepsilon = 1$, $e = -1$. These occurred in pairs of "associated
groups" and in §3 this isomorphism described as association is extended to
generic $\varepsilon$.

Each group has an invariant linear, and an invariant quadratic, form. This
imposes the necessary conditions on the coefficients of the generic element,
which are obtained in §4. These conditions, however, are not sufficient. The
de Jonquières subgroups, defined in §5 after the manner of the de Jonquières
group of planar Cremona transformations, are used in §§7, 8 to separate the
cases of finite and of infinite order.

The types of symmetric transformations, already determined when $\varepsilon = 1$,
$e = -1$, are found for the general case in §6.

A particular class, $g_\rho(\alpha)$, of these groups, whose generators are unusually
simple, is studied in §9 with particular reference to aggregates of products of
ternary de Jonquières transformations, and again in §10 with reference to the
nature of the coefficients of its elements. The writer hopes to consider the
more general group in a later paper.


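1. A particular type of involutorial matrix. In connection with the so-called
"symmetric Cremona transformations" there occur linear transformations with
integer coefficients, the matrix of the coefficients having the particular form

(1) $\begin{pmatrix} \alpha & -\beta & -\beta & \cdots & -\beta \\ \delta & -\gamma & -\varepsilon & \cdots & -\varepsilon \\ \delta & -\varepsilon & -\gamma & \cdots & -\varepsilon \\ \vdots & \vdots & \vdots & & \vdots \\ \delta & -\varepsilon & -\varepsilon & \cdots & -\gamma \end{pmatrix}$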


Received February 23, 1937.
1 A. B. Coble, Point sets and Cremona groups, I: Trans. Amer. Math. Soc., vol. 16 (1915),
pp. 155-198; II: Trans. Amer. Math. Soc., vol. 17 (1916), pp. 345-385.
2 A. B. Coble, Groups of Cremona transformations in space of planar type, I: this Journal,
vol. 2 (1936), pp. 1-9; II: this Journal, vol. 2 (1936), pp. 205-219.













where $\alpha, \beta, \gamma, \delta, \varepsilon$ are positive integers and where $\gamma$ may be zero. An example
is the well-known transformation

(2)
$x_0' = 2x_0 - x_1 - x_2 - x_3$,
$x_1' = x_0 - x_2 - x_3$,
$x_2' = x_0 - x_1 - x_3$,
$x_3' = x_0 - x_1 - x_2$,

associated with the quadratic Cremona transformation in the plane.

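In the instances mentioned these linear transformations are involutorial.
We ask then under what conditions a square matrix of type (1) and order $r + 1$
is involutorial. The customary conditions yield the following:

(3) $\begin{array}{ll} r = 1 & r > 1 \\ \alpha^2 - \beta\delta = 1, & \alpha^2 - r\beta\delta = 1, \\ \beta(-\alpha + \gamma) = 0, & -\alpha\beta + \beta\gamma + (r - 1)\beta\varepsilon = 0, \\ \delta(-\alpha + \gamma) = 0, & \alpha\delta - \gamma\delta - (r - 1)\delta\varepsilon = 0, \\ -\delta\beta + \gamma^2 = 1, & -\delta\beta + \gamma^2 + (r - 1)\varepsilon^2 = 1, \\ & -\delta\beta + 2\gamma\varepsilon + (r - 2)\varepsilon^2 = 0. \end{array}$

In the case $r > 1$, the last two conditions yield $(\gamma - \varepsilon)^2 = 1$, whence the entire
set can be written as

(4) $\begin{array}{ll} \gamma = \varepsilon + e & (e = \pm 1), \\ \alpha^2 = (r\varepsilon + e)^2, & \delta[\alpha - (r\varepsilon + e)] = 0, \\ \delta\beta = \varepsilon(r\varepsilon + 2e), & \beta[\alpha - (r\varepsilon + e)] = 0. \end{array}$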

Since $\delta$ and $\beta$ are to be greater than zero, we have
(5) The matrix (1), if of two rows $(r = 1)$, is involutorial if $\gamma = \alpha$ and $\delta$, $\beta$ are
complementary factors of $\alpha^2 - 1$ $(\alpha > 1)$; if of $(r + 1)$ rows $(r > 1)$, is involutorial
if $\gamma = \varepsilon + e$ $(e = \pm 1)$, $\alpha = r\varepsilon + e$, and $\delta$, $\beta$ are complementary factors of $\varepsilon(r\varepsilon + 2e)$,
the case $r = 2$, $\varepsilon = 1$, $e = -1$ being excluded. We assume also that $\varepsilon \geq 1$.
Certain particular cases of matrices (1) which are involutorial are excluded
by the restrictions imposed in (5). These would be of no interest in connection
with the groups about to be defined. We denote by $I_{12 \cdots r}$ the linear
transformation on variables $x_0, x_1, \cdots, x_r$ whose matrix of coefficients is (1) as
limited in (5).
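
Statement (5) is easy to test numerically. The following sketch builds the matrix (1) for $r > 1$ under the additional assumption $\delta = 1$ (so that $\beta = \varepsilon(r\varepsilon + 2e)$) and checks that its square is the identity; the function names are hypothetical and the choice $\delta = 1$ is made only for the example.

```python
def involution_matrix(r, eps, e):
    """Matrix (1) of order r + 1 for r > 1, taking delta = 1 in statement (5)."""
    if r <= 1 or eps < 1 or e not in (1, -1) or (r, eps, e) == (2, 1, -1):
        raise ValueError("parameters outside the range allowed in (5)")
    alpha = r * eps + e              # alpha = r*eps + e
    beta = eps * (r * eps + 2 * e)   # delta*beta = eps*(r*eps + 2e) with delta = 1
    gamma = eps + e                  # gamma = eps + e
    rows = [[alpha] + [-beta] * r]
    for i in range(1, r + 1):
        row = [1] + [-eps] * r       # delta = 1 in the first column
        row[i] = -gamma              # diagonal entry
        rows.append(row)
    return rows

def matmul(a, b):
    """Product of two square integer matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def is_involution(m):
    n = len(m)
    return matmul(m, m) == [[int(i == j) for j in range(n)] for i in range(n)]

# r = 3, eps = 1, e = -1 reproduces the matrix of example (2).
assert involution_matrix(3, 1, -1) == [[2, -1, -1, -1],
                                       [1, 0, -1, -1],
                                       [1, -1, 0, -1],
                                       [1, -1, -1, 0]]
assert all(is_involution(involution_matrix(r, eps, e))
           for r in range(2, 6) for eps in range(1, 5) for e in (1, -1)
           if (r, eps, e) != (2, 1, -1))
```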
It may be verified without difficulty that $I_{12 \cdots r}$ has the absolute invariants

(6)
$Q = \delta x_0^2 - \beta(x_1^2 + \cdots + x_r^2)$,
$L = r\delta x_0 - (\alpha - 1)(x_1 + \cdots + x_r)$.

It has the relatively invariant linear form

(7) $L' = r\delta x_0 - (\alpha + 1)(x_1 + \cdots + x_r)$,

with multiplier $-1$. It has also $r - 1$ linearly independent invariant forms of
type $x_i - x_j$ $(i, j > 0; i \neq j)$, which are absolutely unaltered if $e = -1$, and
reproduced with a multiplier $-1$ if $e = 1$. From these facts we conclude in
geometric language that

(8) The involution $I_{12 \cdots r}$ with matrix (1), as limited in (5), is always of the
type perspective at a point. Given the quadratic form $Q$, this point, or its polar
linear form, defines the involution. If $r = 1$, this polar linear form is either $L$ or
$L'$; if $r > 1$, it is $L$ or $L'$ according as $e$ is $1$ or $-1$. If $r = 1$, the determinant of
the matrix is $-1$; if $r > 1$, this determinant has the value $(-1)^r$ or $-1$ according
as $e$ is $1$ or $-1$.
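
The invariants (6), (7) and the determinant values in (8) can be confirmed for small parameters. The sketch below again assumes $\delta = 1$ and $r > 1$; a linear form is stored as its coefficient list, and the quadratic form $Q$ as its diagonal. All names are hypothetical.

```python
def involution_matrix(r, eps, e):
    """Matrix (1) with delta = 1, alpha = r*eps + e, beta = eps*(r*eps + 2e)."""
    alpha, beta, gamma = r * eps + e, eps * (r * eps + 2 * e), eps + e
    rows = [[alpha] + [-beta] * r]
    for i in range(1, r + 1):
        row = [1] + [-eps] * r
        row[i] = -gamma
        rows.append(row)
    return rows

def det(m):
    """Determinant by Laplace expansion along the first row (small matrices only)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for j, entry in enumerate(m[0]):
        minor = [row[:j] + row[j + 1:] for row in m[1:]]
        total += (-1) ** j * entry * det(minor)
    return total

def apply_to_form(coeffs, m):
    """Coefficients of the linear form x -> coeffs . (m x)."""
    n = len(m)
    return [sum(coeffs[i] * m[i][j] for i in range(n)) for j in range(n)]

def check_statement_8(r, eps, e):
    m = involution_matrix(r, eps, e)
    n = r + 1
    alpha, beta, delta = m[0][0], -m[0][1], m[1][0]
    L = [r * delta] + [-(alpha - 1)] * r          # form L of (6)
    L_prime = [r * delta] + [-(alpha + 1)] * r    # form L' of (7)
    q = [delta] + [-beta] * r                     # diagonal of Q in (6)
    # Q is invariant exactly when m^T diag(q) m = diag(q).
    mt_q_m = [[sum(m[k][i] * q[k] * m[k][j] for k in range(n)) for j in range(n)]
              for i in range(n)]
    q_matrix = [[q[i] if i == j else 0 for j in range(n)] for i in range(n)]
    return (apply_to_form(L, m) == L
            and apply_to_form(L_prime, m) == [-c for c in L_prime]
            and mt_q_m == q_matrix
            and det(m) == ((-1) ** r if e == 1 else -1))

assert all(check_statement_8(r, eps, e)
           for r in range(2, 5) for eps in range(1, 4) for e in (1, -1)
           if (r, eps, e) != (2, 1, -1))
```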


2. The linear groups $g_\rho(\alpha)$, $g_\rho(r, \varepsilon, e)$. Let $j_1, \cdots, j_r$ be any combination
of $r$ indices from $1, 2, \cdots, \rho$ $(\rho > r)$, and let $I_{j_1 \cdots j_r}$ be the involutorial linear
transformation on variables $x_0, x_{j_1}, \cdots, x_{j_r}$ with the same matrix of coefficients
as was assigned to $I_{12 \cdots r}$ in 1, the remaining variables in the set $x_1, \cdots, x_\rho$
being unaltered. We define the group $g_\rho(r, \varepsilon, e)$ to be that generated by the $\binom{\rho}{r}$
involutions $I_{j_1 \cdots j_r}$ formed for the given $\varepsilon \geq 1$ and $e = \pm 1$. Thus $g_\rho(r, \varepsilon, e)$ is
a group of linear homogeneous transformations on $\rho + 1$ variables.
There is no loss of generality in setting

(1) $\delta = 1, \quad \beta = \varepsilon(r\varepsilon + 2e)$

in the generators $I_{j_1 \cdots j_r}$. For if $\delta \neq 1$, the transformation

(2) $x_0 = x_0', \quad x_i = \delta x_i'$  $(i = 1, \cdots, \rho)$

carries the group into one for which $\delta = 1$. We have then
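(3) If $r = 1$, the linear group $g_\rho(\alpha)$ is generated by $\rho$ involutions of the form

$I_j$: $x_0' = \alpha x_0 - (\alpha^2 - 1)x_j, \quad x_j' = x_0 - \alpha x_j, \quad x_k' = x_k$  $(k \neq 0, j)$.

It has the invariant forms

$Q = x_0^2 - (\alpha^2 - 1)(x_1^2 + \cdots + x_\rho^2), \quad L = x_0 - (\alpha - 1)(x_1 + \cdots + x_\rho)$  $(\alpha \geq 2)$.

(4) If $r > 1$, the linear group $g_\rho(r, \varepsilon, e)$ on variables $x_0, x_1, \cdots, x_\rho$ is generated
by $\binom{\rho}{r}$ involutions of the form

$I_{j_1 \cdots j_r}$:
$x_0' = (r\varepsilon + e)x_0 - \varepsilon(r\varepsilon + 2e)(x_{j_1} + \cdots + x_{j_r})$,
$x_{j_i}' = x_0 - \varepsilon(x_{j_1} + \cdots + x_{j_r}) - e x_{j_i}$  $(i = 1, \cdots, r)$,
$x_k' = x_k$  $(k \neq 0, j_i)$.

It has the invariant forms

$Q = x_0^2 - \varepsilon(r\varepsilon + 2e)(x_1^2 + \cdots + x_\rho^2)$,
$L = r x_0 - (r\varepsilon + e - 1)(x_1 + \cdots + x_\rho)$.

Naturally, if $e = 1$, $L$ may be taken in the simplified form

(5) $L = x_0 - \varepsilon(x_1 + \cdots + x_\rho)$  $(e = 1)$.

Thus, for $r = 1$, there exists a $g_\rho(\alpha)$ for every $\alpha \geq 2$ and for every $\rho \geq 2$.
For $r > 1$, there exists a $g_\rho(r, \varepsilon, e)$ for every $\varepsilon \geq 1$, for every $\rho \geq r + 1$, and
for each $e = \pm 1$, except when $r = 2$, $\varepsilon = 1$, $e = -1$.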
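
The generators in (4) and their common invariants can be written out explicitly for small $\rho$. The sketch below is an illustration for $r > 1$ only; index sets are 1-based tuples, and the helper names are hypothetical.

```python
from itertools import combinations

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def identity(n):
    return [[int(i == j) for j in range(n)] for i in range(n)]

def generator(rho, r, eps, e, idx):
    """Matrix of I_{j_1...j_r} on x_0, ..., x_rho for the index set idx, as in (4)."""
    alpha = r * eps + e
    beta = eps * (r * eps + 2 * e)
    m = identity(rho + 1)
    m[0][0] = alpha
    for j in idx:
        m[0][j] = -beta                            # x_0' = alpha x_0 - beta (sum of x_j)
        m[j][0] = 1
        for k in idx:
            m[j][k] = -eps - (e if j == k else 0)  # x_j' = x_0 - eps (sum) - e x_j
    return m

def generators(rho, r, eps, e):
    return [generator(rho, r, eps, e, idx)
            for idx in combinations(range(1, rho + 1), r)]

def preserves_invariants(m, r, eps, e):
    """Check that m leaves the forms Q and L of (4) unaltered."""
    n = len(m)
    rho = n - 1
    beta = eps * (r * eps + 2 * e)
    L = [r] + [-(r * eps + e - 1)] * rho
    q = [1] + [-beta] * rho
    new_L = [sum(L[i] * m[i][j] for i in range(n)) for j in range(n)]
    new_q = [[sum(m[k][i] * q[k] * m[k][j] for k in range(n)) for j in range(n)]
             for i in range(n)]
    return new_L == L and new_q == [[q[i] if i == j else 0 for j in range(n)]
                                    for i in range(n)]

gens = generators(4, 2, 2, 1)
assert len(gens) == 6                       # binomial(4, 2) generators
assert all(matmul(g, g) == identity(5) for g in gens)
assert all(preserves_invariants(g, 2, 2, 1) for g in gens)
```

Since the invariants are preserved by each generator, they are preserved by every product of generators as well.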


It is clear from the nature of the generators that
(6) If $r < \rho' < \rho$, the $g_{\rho'}(r)$ is a subgroup of $g_\rho(r)$, the other parameters $\alpha$, or $\varepsilon$, $e$
in the two groups being the same.
We shall later (cf. 5 (1)) find other cases in which certain groups $g_{\rho'}$ are
subgroups of another group, $g_\rho$.

If we consider that the generating involutions are harmonic perspectivities
determined by skew $S_k$ and $S_{\rho-1-k}$ in the $S_\rho$ of $x_0, x_1, \cdots, x_\rho$, we see that
(7) The group $g_\rho(\alpha)$ in $S_\rho$ is generated by harmonic perspectivities whose spaces
of fixed points are the $S_0$ with equation $(\alpha - 1)\xi_0 - \xi_j = 0$, and the $S_{\rho-1}$ with
equation $x_0 - (\alpha + 1)x_j = 0$. If $e = -1$, the generators of $g_\rho(r, \varepsilon, -1)$ have
also, for spaces of fixed points, the $S_0$, $(r\varepsilon - 2)\xi_0 - (\xi_{j_1} + \cdots + \xi_{j_r}) = 0$, and
the $S_{\rho-1}$, $x_0 - \varepsilon(x_{j_1} + \cdots + x_{j_r}) = 0$. If however $e = 1$, the generators of $g_\rho$
have for spaces of fixed points the $S_{r-1}$, $L = x_{k_1} = x_{k_2} = \cdots = x_{k_{\rho-r}} = 0$, and
the $S_{\rho-r}$, $L' = x_{j_1} - x_{j_2} = \cdots = x_{j_1} - x_{j_r} = 0$, where the indices $k_1, \cdots, k_{\rho-r}$
are complementary to $j_1, \cdots, j_r$ in $1, 2, \cdots, \rho$.


3. Associated groups. In connection with regular Cremona groups and
their attached linear groups $g_\rho(r, 1, -1)$ [cf. ¹ II, §5 (27)], the author has called
attention to an isomorphism referred to as "associated groups". This
isomorphism persists in some measure for the more general linear groups considered
here. We prove first that
(1) If $\rho - r > 1$, the group $g_\rho(r, \varepsilon, -1)$ is the linear transform of the group
$g_\rho(\rho - r, \varepsilon, -1)$, the generating involution $I_{j_1 \cdots j_r}$ of the first group being the
transform of the generating involution $\bar I_{j_{r+1} \cdots j_\rho}$ of the second group.

For the proof it is necessary only to find a linear transformation $T$ of non-zero
determinant from $x$ to $\bar x$ which converts the fixed $S_0$, $S_{\rho-1}$ of $I_{j_1 \cdots j_r}$ (cf.
2 (7)), i.e.,

$(r\varepsilon - 2)\xi_0 - (\xi_{j_1} + \cdots + \xi_{j_r}), \quad x_0 - \varepsilon(x_{j_1} + \cdots + x_{j_r})$,

into the fixed $\bar S_0$, $\bar S_{\rho-1}$ of $\bar I_{j_{r+1} \cdots j_\rho}$, which are, with parameter $\bar\varepsilon$ for the second group,

$[(\rho - r)\bar\varepsilon - 2]\bar\xi_0 - (\bar\xi_{j_{r+1}} + \cdots + \bar\xi_{j_\rho}), \quad \bar x_0 - \bar\varepsilon(\bar x_{j_{r+1}} + \cdots + \bar x_{j_\rho})$.

Since $T$ must have the same effect for all sets of complementary indices $j_1, \cdots, j_r$
and $j_{r+1}, \cdots, j_\rho$, $T$ must be symmetric in $x_1, \cdots, x_\rho$, and we set

(2)
$\bar x_0 = l x_0 + m(x_1 + \cdots + x_\rho)$,
$\bar x_i = n x_0 + p(x_1 + \cdots + x_\rho) + q x_i$  $(i = 1, \cdots, \rho)$.

The determinant of $T$ is $q^{\rho-1}[lq + \rho lp - \rho mn]$. The conditions that $S_0$, $\bar S_0$
and $S_{\rho-1}$, $\bar S_{\rho-1}$ are conjugate under $T$ and $T^{-1}$ respectively are

$l(r\varepsilon - 2) + mr : n(r\varepsilon - 2) + pr + q : n(r\varepsilon - 2) + pr = [(\rho - r)\bar\varepsilon - 2] : 0 : 1$,

$l - \bar\varepsilon(\rho - r)n : m - \bar\varepsilon(\rho - r)p : m - \bar\varepsilon(\rho - r)p - \bar\varepsilon q = 1 : -\varepsilon : 0$.

These four conditions on $l, m, n, p, q$ respectively have the matrix

$\begin{pmatrix} 0 & 0 & r\varepsilon - 2 & r & 1 \\ 0 & -1 & 0 & \bar\varepsilon(\rho - r) & \bar\varepsilon \\ r\varepsilon - 2 & r & 0 & 0 & \bar\varepsilon(\rho - r) - 2 \\ \varepsilon & 0 & -\varepsilon\bar\varepsilon(\rho - r) & 0 & \bar\varepsilon \end{pmatrix}$.

If $l$ and $m$ are eliminated from the last three, and the result simplified by using
the first condition, we get $(\bar\varepsilon - \varepsilon)q = 0$. Since $q$, a factor of the determinant
of $T$, is not zero, $\bar\varepsilon = \varepsilon$. The conditions are then dependent. Setting $p = 0$,
we find that
(3) The groups $g_\rho(r, \varepsilon, -1)$, $g_\rho(\rho - r, \varepsilon, -1)$ of (1) are conjugate under $T$ in (2)
with $l : m : n : p : q = -(\rho\varepsilon - 2) : \varepsilon(r\varepsilon - 2) : -1 : 0 : (r\varepsilon - 2)$, and with determinant
$2(r\varepsilon - 2)^\rho > 0$.
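
Statement (3) can be checked directly for small cases: with $p = 0$ the matrix $T$ should satisfy $T \cdot I_{j_1 \cdots j_r} = \bar I_{j_{r+1} \cdots j_\rho} \cdot T$ for every index set and its complement. The sketch below assumes $e = -1$, $\rho - r > 1$, and parameters for which neither group falls under the excluded case; the function names are hypothetical.

```python
from itertools import combinations

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def generator(rho, r, eps, idx):
    """Generator I_idx of g_rho(r, eps, -1) as a matrix on x_0, ..., x_rho."""
    e = -1
    alpha, beta = r * eps + e, eps * (r * eps + 2 * e)
    m = [[int(i == j) for j in range(rho + 1)] for i in range(rho + 1)]
    m[0][0] = alpha
    for j in idx:
        m[0][j] = -beta
        m[j][0] = 1
        for k in idx:
            m[j][k] = -eps - (e if j == k else 0)
    return m

def association_T(rho, r, eps):
    """Matrix of T in (2) with the ratios l:m:n:p:q given in statement (3)."""
    l, m, n, p, q = -(rho * eps - 2), eps * (r * eps - 2), -1, 0, r * eps - 2
    t = [[l] + [m] * rho]
    for i in range(1, rho + 1):
        row = [n] + [p] * rho
        row[i] += q
        t.append(row)
    return t

def check_association(rho, r, eps):
    """True if T carries each I_J into the complementary generator of the second group."""
    t = association_T(rho, r, eps)
    all_idx = set(range(1, rho + 1))
    for idx in combinations(sorted(all_idx), r):
        comp = tuple(sorted(all_idx - set(idx)))
        lhs = matmul(t, generator(rho, r, eps, idx))
        rhs = matmul(generator(rho, rho - r, eps, comp), t)
        if lhs != rhs:
            return False
    return True

assert check_association(4, 2, 2)
assert check_association(5, 2, 2)
assert check_association(6, 3, 1)
```

Comparing $T I$ with $\bar I T$ avoids inverting $T$, so the check stays in integer arithmetic.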

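If $\rho - r = 1$, we seek a transformation $T$ which converts the fixed $S_0$, $S_{\rho-1}$ of
$I_{j_1 \cdots j_r}$ as given above into the fixed $\bar S_0$, $\bar S_{\rho-1}$ of $\bar I_{j_{r+1}}$ of $g_{r+1}(\alpha)$, which are
$(\alpha - 1)\bar\xi_0 - \bar\xi_{j_{r+1}} = 0$, $\bar x_0 - (\alpha + 1)\bar x_{j_{r+1}} = 0$. The matrix of the four conditions on
$l, m, n, p, q$ is

$\begin{pmatrix} 0 & -1 & 0 & \alpha + 1 & \alpha + 1 \\ \varepsilon & 0 & -\varepsilon(\alpha + 1) & 0 & \alpha + 1 \\ 0 & 0 & r\varepsilon - 2 & r & 1 \\ r\varepsilon - 2 & r & 0 & 0 & \alpha - 1 \end{pmatrix}$.

If the first and second equations are solved for $l$, $m$, and the results substituted
in the last, a comparison with the third equation yields $[(\alpha + 1) - \varepsilon]q = 0$,
whence $\varepsilon = \alpha + 1$. Again setting $p = 0$, we find that

(4) For $\varepsilon \geq 3$, the group $g_{r+1}(r, \varepsilon, -1)$ can be transformed into the group
$g_{r+1}(\alpha = \varepsilon - 1)$ by the transformation $T$ in (2) with
$l : m : n : p : q = -(r\varepsilon - 2) - \varepsilon : \varepsilon(r\varepsilon - 2) : -1 : 0 : r\varepsilon - 2$,
and with determinant $2(r\varepsilon - 2)^{r+1} > 0$, in such wise that generators
$I_{j_1 \cdots j_r}$ and $\bar I_{j_{r+1}}$ of the respective groups are conjugate.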

It is clear that no group $g_\rho(r, \varepsilon, e)$ with $e = 1$ can be transformed into a group
$g_\rho(\alpha)$ or into a $g_\rho(r, \varepsilon, e)$ with $e = -1$ in such a way that the generating
involutions of the one group pass into those of the other, since, according to 2 (7),
the fixed spaces of the generators of the one group are of different character
from those of the generators of the other. Nor, for the same reason, can two
$g_\rho$'s with $e = 1$ be transformed into each other in such fashion that
complementary generators pass into each other. Thus association is characteristic
of the groups $g_\rho(\alpha)$ and $g_\rho(r, \varepsilon, -1)$.

(5) The group $g_\rho(r, \varepsilon, -1)$ will have not only the series of subgroups isomorphic
with $g_{\rho-k}(r, \varepsilon, -1)$ $(k = 1, \cdots, \rho - r - 1)$ mentioned in 2 (6) but also subgroups
isomorphic with $g_{\rho-k-l}(\rho - r - k, \varepsilon, -1)$ $(l = 1, \cdots, r - 1)$. If however $\varepsilon = 1, 2$,
then $k = 1, \cdots, \rho - r - 2$.

For, according to 2 (6), $g_\rho(r, \varepsilon, -1)$ contains subgroups $g_{\rho-k}(r, \varepsilon, -1)$
obtained by fixing $k$ of the variables. The subgroup $g_{\rho-k}(r, \varepsilon, -1)$ is, according
to (3), isomorphic with $g_{\rho-k}(\rho - r - k, \varepsilon, -1)$, and this latter, with $l$ variables
fixed, yields subgroups $g_{\rho-k-l}(\rho - r - k, \varepsilon, -1)$.

The distinguishing feature of the association here derived is the correspondence
between complementary generators of the associated groups which automatically
sets up an isomorphism. There may well be other types of isomorphism between
two of these groups, and even other cases of conjugacy under linear
transformation.

Reverting to (3), we observe that if $\rho = 2r$, the associated groups coincide,
whence
(6) The groups $g_{2r}(r, \varepsilon, -1)$ are self-associated. Each possesses an inner
isomorphism in which complementary generators correspond.


4. Relations connecting the integer coefficients of the elements of the groups
$g_\rho$. In order to include both the groups $g_\rho(\alpha)$ and the groups $g_\rho(r, \varepsilon, e)$, it is
convenient to return to the notation of 1 with

$\gamma = \varepsilon + e, \quad \delta\beta = \varepsilon(r\varepsilon + 2e), \quad r\delta\beta = \alpha^2 - 1$.

We also take the invariant forms $Q$, $L$ as in 1 (6).
Let the generic element of $g_\rho$ be written as

(1) $T$:
$x_0' = a_{00}x_0 - a_{01}x_1 \cdots - a_{0j}x_j \cdots - a_{0\rho}x_\rho$,
$x_i' = a_{i0}x_0 - a_{i1}x_1 \cdots - a_{ij}x_j \cdots - a_{i\rho}x_\rho$  $(i = 1, \cdots, \rho)$.


We first observe that 


(2) aon = 1 + anéd8, ao = 6d;, aj; = Bb; (¢ > 0,7 > 0). 





For these relations are satisfied by the generators, and, if satisfied by (1), are 
also satisfied by the product of (1) and a generator. Hence they are true of 
every element. 

The quadratic relations on the coefficients $a_{ij}$ are as follows $(i, j, k = 1, \cdots, \rho)$:

(3)
$\delta\beta \sum_i d_i^2 = a_{00}^2 - 1, \qquad \delta\beta \sum_j b_j^2 = a_{00}^2 - 1$,
$\sum_i a_{ij}^2 = \delta\beta b_j^2 + 1, \qquad \sum_j a_{ij}^2 = \delta\beta d_i^2 + 1$,
$\sum_i d_i a_{ij} = b_j \cdot a_{00}, \qquad \sum_j b_j a_{ij} = d_i \cdot a_{00}$,
$\sum_i a_{ij} a_{ik} = \delta\beta b_j b_k \ (j \neq k), \qquad \sum_j a_{ij} a_{kj} = \delta\beta d_i d_k \ (i \neq k)$.

The first column of four relations expresses that $Q$ is unaltered by $T$. But it
also expresses that the matrix $T'$ obtained by interchanging $b_i$ with $d_i$, also $a_{ij}$
with $a_{ji}$, in $T$ satisfies the identity $TT' = 1$, whence $T'$ is the inverse of $T$.
Hence
(4) The inverse of $T$ is obtained from $T$ by interchanging $b_i$ with $d_i$, and $a_{ij}$
with $a_{ji}$.

The second column of four relations (3) then expresses that $Q$ is unaltered
by $T^{-1}$.
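
Rule (4) for the inverse can be tried on products of generators. The sketch below is restricted, as an assumption of the example, to the groups $g_\rho(\alpha)$ with $\delta = 1$ and $\beta = \alpha^2 - 1$, where $a_{0j} = \beta b_j$ and $a_{i0} = d_i$; the helper names are hypothetical, and the sign convention of (1) is built into the matrix entries.

```python
def identity(n):
    return [[int(i == j) for j in range(n)] for i in range(n)]

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def generator(rho, alpha, j):
    """Matrix of I_j in g_rho(alpha): x_0' = alpha x_0 - (alpha^2 - 1) x_j, x_j' = x_0 - alpha x_j."""
    m = identity(rho + 1)
    m[0][0] = alpha
    m[0][j] = -(alpha * alpha - 1)
    m[j][0] = 1
    m[j][j] = -alpha
    return m

def product(rho, alpha, word):
    """Matrix of the element obtained by applying I_j for each j in word, in order."""
    t = identity(rho + 1)
    for j in word:
        t = matmul(generator(rho, alpha, j), t)
    return t

def inverse_by_interchange(m, beta):
    """Apply rule (4) with delta = 1: swap b_i with d_i and a_ij with a_ji."""
    n = len(m)
    if any(m[0][j] % beta for j in range(1, n)):
        raise ValueError("first row is not divisible by beta")
    b = [-m[0][j] // beta for j in range(1, n)]   # matrix entry (0, j) is -beta * b_j
    d = [m[i][0] for i in range(1, n)]            # matrix entry (i, 0) is d_i
    inv = [[m[0][0]] + [-beta * x for x in d]]
    for i in range(1, n):
        # entry (i, j) of the matrix is -a_ij, so the swapped entry is m[j][i]
        inv.append([b[i - 1]] + [m[j][i] for j in range(1, n)])
    return inv

alpha, rho = 2, 3
t = product(rho, alpha, [1, 2, 3, 1, 2])
assert matmul(t, inverse_by_interchange(t, alpha * alpha - 1)) == identity(rho + 1)
```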

If we apply $T$ and $T^{-1}$ to the invariant form $L$, the following four linear
relations on the coefficients of $T$ are obtained:

(5)
$(\alpha - 1)\sum_i d_i = r(a_{00} - 1), \qquad (\alpha - 1)\sum_j b_j = r(a_{00} - 1)$,
$\sum_i a_{ij} = (\alpha + 1)b_j - 1, \qquad \sum_j a_{ij} = (\alpha + 1)d_i - 1$.
The following fact is needed in the next section:
(6) If $T$ in $g_\rho(r, \varepsilon, e)$ is a product of $g$ generators, then

$a_{00} = (-e)^g + \mu(r\varepsilon + 2e)$.

For this relation is true for a single generator, $g = 1$, $\mu = 1$, $a_{00} = \alpha = r\varepsilon + e$.
Let us assume that it is true for $T$, i.e., that $a_{00} \equiv (-e)^g \bmod (r\varepsilon + 2e)$. It is
then true for the product $T'$ of $T$ and an additional generator, since
$a_{00}' = a_{00}(r\varepsilon + e) - \varepsilon(r\varepsilon + 2e)(d_{j_1} + \cdots + d_{j_r})$ and
$a_{00}' \equiv a_{00}(r\varepsilon + e) \equiv a_{00}(-e) \equiv (-e)^g(-e) \equiv (-e)^{g+1} \bmod (r\varepsilon + 2e)$.
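
The congruence in (6) can be followed step by step along a random product of generators. This is a sketch under the assumptions $\delta = 1$ and $r > 1$; the seeded random choice of generators and all names are illustrative.

```python
import random
from itertools import combinations

def matmul(a, b):
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def generator(rho, r, eps, e, idx):
    """Generator I_idx of g_rho(r, eps, e) as a matrix on x_0, ..., x_rho."""
    alpha, beta = r * eps + e, eps * (r * eps + 2 * e)
    m = [[int(i == j) for j in range(rho + 1)] for i in range(rho + 1)]
    m[0][0] = alpha
    for j in idx:
        m[0][j] = -beta
        m[j][0] = 1
        for k in idx:
            m[j][k] = -eps - (e if j == k else 0)
    return m

def check_congruence(rho, r, eps, e, length, seed=0):
    """Check a_00 = (-e)^g mod (r*eps + 2e) after each of `length` random generators."""
    modulus = r * eps + 2 * e
    if modulus <= 0:
        raise ValueError("excluded case: r*eps + 2e must be positive")
    rng = random.Random(seed)
    gens = [generator(rho, r, eps, e, idx)
            for idx in combinations(range(1, rho + 1), r)]
    t = [[int(i == j) for j in range(rho + 1)] for i in range(rho + 1)]
    for g in range(1, length + 1):
        t = matmul(rng.choice(gens), t)     # apply one more generator
        if (t[0][0] - (-e) ** g) % modulus != 0:
            return False
    return True

assert check_congruence(4, 2, 2, 1, 12)
assert check_congruence(5, 3, 2, -1, 12)
```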

The corresponding theorem for $g_\rho(\alpha)$ is
(7) If $T$ in $g_\rho(\alpha)$ is a product of $g$ generators with determinant $\Delta = (-1)^g$, then
$a_{00} = \Delta + \mu(\alpha + 1)$.

This indeed is true of the generators themselves, and it remains true when
they are combined.

A further property, used in 9, of $T$ in $g_p(\alpha)$ is
(8) The value $\mu$ defined in (7) satisfies the relation
$$\mu - \sum_j b_j = 2\lambda \quad (\lambda \text{ integral and } \geq 0).$$

To prove this we consider the product $TI_i$ with coefficients $a_{00}' = (n - 2)a_{00} - (n - 1)(n - 3)d_i$, $b_j' = (n - 2)b_j - a_{ij}$. Since, for the product, $\Delta' = -\Delta$, $a_{00}' = (n - 2)\Delta + (n - 2)\mu(n - 1) - (n - 1)(n - 3)d_i = -\Delta + (n - 1)\{\Delta + \mu(n - 2) - (n - 3)d_i\}$, whence $\mu' = \Delta + \mu(n - 2) - (n - 3)d_i$. Also, according to (5), $\sum_j b_j' = (n - 2)\sum_j b_j - (n - 1)d_i + 1$. Hence $\mu' - \sum_j b_j' = (n - 2)(\mu - \sum_j b_j) + \Delta - 1 + 2d_i$. Since $\Delta - 1$ is even, $\mu' - \sum_j b_j'$ is even if $\mu - \sum_j b_j$ is even. But, for a single generator, $\mu = 1$ and $\sum_j b_j = 1$, whence $\mu - \sum_j b_j$ is even, and remains even under multiplication of the generators.
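
The relations of this section can be checked numerically. The following is a minimal sketch, not part of the paper: it assumes the constants $\alpha = n - 2$, $\delta = n - 3$, $\beta = n - 1$ used in the proof above, builds the generators $I_i$ as integer matrices, and tests (2), (3), (5), (7) and the parity statement of (8) on an arbitrary word in the generators. The function names are illustrative only.

```python
def generator(p, i, alpha, delta, beta):
    """Matrix of I_i: x_0' = alpha*x_0 - beta*x_i, x_i' = delta*x_0 - alpha*x_i."""
    m = [[int(r == c) for c in range(p + 1)] for r in range(p + 1)]
    m[0][0], m[0][i] = alpha, -beta
    m[i][0], m[i][i] = delta, -alpha
    return m

def matmul(a, b):
    size = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(size)) for c in range(size)]
            for r in range(size)]

def check_word(n, p, word):
    """Assert relations (2), (3), (5), (7) and the parity in (8) for one product."""
    alpha, delta, beta = n - 2, n - 3, n - 1
    t = [[int(r == c) for c in range(p + 1)] for r in range(p + 1)]
    for i in word:
        t = matmul(generator(p, i, alpha, delta, beta), t)
    # Paper's sign convention: x_m' = a_m0 x_0 - sum_j a_mj x_j.
    a = [[row[0]] + [-v for v in row[1:]] for row in t]
    idx = range(1, p + 1)
    assert all(a[i][0] % delta == 0 for i in idx)
    assert all(a[0][j] % beta == 0 for j in idx)
    d = {i: a[i][0] // delta for i in idx}
    b = {j: a[0][j] // beta for j in idx}
    a00 = a[0][0]
    sum_d2 = sum(v * v for v in d.values())
    assert a00 * a00 - 1 == delta * beta * sum_d2            # (2) with (3)
    assert sum_d2 == sum(v * v for v in b.values())          # (3), first row
    for j in idx:
        assert sum(a[i][j] ** 2 for i in idx) == delta * beta * b[j] ** 2 + 1
        assert sum(d[i] * a[i][j] for i in idx) == b[j] * a00
        assert sum(a[i][j] for i in idx) == (alpha + 1) * b[j] - 1   # (5)
    for i in idx:
        assert sum(a[i][j] for j in idx) == (alpha + 1) * d[i] - 1   # (5)
    assert (alpha - 1) * sum(d.values()) == a00 - 1                  # (5)
    det = (-1) ** len(word)
    assert (a00 - det) % (alpha + 1) == 0                            # (7)
    mu = (a00 - det) // (alpha + 1)
    assert (mu - sum(b.values())) % 2 == 0                           # parity in (8)
    return True
```

For instance, `check_word(5, 2, [1, 2])` examines the product of two generators with $\alpha = 3$, $\delta = 2$, $\beta = 4$. The sketch tests only the parity of $\mu - \sum_j b_j$, which is what the proof above establishes.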













5. De Jonquières subgroups of $g_p(r, \epsilon, e)$. We have found in 3 that when $e = -1$, the groups $g_p(r, \epsilon, -1)$ and $g_p(p - r, \epsilon, -1)$ are simply isomorphic in such fashion that, if $i_1 \cdots i_r$ and $j_1 \cdots j_{p-r}$ are complementary sets of indices from $1, \dots, p$, the generators $I_{i_1 \dots i_r}$ and $I_{j_1 \dots j_{p-r}}$ of the two groups correspond. If we take in $g_p(p - r, \epsilon, -1)$ that subgroup mentioned in 2 (6) which leaves $x_1, \dots, x_k$ unaltered, the isomorphic subgroup in $g_p(r, \epsilon, -1)$ is generated by involutions $I_{1 \dots k\, i_1 \dots i_{r-k}}$, where the indices $i_1 \cdots i_{r-k}$ are selected from $k + 1, \dots, p$. But this set of generators will generate a subgroup whether $e$ be $1$ or $-1$. We call this type of subgroup a de Jonquières subgroup because of its first appearance in connection with de Jonquières planar transformations.
(1) The de Jonquières subgroup of $g_p(r, \epsilon, e)$ generated by the involutions $I_{1 \dots k\, i_1 \dots i_{r-k}}$ is simply isomorphic with $g_{p-k}(r - k, \epsilon, e)$ $(k < r)$.

It is clear that, due to the relations $x_m' - x_n' = (-e)^g(x_m - x_n)$ $[m, n = 1, \dots, k;\ m < n]$, which persist under repeated application ($g$ times) of the given generators, the elements of the subgroup will have like rows, $x_1', \dots, x_k'$, except in the rectangle of coefficients under $x_1, \dots, x_k$. In this rectangle of coefficients the principal diagonal terms will have the additive part $(-e)^g$. It will thus be sufficient to have the sum of these rows and we introduce the variable
(2) $\sigma = x_1 + \cdots + x_k.$

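A typical generator then reads
(3)
$$x_0' = (r\epsilon + e)x_0 - \epsilon(r\epsilon + 2e)\sigma - \epsilon(r\epsilon + 2e)(x_{k+1} + \cdots + x_r),$$
$$\sigma' = kx_0 - (k\epsilon + e)\sigma - k\epsilon(x_{k+1} + \cdots + x_r),$$
$$x_j' = x_0 - \epsilon\sigma - \epsilon(x_{k+1} + \cdots + x_r) - e x_j \quad (j = k + 1, \dots, r).$$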


This transformation on the $r - k + 2$ variables $x_0, \sigma, x_j$ is itself involutorial. With generators of the type (3) we make the following change of variables:
(4) $t = kx_0 - (r\epsilon + 2e)\sigma, \quad z_0 = x_0 - \epsilon\sigma, \quad z_j = x_j \quad (j = k + 1, \dots, p).$
The transformation inverse to (4) is
(5) $\{(r - k)\epsilon + 2e\}x_0 = (r\epsilon + 2e)z_0 - \epsilon t, \quad \{(r - k)\epsilon + 2e\}\sigma = kz_0 - t, \quad x_j = z_j.$


In these new variables the equation of the generator (3) is
(6)
$$t' = (-e)t, \qquad z_l' = z_l \quad (l = r + 1, \dots, p),$$
$$z_0' = \{(r - k)\epsilon + e\}z_0 - \epsilon\{(r - k)\epsilon + 2e\}(z_{k+1} + \cdots + z_r),$$
$$z_j' = z_0 - \epsilon(z_{k+1} + \cdots + z_r) - e z_j \quad (j = k + 1, \dots, r).$$


Thus to the generator (3) of the de Jonquières subgroup there corresponds a generator $I_{k+1, \dots, r}$ of the $g_{p-k}(r - k, \epsilon, e)$, and thereby to every element of the one group there corresponds an element of the other.

It is of interest to have the expression for the element of $g_p(r, \epsilon, e)$ which corresponds to a given element of $g_{p-k}(r - k, \epsilon, e)$. Let then the generic element of $g_{p-k}(r - k, \epsilon, e)$ be
(7)
$$z_0' = \beta_{00}z_0 - \sum_j \beta_{0j}z_j, \qquad z_i' = \beta_{i0}z_0 - \sum_j \beta_{ij}z_j \quad (i, j = k + 1, \dots, p),$$
and let it be generated by $g$ generators. According to 4 (2), (6)
(8) $\beta_{00} = (-e)^g + \mu'\{(r - k)\epsilon + 2e\}, \qquad \beta_{0j} = \{(r - k)\epsilon + 2e\}\epsilon b_j.$
We have from (5) that $\{(r - k)\epsilon + 2e\}x_0' = (r\epsilon + 2e)z_0' - \epsilon t'$. On replacing $z_0'$ in this from (7), and $t'$ by $(-e)^g t$ [cf. (6)], then making use of (4) and applying the relations (8), we can factor out $\{(r - k)\epsilon + 2e\}$ to obtain the value of $x_0'$ in (9). The value of $\sigma'$, similarly obtained, is
$$\sigma' = k\mu' x_0 - [\epsilon k\mu' - (-e)^g]\sigma - k\epsilon\sum_j b_j x_j \quad (j = k + 1, \dots, p).$$
In the light of our introductory remarks this yields the value of $x_i'$ in (9). The remaining formula in
(9)
$$x_0' = [(-e)^g + \mu'(r\epsilon + 2e)]x_0 - \epsilon(r\epsilon + 2e)\mu'(x_1 + \cdots + x_k) - (r\epsilon + 2e)\epsilon\sum_j b_j x_j,$$
$$x_i' = \mu' x_0 - \epsilon\mu'(x_1 + \cdots + x_k) + (-e)^g x_i - \epsilon\sum_j b_j x_j,$$
$$x_l' = \beta_{l0}x_0 - \epsilon\beta_{l0}(x_1 + \cdots + x_k) - \sum_j \beta_{lj}x_j$$
$$(i = 1, \dots, k;\ j, l = k + 1, \dots, p)$$
is almost immediate. Hence
(10) If a generic element of $g_{p-k}(r - k, \epsilon, e)$ is given as in (7), (8), then (9) is a generic element of a de Jonquières subgroup of a $g_p(r, \epsilon, e)$.

In another connection I have called this process of passing from a given group to a subgroup of a more comprehensive group, the "dilation" of the given group (cf. ³ §4).

In the above we have set the limit $k < r$. If however $k = r - 1$, the $g_{p-k}(r - k, \epsilon, e)$ would be a $g_{p-k}(\alpha)$ with only one free variable in the generators. This case would however be included under the above argument by setting
(11) $g_{p-k}(1, \epsilon, e) = g_{p-k}(\alpha = \epsilon + e), \qquad \epsilon \geq 3 \text{ if } e = -1, \quad \epsilon \geq 1 \text{ if } e = 1.$


³ A. B. Coble, The ten nodes of the rational sextic and the Cayley symmetroid, Amer. Jour., vol. 41 (1919), pp. 243-265.













6. Symmetric transformations and related theorems. It may happen that the groups $g_p(\alpha)$, $g_p(r, \epsilon, e)$ contain other elements of the type 1 (1) than the generators. Such elements are called "symmetric transformations". Thus the $g_p(3, 1, -1)$ generated by elements of type 1 (2), whose elements represent the effect of Cremona transformations in the plane, contains three such symmetric transformations corresponding respectively to Cremona transformations of orders 5, 8, 17 with 6, 7, 8 F-points of orders 2, 3, 6. Such symmetric transformations in a given group will generate a $g_p(r', \epsilon', e')$ which is a subgroup of the given group and which therefore has the same invariants, $Q$, $L$.

We cannot assume that $\delta = 1$ in these symmetric transformations, and will therefore denote them more specifically by $I(r', \alpha', \delta', \beta', \epsilon', e')$, where $\alpha' = r'\epsilon' + e'$, and $\delta'\beta' = \epsilon'(r'\epsilon' + 2e')$. Consider first the $Q$, $L$ of $g_p(\alpha)$ in 2 (3).


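They will be unaltered by this $I$ if
$$(\alpha' + 1) = \delta'(\alpha + 1), \qquad \beta' = (\alpha^2 - 1)\delta', \qquad \alpha' - 1 = \delta' r'(\alpha - 1).$$

From the first and last we see that $\delta'$ is a divisor of 2. If $\delta' = 1$, $\alpha' = \alpha$, $r' = 1$, and $\beta' = \alpha^2 - 1$. This yields merely a generator $I_i$ of $g_p(\alpha)$. If $\delta' = 2$, $\alpha' = 2\alpha + 1$, and $\alpha = r'(\alpha - 1)$, whence $\alpha = r' = 2$, $\beta' = 6$. From $\epsilon'(r'\epsilon' + 2e') = \beta'\delta' = 12$, and $\alpha' = r'\epsilon' + e' = 5$, we get $\epsilon'(5 + e') = 12$, $2\epsilon' + e' = 5$. Thus, for $e' = \pm 1$, we have two solutions, $e' = -1$, $\epsilon' = 3$, and $e' = 1$, $\epsilon' = 2$. These solutions yield two matrices of type 1 (1):

(1) $$M: \begin{pmatrix} 5 & -6 & -6 \\ 2 & -2 & -3 \\ 2 & -3 & -2 \end{pmatrix}, \qquad M': \begin{pmatrix} 5 & -6 & -6 \\ 2 & -3 & -2 \\ 2 & -2 & -3 \end{pmatrix}.$$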


If we examine the group $g_2(\alpha = 2)$ with generators $I_1$, $I_2$, we find that $I_1I_2$ has the period three, whence $g_2(2)$ is dihedral of order six. The remaining element of period two is $I_1I_2I_1 = I_2I_1I_2$ with matrix $M$. Naturally an element with matrix $M'$ will not occur in the $g_2(2)$. Hence

(2) The $g_p(2)$ is the only $g_p(\alpha)$ with a symmetric transformation. The $g_p(2, 3, -1)$ generated by this transformation with matrix $M$ is a subgroup of $g_p(2)$. In particular $g_2(2)$ is dihedral of order six, and $M$ is in the conjugate set of three involutions.
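
The statements about $g_2(2)$ can be confirmed by direct matrix arithmetic. The sketch below is illustrative and not from the paper: it assumes the generators of $g_2(2)$ with $\alpha = 2$, $\delta = 1$, $\beta = 3$, written as $3 \times 3$ integer matrices acting on $(x_0, x_1, x_2)$, and closes them under multiplication.

```python
I1 = ((2, -3, 0), (1, -2, 0), (0, 0, 1))   # x_0' = 2x_0 - 3x_1, x_1' = x_0 - 2x_1
I2 = ((2, 0, -3), (0, 1, 0), (1, 0, -2))   # x_0' = 2x_0 - 3x_2, x_2' = x_0 - 2x_2
IDENTITY = ((1, 0, 0), (0, 1, 0), (0, 0, 1))
M = ((5, -6, -6), (2, -2, -3), (2, -3, -2))

def mul(a, b):
    return tuple(tuple(sum(a[r][k] * b[k][c] for k in range(3)) for c in range(3))
                 for r in range(3))

def closure(generators):
    """All finite products of the generators (breadth-first closure)."""
    seen, frontier = {IDENTITY}, [IDENTITY]
    while frontier:
        nxt = []
        for g in frontier:
            for s in generators:
                h = mul(g, s)
                if h not in seen:
                    seen.add(h)
                    nxt.append(h)
        frontier = nxt
    return seen

group = closure([I1, I2])
triple = mul(mul(I1, I2), I1)
```

Here `group` has six elements, `triple` equals both `M` and the product $I_2I_1I_2$, and the cube of $I_1I_2$ is the identity.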

If the $g_p(2, 3, -1)$ were amplified by introducing the permutation group of $x_1, \dots, x_p$, the larger group so obtained would contain the $g_p(2, 2, 1)$ generated by involutions $M'$, since $M'$ is $M$ followed or preceded by the interchange of $x_1$, $x_2$. We have not of course proved that $g_p(2, 3, -1)$ does not contain $\Pi_{p!}$. This however seems to be unlikely. We prove later that, when $\epsilon = 1$, either $\Pi_{p!}$ or its even subgroup is contained in $g_p(r, 1, e)$.

Suppose now that $r > 1$, and let $g_p(r, \epsilon, e)$ be generated as in 2 (4). We ask for generating involutions with $r' \geq r$, $\epsilon'$, $e'$, and $\delta'\beta' = \epsilon'(r'\epsilon' + 2e')$, which leave the $Q$, $L$ in 2 (4) unaltered. In order to bring in $\alpha$, $\alpha'$ we observe that, since $e^2 = e'^2 = 1$ and $\alpha = r\epsilon + e$, $\alpha' = r'\epsilon' + e'$,
$$r\epsilon(r\epsilon + 2e) = r\beta = \alpha^2 - 1,$$
$$r'\epsilon'(r'\epsilon' + 2e') = r'\delta'\beta' = \alpha'^2 - 1.$$
It is also convenient to write $Q$, $L$ as
$$rQ = rx_0^2 - (\alpha^2 - 1)(x_1^2 + \cdots + x_p^2),$$
$$L = rx_0 - (\alpha - 1)(x_1 + \cdots + x_p).$$
Then the conditions for invariance ultimately reduce to the following three:
(3)
$$(r' - r)(\alpha'\alpha - 1) = (\alpha' - \alpha)(r' + r),$$
$$\delta' = (\alpha' + 1)/(\alpha + 1), \qquad r\beta' = (\alpha' + 1)(\alpha - 1).$$


Since $r' \geq r$, we first consider $r' = r$. Then from (3), $\alpha' = \alpha$, $\delta' = 1$, $\beta' = \beta$. Also, from $\alpha' = \alpha$, we have $r(\epsilon' - \epsilon) = e - e'$. The obvious case is $\epsilon' = \epsilon$, $e' = e$, which yields no new generator. But it may be that $e' = -e$, or $r(\epsilon' - \epsilon) = 2e$. Since $e = \pm 1$, and $r \geq 2$, then $r = 2$, $\epsilon' = \epsilon + e$, $\epsilon' + e' = \epsilon$. Thus the new involution is merely the generator $I_{12}$ with the last two rows (or the last two columns) interchanged as $M$, $M'$ above. As noted above, $g_p(2, \epsilon', -e)$ generated by involutions $I'_{ij}$ is not the same as the $g_p(2, \epsilon, e)$ generated by involutions $I_{ij}$. We also have excluded in 1 (5) the case $\epsilon = 1$, $e = -1$.

Suppose now that $r' > r$, say $r' = r + i$ $(i \geq 1)$. Suppose also that $e = 1$. Then the first equation (3) reads
$$\alpha'(i\epsilon - 2) = -\epsilon(2r + i) - 2.$$
Hence $i\epsilon - 2 < 0$, i.e., $i = 1$, $\epsilon = 1$. Hence $\alpha = r + 1$, $r' = r + 1$, $\alpha' = 2r + 3$, $\delta' = 2$, $\beta' = 2r + 4$. From $\alpha' = 2r + 3 = r'\epsilon' + e' = (r + 1)\epsilon' + e'$, we have $e' = 1$, $\epsilon' = 2$, since $e' = -1$ would yield a fractional $\epsilon'$. This is the case (c) of Theorem (4).

Suppose that $r' > r$ as before, and that $e = -1$. Then the first equation (3) reads
$$\alpha'(ir\epsilon - 2r - 2i) = -2r(r\epsilon - 1) - i(r\epsilon - 2).$$
Since $r \geq 2$, $\epsilon \geq 1$, $i \geq 1$, then $ir\epsilon - 2r - 2i < 0$. If $t$, $t'$ are respectively the larger and smaller of $r$, $i$, then $t'\epsilon - 4 < 0$ or $\epsilon \leq 3$. If $\epsilon = 3$, $t' = i = 1$, since $r \geq 2$. But then $r - 2 < 0$. Again if $\epsilon = 2$, $t' = i = 1$. This yields a case, $\epsilon = 2$, $r' = r + 1$, $\alpha' = 2r^2 - 1$, $\alpha = 2r - 1$, $\delta' = r$, $\beta' = 4r(r - 1)$, listed as (d) in Theorem (4). From $\alpha' = r'\epsilon' + e'$, or $2r^2 - 1 = (r + 1)\epsilon' + e'$, we find that $e' = 1$, $\epsilon' = 2(r - 1)$.

There remain the cases for which $e = -1$ and $\epsilon = 1$. These however are the groups $g_p(r, \epsilon, e)$ attached to the groups of "regular Cremona transformations" [cf. ¹II, §4], and for these groups the symmetric transformations have already been determined [cf. ¹II, §5, pp. 368-9]. These are listed in Theorem (4) as (e), ..., (h).










(4) The following sets of involutions $I(r, \alpha, \delta, \beta, \epsilon, e)$ have the same invariant $Q$, $L$:

(a1): $(1, 2, 1, 3, -, -)$
(a2): $(2, 5, 2, 6, 3, -1)$
(a3): $(2, 5, 2, 6, 2, 1)$

(b1): $(2, 2\epsilon + e, 1, 2\epsilon(\epsilon + e), \epsilon, e)$
(b2): $(2, 2\epsilon + e, 1, 2\epsilon(\epsilon + e), \epsilon + e, -e)$

(c1): $(r, r + 1, 1, r + 2, 1, 1)$
(c2): $(r + 1, 2r + 3, 2, 2(r + 2), 2, 1)$

(d1): $(r, 2r - 1, 1, 4(r - 1), 2, -1)$
(d2): $(r + 1, 2r^2 - 1, r, 4r(r - 1), 2(r - 1), 1)$

(e1): $(3, 2, 1, 1, 1, -1)$
(e2): $(6, 5, 2, 2, 1, -1)$
(e3): $(7, 8, 3, 3, 1, 1)$
(e4): $(8, 17, 6, 6, 2, 1)$

(f1): $(4, 3, 1, 2, 1, -1)$
(f2): $(6, 7, 2, 4, 1, 1)$
(f3): $(7, 15, 4, 8, 2, 1)$

(g1): $(5, 4, 1, 3, 1, -1)$
(g2): $(8, 49, 10, 30, 6, 1)$

(h1): $(2k, 2k - 1, 1, 2(k - 1), 1, -1)$
(h2): $(2(k + 1), 2k^2 - 1, k, 2k(k - 1), k - 1, 1)$.
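
The entries of Theorem (4) can be cross-checked against the relations used to derive them. The sketch below is an illustration under stated assumptions, not the paper's procedure: it takes each tuple as $(r, \alpha, \delta, \beta, \epsilon, e)$, tests $r\delta\beta = \alpha^2 - 1$, $\alpha = r\epsilon + e$ and $\delta\beta = \epsilon(r\epsilon + 2e)$, and compares every later member of a family with the first member (which has $\delta = 1$) by the three conditions (3). The parametric families are sampled at arbitrary values of $r$, $k$, $\epsilon$, $e$.

```python
def row_is_consistent(r, alpha, delta, beta, eps=None, e=None):
    """r*delta*beta = alpha^2 - 1, and, when eps, e are given, the defining relations."""
    ok = r * delta * beta == alpha * alpha - 1
    if eps is not None:
        ok = ok and alpha == r * eps + e and delta * beta == eps * (r * eps + 2 * e)
    return ok

def same_invariants(base, other):
    """Conditions 6 (3) for `other` = I(r', ...) relative to `base` (delta = 1)."""
    r, a, d = base[0], base[1], base[2]
    r2, a2, d2, b2 = other[0], other[1], other[2], other[3]
    return (d == 1
            and d2 * (a + 1) == a2 + 1
            and r * b2 == (a2 + 1) * (a - 1)
            and (r2 - r) * (a2 * a - 1) == (a2 - a) * (r2 + r))

def families(r=3, k=3, eps=2, e=1):
    """Theorem (4), with the parametric rows sampled at the given values."""
    return {
        "a": [(1, 2, 1, 3, None, None), (2, 5, 2, 6, 3, -1), (2, 5, 2, 6, 2, 1)],
        "b": [(2, 2 * eps + e, 1, 2 * eps * (eps + e), eps, e),
              (2, 2 * eps + e, 1, 2 * eps * (eps + e), eps + e, -e)],
        "c": [(r, r + 1, 1, r + 2, 1, 1), (r + 1, 2 * r + 3, 2, 2 * (r + 2), 2, 1)],
        "d": [(r, 2 * r - 1, 1, 4 * (r - 1), 2, -1),
              (r + 1, 2 * r * r - 1, r, 4 * r * (r - 1), 2 * (r - 1), 1)],
        "e": [(3, 2, 1, 1, 1, -1), (6, 5, 2, 2, 1, -1), (7, 8, 3, 3, 1, 1),
              (8, 17, 6, 6, 2, 1)],
        "f": [(4, 3, 1, 2, 1, -1), (6, 7, 2, 4, 1, 1), (7, 15, 4, 8, 2, 1)],
        "g": [(5, 4, 1, 3, 1, -1), (8, 49, 10, 30, 6, 1)],
        "h": [(2 * k, 2 * k - 1, 1, 2 * (k - 1), 1, -1),
              (2 * (k + 1), 2 * k * k - 1, k, 2 * k * (k - 1), k - 1, 1)],
    }

def table_is_consistent(**params):
    table = families(**params)
    return all(row_is_consistent(*row) and same_invariants(rows[0], row)
               for rows in table.values() for row in rows)
```

A call such as `table_is_consistent(r=4, k=5, eps=3, e=-1)` repeats the check at other sample values.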


We have seen that the cases (a), (b) were not of special interest, and that cases (e), ..., (h) have been discussed. We consider then more particularly the cases (c), (d). In both we pass from $x_1, \dots, x_r$ to $x_1, \dots, x_{r+1}$. Let then $J_k$ be the generating involution $I_{1, \dots, k-1, k+1, \dots, r+1}$ $(k = 1, \dots, r + 1)$. With $\beta = \epsilon(r\epsilon + 2e)$, the product $J_{r+1}J_r$ has the matrix

(5) $$J_{r+1}J_r: \begin{pmatrix} 1 + \beta & -\beta\epsilon & -\beta\epsilon & \cdots & -\beta(\epsilon + e) & -\beta \\ \epsilon & -(\epsilon^2 - 1) & -\epsilon^2 & \cdots & -\epsilon(\epsilon + e) & -\epsilon \\ \vdots & & & & & \vdots \\ 1 & -\epsilon & -\epsilon & \cdots & -(\epsilon + e) & 0 \\ \epsilon + e & -\epsilon(\epsilon + e) & -\epsilon(\epsilon + e) & \cdots & -\epsilon(\epsilon + 2e) & -(\epsilon + e) \end{pmatrix}.$$

If in this $\epsilon + e = 1$, since $\epsilon > 0$, $e = \pm 1$, then $\epsilon = 2$, $e = -1$, $\beta = 4(r - 1)$, as in case (d1). From an inspection of the last two rows and last two columns it is clear that then

(6) $J_{r+1}J_r = J_rJ_{r+1}.$

Hence

(7) In the case (d1) of (4), the generating involutions are permutable. Their product $J_1J_2 \cdots J_{r+1}$ is a symmetric involution necessarily of type (d2).













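In connection with the cases (c) we use the element

(8) $$J_{r+1}J_rJ_{r+1}: \begin{pmatrix} \alpha' & -\beta(\epsilon e + \eta) & -\beta(\epsilon e + \eta) & \cdots & -\beta(2\epsilon e - e + \eta) & -\beta(\epsilon + e) \\ \epsilon e + \eta & -(\epsilon^2 e + e + \epsilon\eta) & -\epsilon(\epsilon e + \eta) & \cdots & -(\epsilon + e)(\epsilon e + \eta - 1) & -\epsilon(\epsilon + e) \\ \epsilon e + \eta & -\epsilon(\epsilon e + \eta) & -(\epsilon^2 e + e + \epsilon\eta) & \cdots & -(\epsilon + e)(\epsilon e + \eta - 1) & -\epsilon(\epsilon + e) \\ \vdots & & & & & \vdots \\ 2\epsilon e - e + \eta & -\epsilon(2\epsilon e - e + \eta) & -\epsilon(2\epsilon e - e + \eta) & \cdots & -(\epsilon + e)(2\epsilon e - e + \eta - 1) & -\epsilon(\epsilon + 2e) \\ \epsilon + e & -\epsilon(\epsilon + e) & -\epsilon(\epsilon + e) & \cdots & -\epsilon(\epsilon + 2e) & -(\epsilon + e) \end{pmatrix},$$

where $\alpha' = r\epsilon + e + \beta(\epsilon + e - 1)$, $\eta = \epsilon^2 - \epsilon + 1$, $\beta = \epsilon(r\epsilon + 2e)$. This matrix will be of the typically involutorial form only if the elements to the right of $\alpha'$ are all equal. This requires
$$\epsilon + e = \epsilon^2 + \epsilon e - \epsilon + 1 = 2\epsilon e - e + \epsilon^2 - \epsilon + 1,$$
whence $\epsilon = 1$. Then also the elements below $\alpha'$ are equal. Moreover the last two elements in the second row must be equal, which requires further that $e = 1$. Then (8) is the involution $I_{1, \dots, r+1}$ described in (c2) of (4), except that either the last two rows or the last two columns are interchanged. Hence

(9) $J_{r+1}J_rJ_{r+1} = J_rJ_{r+1}J_r = I_{1, \dots, r+1} \cdot (r, r + 1) = (r, r + 1) \cdot I_{1, \dots, r+1}.$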


From this there follows at once that
(10) In the $g_{r+1}(r, 1, 1)$, the product $J_iJ_j$ $(i, j = 1, \dots, r + 1)$ has the period three, and the transform $J_iJ_jJ_i$ the period two and matrix of type (8). This linear group contains the subgroup of even permutations of the variables $x_1, \dots, x_{r+1}$, the cyclic elements, $J_iJ_j$ and $(ijk)$, of period three, being in the same conjugate set.

For $J_{r+1}J_rJ_{r+1} = I_{1, \dots, r+1} \cdot (r, r + 1)$, $J_{r-1}J_rJ_{r-1} = (r, r - 1) \cdot I_{1, \dots, r+1}$; whence $J_{r-1}J_rJ_{r-1}J_{r+1}J_rJ_{r+1} = (r, r - 1) \cdot (r, r + 1) = (r - 1, r + 1, r)$.

An examination of the matrix (8) for the case $\epsilon = 1$, $e = -1$ shows that then $J_{r+1}J_rJ_{r+1} = (r, r + 1)$. Hence
(11) The group $g_{r+1}(r, 1, -1)$ contains as a subgroup the permutation group $\Pi_{(r+1)!}$ of $x_1, \dots, x_{r+1}$, the transpositions $(x_ix_j)$ being in the same conjugate set as the generators.

Since $g_{p'}(r, \epsilon, e)$ is a subgroup of $g_p(r, \epsilon, e)$ $[r < p' < p]$, there follows that
(12) The groups $g_p(r, 1, e)$ $(p > r)$ contain as a subgroup either the symmetric group $\Pi_{p!}$, or the alternating group $\Pi_{p!/2}$, of the variables $x_1, \dots, x_p$ according as $e = -1$ or $e = 1$.
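
Theorems (7), (10) and (11) lend themselves to a direct machine check for small $r$. The following sketch is not from the paper. It assumes that a symmetric involution $I(r, \alpha, \delta, \beta, \epsilon, e)$ on an index set acts by $x_0' = \alpha x_0 - \beta\sum x_i$, $x_j' = \delta x_0 - \epsilon\sum x_i - e x_j$, with $\alpha = r\epsilon + e$ and $\delta\beta = \epsilon(r\epsilon + 2e)$, and all function names are illustrative.

```python
from itertools import combinations

def identity(n):
    return [[int(r == c) for c in range(n)] for r in range(n)]

def matmul(a, b):
    n = len(a)
    return [[sum(a[r][k] * b[k][c] for k in range(n)) for c in range(n)]
            for r in range(n)]

def involution(p, idx, eps, e, delta=1):
    """Assumed form of I(r, alpha, delta, beta, eps, e) on the index set idx."""
    r = len(idx)
    alpha = r * eps + e
    beta = eps * (r * eps + 2 * e) // delta
    m = identity(p + 1)
    m[0][0] = alpha
    for j in idx:
        m[0][j] = -beta
    for i in idx:
        m[i][0] = delta
        for j in idx:
            m[i][j] = -eps - (e if i == j else 0)
    return m

def J(r, k, eps, e):
    """J_k of g_{r+1}(r, eps, e): the involution on the indices 1..r+1 other than k."""
    return involution(r + 1, [i for i in range(1, r + 2) if i != k], eps, e)

def transposition(p, a, b):
    m = identity(p + 1)
    m[a][a] = m[b][b] = 0
    m[a][b] = m[b][a] = 1
    return m

def generators_commute(r):
    """Theorem (7): in case (d1), eps = 2, e = -1, the J_k are permutable."""
    return all(matmul(J(r, a, 2, -1), J(r, b, 2, -1)) ==
               matmul(J(r, b, 2, -1), J(r, a, 2, -1))
               for a, b in combinations(range(1, r + 2), 2))

def products_have_period_three(r):
    """Theorem (10): in g_{r+1}(r, 1, 1) each J_a J_b has period three."""
    for a, b in combinations(range(1, r + 2), 2):
        c = matmul(J(r, a, 1, 1), J(r, b, 1, 1))
        if c == identity(r + 2) or matmul(matmul(c, c), c) != identity(r + 2):
            return False
    return True

def transforms_are_transpositions(r):
    """Theorem (11): for eps = 1, e = -1 the transform J_a J_b J_a is (a, b)."""
    return all(matmul(matmul(J(r, a, 1, -1), J(r, b, 1, -1)), J(r, a, 1, -1)) ==
               transposition(r + 1, a, b)
               for a, b in combinations(range(1, r + 2), 2))
```

For $r = 2$ in case (d1), the product `matmul(matmul(J(2, 1, 2, -1), J(2, 2, 2, -1)), J(2, 3, 2, -1))` equals `involution(3, [1, 2, 3], 2, 1, delta=2)`, the involution of type (d2).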













For ¢ = —1, this was already known in connection with the regular Cremona 
groups. 


7. Finite and infinite groups $g_p(\alpha)$ and $g_p(r, \epsilon, e)$. There is a very simple geometrical criterion which separates the finite and infinite cases of the groups under consideration, namely:

(1) If in the space $S_p(x_0, x_1, \dots, x_p)$, the linear space $L = 0$ cuts the quadric $Q = 0$ in a quadric without real points, then the group $g_p$ is finite; otherwise, it is infinite.

To apply this criterion we write $Q = 0$, $L = 0$, using the notation of 1 (6) in the form $x_1^2 + \cdots + x_p^2 = \delta/\beta$, $(x_1 + \cdots + x_p)/\sqrt{p} = r\delta/(\alpha - 1)\sqrt{p}$. In this metric form the condition given in (1) is obviously $\delta/\beta < r^2\delta^2/p(\alpha - 1)^2$. This reduces by virtue of $r\delta\beta = \alpha^2 - 1$ and $\alpha > 1$ to the form

(2) $p(\alpha - 1) < r(\alpha + 1).$

We divide the cases as before into

(2a) $r = 1$, $\quad p < (\alpha + 1)/(\alpha - 1)$;
(2b) $r > 1$, $\ e = -1$, $\quad p < r \cdot r\epsilon/(r\epsilon - 2)$;
(2c) $r > 1$, $\ e = 1$, $\quad p < (r\epsilon + 2)/\epsilon$.


In the case (2a) with $p \geq 2$, $\alpha \geq 2$ and $p < 1 + 2/(\alpha - 1)$, there is only one solution $p = 2$, $\alpha = 2$.

In the case (2b) with $p > r > 1$, $\epsilon \geq 1$ and $p < r + 2r/(r\epsilon - 2)$, since $p - r \geq 1$, then $1 < 2r/(r\epsilon - 2)$, or $r(\epsilon - 2) < 2$. Hence $\epsilon = 2$ or $\epsilon = 1$. If $\epsilon = 2$, $p < r + 1 + 1/(r - 1)$ or $p = r + 1$. If $\epsilon = 1$, the case $r = 2$ being excluded, $p < r + 2 + 4/(r - 2)$. Thus for $r = 3$, $p < 9$; for $r = 4$, $p < 8$; for $r = 5$, $p < 9$; and for $r > 5$, $p = r + 1, r + 2$.

In the case (2c) with $p > r > 1$, $\epsilon \geq 1$, and $p < r + 2/\epsilon$, there is only one solution $\epsilon = 1$, $p = r + 1$.
(3) The groups $g_p(\alpha)$, $g_p(r, \epsilon, e)$ are finite in the following cases: (a) $g_p(\alpha) = g_2(2)$; (b) $g_p(r, \epsilon, -1)$ for $\epsilon = 1, 2$ and $p = r + 1$; for $\epsilon = 1$ and $p = r + 2$; for $\epsilon = 1$, $r = 3$, and $p = 6, 7, 8$; for $\epsilon = 1$, $r = 4$ and $p = 7$; for $\epsilon = 1$, $r = 5$ and $p = 8$; (c) $g_p(r, \epsilon, 1)$ for $\epsilon = 1$, $p = r + 1$. In all other cases the groups are infinite.
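
The case analysis leading to Theorem (3) is a finite arithmetic check on the inequality (2) and can be repeated mechanically. The sketch below is illustrative only: it assumes $\alpha = r\epsilon + e$ with $\alpha \geq 2$ (which excludes $r = 2$, $\epsilon = 1$, $e = -1$), scans a bounded range of parameters chosen arbitrarily, and compares the inequality with the list stated in (3).

```python
def criterion(p, r, alpha):
    """Inequality 7 (2): p(alpha - 1) < r(alpha + 1)."""
    return p * (alpha - 1) < r * (alpha + 1)

def listed_finite(p, r, eps, e):
    """Cases (b), (c) of Theorem 7 (3), for r > 1."""
    if e == -1:
        return ((eps in (1, 2) and p == r + 1)
                or (eps == 1 and p == r + 2)
                or (eps == 1 and (r, p) in {(3, 6), (3, 7), (3, 8), (4, 7), (5, 8)}))
    return eps == 1 and p == r + 1

def mismatches(max_r=12, max_eps=4, span=12):
    """Parameter sets (p, r, eps, e) where the inequality and the list disagree."""
    bad = []
    for r in range(2, max_r + 1):
        for eps in range(1, max_eps + 1):
            for e in (-1, 1):
                alpha = r * eps + e
                if alpha < 2:          # the excluded case r = 2, eps = 1, e = -1
                    continue
                for p in range(r + 1, r + span + 1):
                    if criterion(p, r, alpha) != listed_finite(p, r, eps, e):
                        bad.append((p, r, eps, e))
    return bad

def finite_case_a(max_p=10, max_alpha=10):
    """Solutions of (2a): r = 1, p >= 2, alpha >= 2."""
    return [(p, a) for p in range(2, max_p + 1) for a in range(2, max_alpha + 1)
            if criterion(p, 1, a)]
```

Over the scanned range `mismatches()` is empty and `finite_case_a()` returns only `(2, 2)`, in agreement with (3).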

In this section we prove that the cases mentioned are infinite, leaving a discussion of the finite cases for the next section. We observe first that the cases (b) for $\epsilon = 1$ are known in connection with the regular Cremona transformations. It has been proved [cf. ¹II, §5, pp. 375, 377] that $g_9(3, 1, -1)$ and $g_8(4, 1, -1)$ are infinite. Hence $g_{9+k}(3, 1, -1)$ and $g_{8+l}(4, 1, -1)$ also are infinite [cf. 2 (6)]. According to 3 (3), their associated groups, $g_{9+k}(6 + k, 1, -1)$ and $g_{8+l}(4 + l, 1, -1)$ are infinite. Again by the use of 2 (6), we see that $g_{9+k+l}(6 + k, 1, -1)$ and $g_{8+l+k}(4 + l, 1, -1)$ are infinite. But these cover all the cases (b) for $\epsilon = 1$ except those asserted to be finite.




There remain to be considered only the cases
(d$_1$) $g_p(\alpha) = g_3(2)$; $\quad$ (d$_2$) $g_2(\alpha \geq 3)$; $\quad$ (d$_3$) $g_{r+2}(r, 2, -1)$;
(d$_4$) $g_{r+1}(r, \epsilon > 2, -1)$; $\quad$ (d$_5$) $g_{r+2}(r, 1, 1)$; $\quad$ (d$_6$) $g_{r+1}(r, \epsilon > 1, 1)$.

In each of these cases we have selected the smallest value of $p$ which yields an infinite group, the larger values being accounted for by 2 (6).

In the case (d$_1$), with $g_3(2)$ and its generators defined as in 2 (3), it is easy to verify that the form
$$M_k = (3k - 1)^2 x_0 - (3k - 1)(3k - 2)x_1 - 3k(3k - 2)x_2 - 3k(3k - 1)x_3$$
is transformed by $(I_1I_2I_3)^2$ into $M_{k+1}$, whence $(I_1I_2I_3)^2$ has not a finite period, and $g_3(2)$ is infinite.
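
The verification asserted for case (d$_1$) is a short computation. In the sketch below, which is not part of the paper, a form $c_0x_0 - c_1x_1 - c_2x_2 - c_3x_3$ is stored as its coefficient list, and the generator $I_i$ of $g_3(2)$ ($\alpha = 2$, $\delta = 1$, $\beta = 3$) is applied by substituting $x_0 \to 2x_0 - 3x_i$, $x_i \to x_0 - 2x_i$. The order in which the six substitutions are applied ($I_1, I_2, I_3, I_1, I_2, I_3$) is an assumption about how the product is to be read.

```python
def substitute(c, i, alpha=2, delta=1, beta=3):
    """Coefficients of the form after x_0 -> alpha x_0 - beta x_i, x_i -> delta x_0 - alpha x_i."""
    out = list(c)
    out[0] = alpha * c[0] - delta * c[i]
    out[i] = beta * c[0] - alpha * c[i]
    return out

def form_M(k):
    """Coefficient list (c_0, c_1, c_2, c_3) of the form M_k."""
    return [(3 * k - 1) ** 2, (3 * k - 1) * (3 * k - 2),
            3 * k * (3 * k - 2), 3 * k * (3 * k - 1)]

def advance(c):
    """Apply the six substitutions making up (I_1 I_2 I_3)^2."""
    for i in (1, 2, 3, 1, 2, 3):
        c = substitute(c, i)
    return c
```

With these definitions `advance(form_M(k))` equals `form_M(k + 1)`, so the leading coefficient $(3k - 1)^2$ grows without bound along the orbit.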

In the case (d$_2$) with $a = (\alpha - 1) \geq 2$, we define sequences of polynomials $P_k$, $C_k$ [cf. ⁴ p. 111] by the recursion formula
(4)
$$P_k = aP_{k-1} - P_{k-2}; \qquad P_{-1} = 0, \quad P_0 = 1, \quad P_1 = a;$$
$$C_k = P_k + P_{k-1}.$$


As immediate consequences of these definitions we find that
(5)
$$\alpha P_k - C_k = P_{k+1}, \qquad (\alpha^2 - 1)P_k - \alpha C_k = C_{k+2},$$
$$\alpha P_{k+1} - C_{k+2} = P_k, \qquad (\alpha^2 - 1)P_{k+1} - \alpha C_{k+2} = C_k.$$


If then we define the forms
(6) $D_k = P_{k+1}x_0 - C_{k+2}x_1 - C_{k+1}x_2, \qquad E_k = P_{k+1}x_0 - C_{k+1}x_1 - C_{k+2}x_2,$
it is easy to verify from (5) that $I_1$ in $g_2(\alpha)$ interchanges $E_{k-1}$, $D_k$, and $I_2$ interchanges $D_{k-1}$, $E_k$; whence $I_1I_2$ transforms $D_k$ into $D_{k+2}$ with respective leading coefficients $P_{k+1}$, $P_{k+3}$. From (4) it follows that $P_k - P_{k-1} \geq P_{k-1} - P_{k-2}$, whence, since $P_1 - P_0 = a - 1 \geq 1$, the polynomials $P_k$ continue to increase and $I_1I_2$ has not a finite period. Thus $g_2(\alpha \geq 3)$ is infinite.
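
The growth argument can be illustrated numerically. The sketch below is an illustration under assumptions, not the paper's computation: it generates $P_k$ from the recursion with $a = \alpha - 1$, and separately tracks the leading coefficient of a form under the alternate substitutions $I_1, I_2$ of a $g_2(\alpha)$ with $\delta = 1$, $\beta = \alpha^2 - 1$, starting from the form $x_0 - x_1 - \alpha x_2$ (a starting point chosen here for illustration).

```python
def p_sequence(a, count):
    """P_0, ..., P_count from P_k = a P_{k-1} - P_{k-2}, P_{-1} = 0, P_0 = 1."""
    seq = [0, 1]
    while len(seq) < count + 2:
        seq.append(a * seq[-1] - seq[-2])
    return seq[1:]

def leading_coefficients(alpha, steps):
    """x_0 coefficients along the orbit of x_0 - x_1 - alpha*x_2 under I_1, I_2, I_1, ...

    Assumes delta = 1, beta = alpha^2 - 1; a form c_0 x_0 - c_1 x_1 - c_2 x_2 is
    stored as [c_0, c_1, c_2].
    """
    c = [1, 1, alpha]
    out = [c[0]]
    for step in range(steps):
        i = 1 if step % 2 == 0 else 2
        c0, ci = c[0], c[i]
        c[0] = alpha * c0 - ci
        c[i] = (alpha * alpha - 1) * c0 - alpha * ci
        out.append(c[0])
    return out

def increments_never_shrink(a, count):
    """P_k - P_{k-1} >= P_{k-1} - P_{k-2} along the computed range."""
    p = [0] + p_sequence(a, count)
    return all(p[k] - p[k - 1] >= p[k - 1] - p[k - 2] for k in range(2, len(p)))
```

For example `p_sequence(3, 4)` gives `[1, 3, 8, 21, 55]`, and `leading_coefficients(alpha, n)` agrees with `p_sequence(alpha - 1, n)` for the values tried.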

In the case (ds), g-42(r, 2, —1) is associated with, and isomorphic to, g,+2 
(2,2, -—1). Sincer > 2,r+2 24. In any case g,,2(2, 2, —1) contains sub- 
groups g,(2, 2, —1) and it is sufficient to show that g,(2, 2, —1) is infinite. Con- 
sider the form 


J, ‘igi ') to — (2k* — 1) a — 2k* me — 4 (' 7 ') (as + 24). 
It is easy to verify that the element J 1273 of g.(2, 2, —1) transforms J; into 
Jiii1. Hence this element is aperiodic and g,(2, 2, —1) is infinite. 

In the case (dy), g-4:(r, € > 2, —1) is associated with g,i;(a = 2) [ef. §3 (4)]. 
Since r = 2, g,4:(@) is either g3(a@) or contains subgroups g;(a). But, according 
to cases (d,) or (de), gs(a = 2) is infinite. 


⁴ S. F. Barber, Planar Cremona transformations, Amer. Jour., vol. 56 (1934), pp. 109-121.













In the case (ds) of g,,2(r, 1, 1) denote the generator J;, ...;, by Ji,., is2- 
Since r 2 2,r + 2 2 4, and generators J,,,-, Jr41, 42 With non-overlapping 
indices exist. Again it is not difficult to verify that the form 


Ly = (*) [mr — am — ++: — a3] —8k 2-1 — (8k —1)2, 


2 
2k 
-4 ) (Xr41 + X42) 


is transformed by (J)... --J 41,742)" into Ly.,. Hence this element of the group 
is aperiodic and the group is infinite. 

In the case (de) of g,4: (r, € > 1, 1) let Jj, ... 4, = Ji,,,. The linear form z, 
transformed alternately by J,.:, J, yields the sequence 


[ro — 271+ -°--- +2, a)] - (e + 1)z, 
(e + 1)[zo — ef, +--+ + 2-1)] — (6 + Ia, — ee + 2)2-41, 
e(e + 1)[to — e(, + --> + r-1)| -- (e+ I)(é + ¢«— 1)z, — ee + 2)241, 


These forms have the type 
7) alto — e(t: + +++ + 2-1)] — ba, — Chr, 

(e+ 2)a-—-b-—c=1. 
If (7) is transformed by J,,; , it acquires coefficients a’, b’, c’, where 
(8) Jara’ = (1 + &a — Bb, b’ = e(e + 2)a — (1 + ©)b, ce’ =. 
The interchange in this of b, ¢ and of 6’, c’ gives the effect of J, upon the form 
(7). But this J, and J,,; in (8) are the generators J2, I; of a g2(a), where 
a = 1 + € = 3 and in which also B = 1,5 = a — 1. Thus, according to the 
result in case (d:), the effect of J,J,,: upon forms (7) is aperiodic, whence this 
element J,J,4; is aperiodic and g,,,(r, € > 1, 1) is infinite. 


8. Finite groups $g_p(\alpha)$ and $g_p(r, \epsilon, e)$. In this section we verify the finiteness asserted in 7 (1), (3) and determine the nature of the finite groups. We take up the cases as they are listed in 7 (3).

With respect to the case (a), or $g_2(2)$, we have seen [cf. 6 (1) et seq.] that this is a dihedral group of order six.

With respect to the case (b) we observe that, apart from $g_{r+1}(r, 2, -1)$, all the instances have the form $g_p(r, 1, -1)$ and therefore have been found in connection with regular Cremona transformations. We list these in order.

The $g_{r+1}(r, 1, -1)$ $(r > 2)$ is a $g_{(r+2)!}$ isomorphic with the symmetric group on $r + 2$ things. This is the linear group associated with Moore's cross-ratio group of Cremona transformations [cf. ⁵].


⁵ E. H. Moore, The cross-ratio group of n! Cremona transformations of order n - 3 in flat space of n - 3 dimensions, Amer. Jour., vol. 22 (1900), pp. 279-291.
















The $g_{r+2}(r, 1, -1)$ $(r > 2)$ is a $g_{(r+2)!\,2^{r+1}}$. It has an invariant Abelian $g_{2^{r+1}}$ whose factor group is isomorphic with the symmetric group. If $r = 2p$, it has an invariant $g_2$ whose factor group of order $(2p + 2)! \cdot 2^{2p}$ is isomorphic with the collineation group which transforms into itself the collineation group of order $2^{2p}$ induced on a hyperelliptic Kummer $p$-way by the addition of the $2^{2p}$ half periods. If $r = 2p - 1$, the group is a subgroup of the one just described [cf. ⁶].

The self-associated [cf. ¹II, §5] $g_6(3, 1, -1)$ is isomorphic with the $g_{51840}$ of the 27 lines on a cubic surface [cf. ¹II, §3].

The two associated groups $g_7(3, 1, -1)$, $g_7(4, 1, -1)$ each have an invariant $g_2$ whose factor group of order $8! \cdot 36$ is isomorphic with the group of the double tangents of a planar quartic [cf. ¹II, §3].

The two associated groups $g_8(3, 1, -1)$, $g_8(5, 1, -1)$, each have an invariant $g_2$ whose factor group of order $10! \cdot 96$ is isomorphic with the group of the tritangent planes of a space sextic of genus four on a quadric cone [cf. ¹II, §3].

In the last instance under case (b), according to 6 (7), the generating involutions, $J_1, \dots, J_{r+1}$, are permutable and the finite $g_{r+1}(r, 2, -1)$ is abelian, of order $2^{r+1}$, and type $(1, 1, \dots, 1)$.

There remains only the case (c) of $g_{r+1}(r, 1, 1)$. The theorems 6 (8), (9), (10) indicate that this group is isomorphic with a symmetric $g_{(r+2)!}$. We make this proof more precise by using the set of $r + 2$ linear forms:


(1)
$$m_0 = -(r + 1)x_0 + (r + 2)(x_1 + \cdots + x_{r+1}),$$
$$m_i = x_0 - (r + 2)x_i \quad (i = 1, \dots, r + 1),$$
$$m_0 + m_1 + \cdots + m_{r+1} = 0.$$
We find that $J_{r+1}$ changes the sign of each of $m_1, \dots, m_r$, and interchanges $m_0$, $m_{r+1}$ with a change of sign in each. Moreover, the transforms of $r + 1$ of these forms, and the invariance of $L$, defines the transformation. Hence $g_{r+1}(r, 1, 1)$ is isomorphic with the permutation $g_{(r+2)!}$ of these $r + 2$ forms. The form $m_0$ is the form $-L'$ of 1 (7) formed for the symmetric element $I$ used in 6 (9). The $g_{r+1}(r, 1, 1)$ does not contain the element $I$. If this were added as an additional generator, the doubled group with invariant $g_2 = 1, I$ would contain the odd as well as the even permutations of $x_1, \dots, x_{r+1}$.

The geometric criterion given in 7 (1) for finiteness recalls the theorem of Minkowski (⁷ p. 185) that a positive quadratic form in $n$ variables cannot admit more than $(2^{n+1} - 2)^n$ integral linear transformations. Hence it can admit only a finite group of such transformations. Yet this theorem can not be applied here directly. For if, say, $x_0$ is eliminated by using the invariant linear relation $L = 0$, our group is converted into one which has rational coefficients [cf. ¹II, §6].


⁶ A. B. Coble, A generalization of the Weddle surface, etc., Amer. Jour., vol. 52 (1930), pp. 439-500.
⁷ H. Minkowski, Geometrie der Zahlen, Leipzig, 1896.













9. The generic element $T$ of the group $g_p(\alpha)$. We have in 4 taken this element of the group $g_p(\alpha)$ generated by $I_i$ $(i = 1, \dots, p)$,

(1) $I_i$: $\quad x_0' = \alpha x_0 - \beta x_i, \quad x_i' = \delta x_0 - \alpha x_i, \quad x_j' = x_j \quad (j = 1, \dots, p;\ j \neq i),$
$\qquad\quad \alpha \geq 2, \quad \beta\delta = \alpha^2 - 1,$

in the form

(2) $T$: $\quad x_m' = a_{m0}x_0 - a_{m1}x_1 - \cdots - a_{mp}x_p \quad (m = 0, 1, \dots, p).$
We have also obtained certain numerical properties [4 (2)] of these coefficients, and certain quadratic and linear relations [4 (3), (5)] satisfied by them. We seek to prove that, with one type of exception noted below, the coefficients $a_{mn}$ $(m, n = 0, 1, \dots, p)$ are positive integers or zero. If $T$ could be attached to a type of Cremona transformation, this could be inferred from the fact that geometric multiplicities are positive or zero. We proceed then to define a type of Cremona transformation determined by $T$.

The ternary de Jonquières transformation of order $n$ has a set of $2(n - 1)$ simple F-points and a single F-point of order $n - 1$. Its effect upon curves with order $t_0$, and multiplicities $y_1, \dots, y_{2(n-1)}$ at the simple F-points, multiplicity $t_1$ at the $(n - 1)$-fold F-point, and multiplicities $t_2, \dots, t_p$ at $p - 1$ further ordinary points is expressed by the equations:

(3) $J_1$:
$$t_0' = nt_0 - (y_1 + \cdots + y_{2(n-1)}) - (n - 1)t_1,$$
$$y_i' = t_0 - y_i - t_1 \quad (i = 1, 2, \dots, 2(n - 1)),$$
$$t_1' = (n - 1)t_0 - (y_1 + \cdots + y_{2(n-1)}) - (n - 2)t_1,$$
$$t_j' = t_j \quad (j = 2, \dots, p).$$

The $t_1'$, $y_i'$ are multiplicities at the inverse F-points of the transformation. If now we form products of such transformations, always taking the simple F-points of the last factor at the $2(n - 1)$ inverse F-points of the preceding product which arose earlier from the simple points of the factors of this product, and not at any time allowing more than $p$ additional F-points to appear from the $(n - 1)$-fold F-points of the factors (i.e., forming products as though the $2(n - 1)$ F-points could be fixed for all factors), we then secure an aggregate of types of products whose effect upon curves is given by the aggregate of elements of the group generated by $J_1, \dots, J_p$, each $J$ being formed like $J_1$ in (3).
Since $y_1, \dots, y_{2(n-1)}$ occur symmetrically in all of the $J$'s, let

(4) $\sigma = y_1 + y_2 + \cdots + y_{2(n-1)}.$
Then (3) takes the form

(5)
$$t_0' = nt_0 - \sigma - (n - 1)t_1,$$
$$\sigma' = 2(n - 1)t_0 - \sigma - 2(n - 1)t_1,$$
$$t_1' = (n - 1)t_0 - \sigma - (n - 2)t_1.$$







This also is an involutorial transformation on the variables $\sigma$, $t$. We now make the change of variable

(6) $2x = -\sigma + 2t_0, \qquad x + x_j = t_j \quad (j = 0, 1, \dots, p).$
Then all of the generators $J$ take the typical form
(7)
$$x_0' = (n - 2)x_0 - (n - 1)x_1,$$
$$x_1' = (n - 3)x_0 - (n - 2)x_1,$$
$$x' = x, \quad x_2' = x_2, \quad \dots, \quad x_p' = x_p.$$
But (7) is the generator $I_1$ of $g_p(\alpha)$ amplified by $x' = x$, where
(8) $\alpha = n - 2, \quad \delta = n - 3, \quad \beta = n - 1, \quad \text{and} \quad \beta\delta = \alpha^2 - 1.$

Hence

(9) The generic type of ternary Cremona transformation which is a product, formed as indicated above, of de Jonquières transformations of order $n$ is isomorphic with the generic element (2) of the group $g_p(n - 2)$.

Suppose that the generic element $T$ of $g_p(n - 2)$ with constants (8) is given as in (2) with the necessary conditions on its coefficients as determined in 4 (2), (7), (8), namely

(10)
$$a_{00}^2 - 1 = a_0(n - 3)(n - 1), \qquad a_{i0} = (n - 3)d_i, \qquad a_{0j} = (n - 1)b_j \quad (i, j = 1, \dots, p),$$
$$a_{00} - \Delta = \mu(n - 1), \qquad \mu - \sum_j b_j = 2\lambda.$$
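We seek to determine the corresponding type of ternary Cremona transformation. From (6) and (2) we find that
$$t_0' = x' + x_0' = x + a_{00}x_0 - \sum_j a_{0j}x_j = x + a_{00}(t_0 - x) - \sum_j a_{0j}(t_j - x)$$
$$= x\Big[1 - a_{00} + \sum_j a_{0j}\Big] + a_{00}t_0 - \sum_j a_{0j}t_j$$
$$= \tfrac{1}{2}(-\sigma + 2t_0)\Big[1 - a_{00} + \sum_j a_{0j}\Big] + a_{00}t_0 - \sum_j a_{0j}t_j.$$
On substituting for $a_{00}$ and $a_{0j}$ from (10), we have for $t_0'$, and similarly for the other variables,

(11)
$$t_0' = \Big\{1 + (n - 1)\sum_j b_j\Big\}t_0 - \Big\{\sum_j b_j\Big\}(y_1 + \cdots + y_{2(n-1)}) - (n - 1)\sum_j b_j t_j,$$
$$y_i' = \Big\{\sum_j b_j\Big\}t_0 - \Big\{\tfrac{1}{2}\Big(\sum_j b_j - \mu\Big)\Big\}(y_1 + \cdots + y_{2(n-1)}) + \Delta y_i - \sum_j b_j t_j \quad [i = 1, \dots, 2(n - 1)],$$
$$t_k' = \{(n - 1)d_k\}t_0 - d_k(y_1 + \cdots + y_{2(n-1)}) - \sum_j a_{kj}t_j \quad [k = 1, \dots, p].$$

This value of $y_i'$ is determined indirectly from $\sigma'$. For, $-\sigma' + 2t_0' = -\sigma + 2t_0$, whence
$$\sigma' = \Big\{2(n - 1)\sum_j b_j\Big\}t_0 - \Big(2\sum_j b_j - 1\Big)\sigma - 2(n - 1)\sum_j b_j t_j.$$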
















Now $\sigma'$ is obtained from $y_1' + \cdots + y_{2(n-1)}'$, and the values of the $y'$'s are the same except in the matrix of coefficients of $y_1, \dots, y_{2(n-1)}$. This square matrix has like elements $-m$ except in the principal diagonal where they are $-(m + 1)$. On comparing the $\sigma'$ arising from such rows with that just given, and noting that
$$2\sum_j b_j - 1 = 2(n - 1)\Big\{\tfrac{1}{2}\Big(\sum_j b_j - \mu\Big)\Big\} - \Delta,$$
we obtain the $y_i'$ given in (11). Since $\sum_j b_j - \mu$ is even [cf. 4 (8)], the coefficients in (11) are integral.

(12) The transformation (11) defined by the coefficients a;; of T in g,(n — 2) 
represents a geometrically existent ternary Cremona transformation, the product 
of a set of de Jonquiéres transformations. It is formed from generators J in (3) 
precisely as T is formed from generators I. 

Since (11) is attached to a geometrically existent transformation, its co- 

efficients have a variety of properties which can be transferred at once to the 
coefficients of T which are found in (11). We recapitulate some of these: 
(13) (a) The coefficients a;; of the generic transformation T of g,(a = n — 2) 
with 6 = n — 3, 8 = n — 1 are positive integers or zero with the exception that if 
ai; = — 1(i > 0,7 > 0), then every coefficient in the same row and column as a;; is 
zero. (b) With every set of x equal numbers b; there is associated a set of x equal 
numbers d;. The x columns and x rows of the matrix of coefficients thus isolated 
have in common a square matrix M, whose elements are all —m except that in each 
row and column of M, there is one element —(m + 1). Otherwise the x columns 
are identical and the x rows are identical. (c) For a particular element a;; (i > 0, 
j > 0), the inequality (n — 1)(b; + di) S 1 + (nm — 1)E,b; + aj; ts valid. 

This last inequality arises from the known inequality (cf. ‘II, p. 368), 7; + p: 
< m + aj;, which occurs in connection with a ternary transformation of order 
m which possesses an F-point of order p; and of multiplicity a;; on a P-curve 
of order o;. 

We have just used the ternary transformations to obtain properties of the 
group g,(a), but clearly the reaction is mutual, as is expressed in (12). Thus 
one may read off from the rows and columns of coefficients in (11), the char- 
acteristics of homaloidal nets and of P-curves in terms of the rows and columns 
of the comparatively simple elements of the group g,(a@). We may then expect 
to find applications of the groups g,(a) and g,(r, €, e), not merely in connection 
with new space transformations and groups where they first were observed, 
but also in connection with aggregates of transformations already studied. 

The underlying restriction a = 2, applied in the above account, yields n — 2 
=2orn = 4. It is interesting to notice that in the excluded cases n = 2, 3 
the products of de Jonquiéres transformations as used above may be regarded 
as the elements of a ternary Cremona group. For if the 2(n — 1) simple F- 
points of the direct and inverse transformations could be superposed at positions 
Pi, °** » Pons, fixed for all the generators, these generators would yield an 
actual group. Thus, if, for n = 2, p;, pe are fixed at the circular points and 
















Ds, Ps is variable, the group of inversions is determined. If, for n = 3, the four 
simple points of the cubic transformation, both direct and inverse, are fixed at 


Pi, *** , Ps, then the double F-points must coincide at O,, since the five direct, 
and five inverse, F-points are projective. The generator Jo, is then the projec- 
tion of each conic of the pencil on p,, --~- , ps into itself from O,. A quadratic 


involution with F-triangle at pe, ps3, ps and fixed point p,; converts the conics 
into lines on p,, and the lines on O; into conics on po, p3, ps, Pi. Thus our 
generators are transformed into cubic involutions with a fixed double F-point, 
three fixed simple F-points and one variable F-point. They generate an infinite 
de Jonquiéres group with a pencil of invariant lines through p,. 

For $n \ge 4$ the two sets of direct and inverse simple points of the de Jonquières
transformation cannot lie in superposed position.


10. Inequalities satisfied by the integral coefficients of the elements of $g_p(a)$.
In this section we take $g_p(a)$ with $\delta = 1$, $\beta = a^2 - 1$. We shall be concerned
primarily with those linear forms which are respectively the conjugates of $x_0$
and the conjugates of $x_i$ under $g_p(a)$, i.e., with the rows of the various elements
in $g_p(a)$. It is convenient to use also the contragredient form $g_p(a)_\xi$ of $g_p(a)_x$,
which is defined by the following contragredient invariant:


(1)  $\xi_0 x_0 - (\xi_1 x_1 + \dots + \xi_p x_p) = \xi_0' x_0' - (\xi_1' x_1' + \dots + \xi_p' x_p').$

Then the contragredient generators of $g_p(a)_x$ and $g_p(a)_\xi$ are

(2)  $I_1(x)$:  $x_0' = a x_0 - (a^2 - 1)x_1$,  $x_1' = x_0 - a x_1$;
     $I_1(\xi)$:  $\xi_0' = a\xi_0 - \xi_1$,  $\xi_1' = (a^2 - 1)\xi_0 - a\xi_1$.

The invariants of the two groups are respectively:

(3)  $L(x) = x_0 - (a - 1)(x_1 + \dots + x_p)$,
     $L(\xi) = (a + 1)\xi_0 - (\xi_1 + \dots + \xi_p)$,
     $Q(x) = x_0^2 - (a^2 - 1)(x_1^2 + \dots + x_p^2)$,
     $Q(\xi) = (a^2 - 1)\xi_0^2 - (\xi_1^2 + \dots + \xi_p^2)$.

A set of values c of the x's will be called a characteristic C(x) (x = c), and
this set will often be indicated by a form $c_0\xi_0 - (c_1\xi_1 + \dots + c_p\xi_p)$; a set of
values $\gamma$ of the $\xi$'s will be called a characteristic C($\xi$) ($\xi = \gamma$), and again this set
will often be indicated by a form $\gamma_0 x_0 - (\gamma_1 x_1 + \dots + \gamma_p x_p)$.

The characteristics C(x) indicated by $\xi_0$ and its conjugates will be called
characteristics H(x). A list of the early conjugates follows:

(4)  $H = \xi_0$,
     $H_i = \{1 + (a - 1)\}\xi_0 - \xi_i$,
     $H_{ij} = \{1 + (a - 1)(a + 1)\}\xi_0 - a\xi_i - \xi_j$,
     $H_{iji} = \{1 + (a - 1)a^2\}\xi_0 - (a^2 - a)\xi_i - a\xi_j$,
     $H_{ijk} = \{1 + (a - 1)(a^2 + a + 1)\}\xi_0 - a^2\xi_i - a\xi_j - \xi_k$,
     $H_{ijij} = \{1 + (a - 1)(a^3 - a^2 - a + 1)\}\xi_0 - (a^3 - 2a^2 + 1)\xi_i - (a^2 - a)\xi_j$,
     $H_{ijkj} = \{1 + (a - 1)(a^3 + 1)\}\xi_0 - (a^3 - a^2 + 1)\xi_i - (a^2 - a)\xi_j - a\xi_k$,
     $H_{ijik} = \{1 + (a - 1)(a^3 + 1)\}\xi_0 - (a^3 - a^2)\xi_i - a^2\xi_j - \xi_k$,
     $H_{ijki} = \{1 + (a - 1)(a^3 + a^2)\}\xi_0 - (a^3 - a)\xi_i - a^2\xi_j - a\xi_k$,
     $H_{ijkl} = \{1 + (a - 1)(a^3 + a^2 + a + 1)\}\xi_0 - a^3\xi_i - a^2\xi_j - a\xi_k - \xi_l$.


It is clear that
(5) The set of conjugates (4) of $\xi_0$ under $g_p(a)_\xi$ have for polars as to Q(x) the set
of conjugates of $x_0$ under $g_p(a)_x$. The form $H_{ij\cdots}$ is the transform of H by the
product $I_i(\xi)\cdot I_j(\xi)\cdots$. The coefficients $h_0, h_1, \dots, h_p$ of a particular form
$h_0\xi_0 - (h_1\xi_1 + \dots + h_p\xi_p)$ satisfy the relations Q(h) = 1, L(h) = 1.
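The relations in (5) can be checked numerically. The following is a minimal sketch, assuming the action of an involution on a characteristic is read off from the generators (2): substituting $I_j(\xi)$ into $h_0\xi_0 - (h_1\xi_1 + \dots)$ replaces $h_0$ by $a h_0 - (a^2 - 1)h_j$ and $h_j$ by $h_0 - a h_j$. The function names are illustrative and not part of the paper.

```python
def apply_involution(h, j, a):
    """Coefficients of the form obtained by substituting I_j(xi) into
    h0*xi_0 - (h1*xi_1 + ... + hp*xi_p); h = (h0, h1, ..., hp)."""
    h = list(h)
    h0, hj = h[0], h[j]
    h[0] = a * h0 - (a * a - 1) * hj
    h[j] = h0 - a * hj
    return tuple(h)


def transform_of_H(word, a, p):
    """Characteristic of H_word, built from H = (1; 0 ... 0).
    The rightmost subscript of the word is substituted first."""
    h = (1,) + (0,) * p
    for j in reversed(word):
        h = apply_involution(h, j, a)
    return h


def L(h, a):
    """Linear invariant L(x) evaluated at x = h."""
    return h[0] - (a - 1) * sum(h[1:])


def Q(h, a):
    """Quadratic invariant Q(x) evaluated at x = h."""
    return h[0] ** 2 - (a * a - 1) * sum(v * v for v in h[1:])
```

With i, j, k, l taken as 1, 2, 3, 4, `transform_of_H((1, 2, 1, 2), 3, 4)` reproduces the coefficients of $H_{ijij}$ for a = 3, and `L` and `Q` both return 1 on every transform.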

The characteristics C($\xi$) indicated by $x_i$ and its conjugates will be called
characteristics P($\xi$). A list of the early conjugates follows:

(6)  $P = x_i$,
     $P_i = x_0 - a x_i$,
     $P_{ji} = a x_0 - (a^2 - 1)x_j - a x_i$,
     $P_{iji} = (a^2 - a)x_0 - a(a^2 - a - 1)x_i - (a^2 - 1)x_j$,
     $P_{kji} = a^2 x_0 - a(a^2 - 1)x_k - (a^2 - 1)x_j - a x_i$,
     $P_{jiji} = (a^3 - 2a^2 + 1)x_0 - (a^4 - 2a^3 - a^2 + 2a)x_j - a(a^2 - a - 1)x_i$,
     $P_{kiji} = a^2(a - 1)x_0 - (a^2 - a)(a^2 - 1)x_k - a(a^2 - a - 1)x_i - (a^2 - 1)x_j$,
     $P_{jkji} = (a^3 - a^2 + 1)x_0 - (a^4 - a^3 - a^2 + a)x_j - a(a^2 - 1)x_k - a x_i$,
     $P_{ikji} = (a^3 - a)x_0 - (a^4 - 2a^2)x_i - a(a^2 - 1)x_k - (a^2 - 1)x_j$,
     $P_{lkji} = a^3 x_0 - a^2(a^2 - 1)x_l - a(a^2 - 1)x_k - (a^2 - 1)x_j - a x_i$.


Again it is clear that

(7) The set of conjugates (6) of $x_i$ under $g_p(a)_x$ are such that $P_{jk\cdots}$ is the transform
of $x_i$ by the product $I_j(x)\cdot I_k(x)\cdots$. The coefficients $\pi_0, \pi_1, \dots, \pi_p$ of a particular
one of these forms, $\pi_0 x_0 - (\pi_1 x_1 + \dots + \pi_p x_p)$, satisfy the relations $L(\pi) = 1$,
$Q(\pi) = -1$.
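The relations stated in (7) can be tested in the same way. This is a sketch under the assumption that substituting $I_j(x)$ into $\pi_0 x_0 - (\pi_1 x_1 + \dots)$ sends $\pi_0$ to $a\pi_0 - \pi_j$ and $\pi_j$ to $(a^2 - 1)\pi_0 - a\pi_j$, as follows from the generators (2). The helper names are hypothetical, and the involutions are listed in the order in which they are applied to $x_i$.

```python
def apply_involution_pi(pi, j, a):
    """Coefficients after substituting I_j(x) into
    pi0*x_0 - (pi1*x_1 + ... + pip*x_p); pi = (pi0, pi1, ..., pip)."""
    pi = list(pi)
    p0, pj = pi[0], pi[j]
    pi[0] = a * p0 - pj
    pi[j] = (a * a - 1) * p0 - a * pj
    return tuple(pi)


def conjugate_of_x(i, applied, a, p):
    """Characteristic of the conjugate of x_i reached by applying the
    involutions in `applied`, first entry first. x_i itself is (0; -1 at i)."""
    pi = [0] * (p + 1)
    pi[i] = -1
    pi = tuple(pi)
    for j in applied:
        pi = apply_involution_pi(pi, j, a)
    return pi


def L_xi(pi, a):
    """Linear invariant L(xi) evaluated at xi = pi."""
    return (a + 1) * pi[0] - sum(pi[1:])


def Q_xi(pi, a):
    """Quadratic invariant Q(xi) evaluated at xi = pi."""
    return (a * a - 1) * pi[0] ** 2 - sum(v * v for v in pi[1:])
```

For a = 3 and i = 1, applying $I_1$, $I_2$, $I_1$ in turn gives (6; 15, 8, 0), which agrees with the three-involution entry of (6); every conjugate tested has $L = 1$ and $Q = -1$.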




We prove first the theorem:

(8) Any characteristic $(h_0; h_1 \cdots h_p)$ for which L(h) = 1, Q(h) = 1, and for
which $h_j \ge 0$ (j = 1, ..., p), when arranged so that $h_1 \ge h_2 \ge h_3 \ge \cdots \ge h_p$,
satisfies the inequalities

$0 < h_0 < (a + 1)h_1$

except in the one instance (1; 0 ... 0) given by H in (4).

For, from L(h) = 1, $a \ge 2$ and $h_j \ge 0$, it follows that $h_0 > 0$. If $h_0 = 1$,
it follows from $h_1 + \dots + h_p = 0$ that $h_j = 0$ (j = 1, ..., p). This yields
the excluded characteristic (1; 0 ... 0). If $h_0 > 1$, $h_1 > 0$. If then $h_2 = \dots = h_p = 0$,
L(h) = Q(h) = 1 become $(a - 1)h_1 = h_0 - 1$, $(a^2 - 1)h_1^2 = h_0^2 - 1$.
By division, $(a + 1)h_1 = h_0 + 1$, whence (h) = (a; 1 0 ... 0), which satisfies
(8). If $h_2 > 0$, we use the inequality $h_1(h_2 + \dots + h_p) \ge h_2^2 + \dots + h_p^2$.
This, combined with L(h) = Q(h) = 1, yields
$[(h_0 - 1) - (a - 1)h_1](a + 1)h_1 \ge (h_0^2 - 1) - (a^2 - 1)h_1^2$.
Hence $(h_0 - 1)(a + 1)h_1 \ge h_0^2 - 1$. Since $h_0 > 1$,
$(a + 1)h_1 \ge h_0 + 1$, or $h_0 < (a + 1)h_1$.

We prove further that
(9) Every characteristic $(h_0; h_1 \cdots h_p)$ which satisfies L(h) = Q(h) = 1, and
which satisfies the system (infinite except when p = 2, a = 2) of inequalities
$P_{j\cdots}(x = h) \ge 0$ obtained from the forms (6), can be reduced by transformations in
$g_p(a)_x$ to the characteristic (1; 0 ... 0).

For, since (h) satisfies $x_i \ge 0$, the $h_1, \dots, h_p$ are positive or zero, whence $h_0$
is positive. According to (8) there is an $h_j$ such that $h_0 < (a + 1)h_j$. Then
$I_j(x)$ applied to (h) yields a characteristic (h') for which $h_0' < h_0$ and which also
satisfies the inequalities $h_i' \ge 0$, since these inequalities for h' arise from
inequalities in the set (4) satisfied by (h). The element $h_0$ can therefore be
continually reduced until it reaches 1, in which case the remaining h's are all zero,
since they still satisfy $x_i \ge 0$.
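The reduction used in this proof can be sketched as a short procedure. It is an illustration only: it assumes the input already satisfies L(h) = Q(h) = 1, it always applies the involution at the largest $h_j$ (one admissible choice under (8)), and it stops as soon as a negative entry appears. The names `apply_involution` and `reduce_characteristic` are hypothetical.

```python
def apply_involution(h, j, a):
    """Action of I_j(x) on a characteristic h = (h0, h1, ..., hp)."""
    h = list(h)
    h0, hj = h[0], h[j]
    h[0] = a * h0 - (a * a - 1) * hj
    h[j] = h0 - a * hj
    return tuple(h)


def reduce_characteristic(h, a):
    """Greedy reduction toward (1; 0 ... 0).

    Returns (reached, steps): `reached` is True when (1; 0 ... 0) is
    obtained, and `steps` lists the indices j of the involutions applied.
    Assumes p >= 1 and L(h) = Q(h) = 1.
    """
    h = tuple(h)
    steps = []
    while h[0] > 1:
        # index of the largest h_j (first one in case of ties)
        j = max(range(1, len(h)), key=lambda i: h[i])
        nxt = apply_involution(h, j, a)
        # stop if h0 fails to decrease or some entry becomes negative
        if nxt[0] >= h[0] or min(nxt[1:]) < 0:
            return False, steps
        steps.append(j)
        h = nxt
    return all(v == 0 for v in h[1:]), steps
```

For a = 2 the characteristic (16; 8, 4, 2, 1) reduces in four steps, while (11; 6, 1, 1, 1, 1) fails at the first step because $h_0 - a h_1 = -1$.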

It is probable that all of these inequalities are independent. For example,
when a = 2, there is a case (h) = $(11; 6, 1^4)$ which satisfies $x_i \ge 0$ but which does
not satisfy $P_i \ge 0$. There is also a case (h) = $(20; 9, 7, 1^3)$ which satisfies $x_i \ge 0$
and $P_i \ge 0$, but which does not satisfy $P_{ij} \ge 0$.

A list of characteristics for which L = Q = 1 and $x_i \ge 0$, for this case a = 2
and complete up to $h_0 = 17$ is as follows:

$(1; 0)$, $(2; 1)$, $(4; 2, 1)$, $(5; 2^2)$, $(8; 4, 2, 1)$, $(10; 4^2, 1)$, $(10; 5, 2^2)$, $(11; 6, 1^4)$*,
$(13; 6, 4, 2)$, $(13; 7, 2, 1^3)$*, $(14; 6, 5, 2)$, $(16; 8, 4, 2, 1)$, $(17; 9, 3, 2, 1^2)$*,
those marked * not being reducible to (1; 0 ... 0).
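A list of this kind can be regenerated by direct search. The sketch below assumes the conditions are exactly L(h) = 1, Q(h) = 1 with non-negative entries, writes each characteristic as $h_0$ followed by its positive entries in non-increasing order, and uses a hypothetical function name.

```python
def characteristics(a, h0_max):
    """All (h0, parts) with 1 <= h0 <= h0_max, parts a non-increasing tuple
    of positive integers, and L(h) = Q(h) = 1."""
    found = []
    for h0 in range(1, h0_max + 1):
        # L = 1 fixes the sum of the parts, Q = 1 fixes the sum of squares
        if (h0 - 1) % (a - 1) or (h0 * h0 - 1) % (a * a - 1):
            continue
        target_sum = (h0 - 1) // (a - 1)
        target_sq = (h0 * h0 - 1) // (a * a - 1)

        def search(rem_sum, rem_sq, max_part, prefix):
            if rem_sum == 0:
                if rem_sq == 0:
                    found.append((h0, tuple(prefix)))
                return
            for part in range(min(max_part, rem_sum), 0, -1):
                if part * part <= rem_sq:
                    search(rem_sum - part, rem_sq - part * part,
                           part, prefix + [part])

        search(target_sum, target_sq, target_sum, [])
    return found
```

Calling `characteristics(2, 17)` returns every entry printed above. Under these definitions it also returns $(17; 8, 4^2)$, which $I_1$ carries to $(10; 4^2, 1)$ and which is therefore reducible; the text does not settle whether the printed list omits it or intends the bound differently.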


(10) If a characteristic (h) = $(h_0; h_1 \cdots h_p)$ is reducible in $g_p(a)_x$, when $a \ge 3$,
to (1; 0 ... 0), it satisfies the following system of inequalities:

(I) $a h_1 \le h_0 < (a + 1)h_1$;  (II) $h_1 > h_2 > \cdots > h_k > h_{k+1} = 0 = h_{k+2} = \cdots = h_p$;
(III) $h_0 > (a + 1)h_2$;  (IV) $h_0 - a h_2 > h_1$.

For if (h) is reducible to (1; 0 ... 0) in $g_p(a)_x$, then conversely (1; 0 ... 0)
can be transformed into (h) by a sequence of involutions $I_j$. Such a set of
transforms, $h_0\xi_0 - (h_1\xi_1 + \dots + h_p\xi_p)$, of (1; 0 ... 0) by sequences of four or
fewer involutions is found in (4). The statements of (10) are satisfied by all
of these transforms, and furthermore the largest $h_i$ (i = 1, ..., p) is that which
arises from the involution last used. Suppose then that these statements remain
true for transforms under all sequences of n or fewer involutions. If $(h_n)$
is such a transform, we apply an additional involution to obtain a transform

$(h'_{n+1}) = (a h_0 - (a^2 - 1)h_j;\ h_0 - a h_j,\ h_1, \dots, h_{j-1}, h_{j+1}, \dots, h_p)$,

where $j \ge 2$, else $(h'_{n+1})$ is an $(h_{n-1})$. Then the inequalities (II) are satisfied by
$(h'_{n+1})$, since (II) and (IV) are satisfied by $(h_n)$. The second part of the
inequality (I) is then a consequence of (8). The first part of the inequality (I),
$h_0' \ge a h_1'$, reduces to $h_j \ge 0$. The inequality (IV) is an immediate consequence
of L(h') = 1 which can be written in the form

$h_0' - a h_2' = h_1' + 1 + [(a - 3)h_1' + (a - 1)(h_3' + \dots + h_p') + (h_1' - h_2')]$.

Finally, the inequality (III) is a consequence of (IV) and $h_2' < h_1'$.
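The inequalities of (10) lend themselves to a brute-force check on short products of involutions. The following is a sketch that assumes the reading (I) $a h_1 \le h_0 < (a+1)h_1$, (II) the positive entries strictly decreasing, (III) $h_0 > (a+1)h_2$, (IV) $h_0 - a h_2 > h_1$; the function names are illustrative.

```python
from itertools import product


def apply_involution(h, j, a):
    """Action of I_j(x) on a characteristic h = (h0, h1, ..., hp)."""
    h = list(h)
    h0, hj = h[0], h[j]
    h[0] = a * h0 - (a * a - 1) * hj
    h[j] = h0 - a * hj
    return tuple(h)


def satisfies_10(h, a):
    """Test (I)-(IV) of (10) after ordering the entries h1 >= h2 >= ..."""
    h0 = h[0]
    parts = sorted(h[1:], reverse=True)
    h1 = parts[0]
    h2 = parts[1] if len(parts) > 1 else 0
    positive = [v for v in parts if v > 0]
    strictly_decreasing = all(x > y for x, y in zip(positive, positive[1:]))
    return (a * h1 <= h0 < (a + 1) * h1      # (I)
            and strictly_decreasing          # (II)
            and h0 > (a + 1) * h2            # (III)
            and h0 - a * h2 > h1)            # (IV)


def transforms(a, p, max_len):
    """Characteristics obtained from (1; 0 ... 0) by 1..max_len involutions,
    no two successive involutions being alike."""
    for n in range(1, max_len + 1):
        for word in product(range(1, p + 1), repeat=n):
            if any(x == y for x, y in zip(word, word[1:])):
                continue
            h = (1,) + (0,) * p
            for j in word:
                h = apply_involution(h, j, a)
            yield h
```

For a = 3 or 4 every transform produced by `transforms` passes `satisfies_10`, whereas for a = 2 the characteristic (4; 2, 1) already violates (IV).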

(11) The generic conjugate of $x_0$ in $g_p(a)_x$ ($a \ge 3$) has the form
$x_0' = h_0 x_0 - (a^2 - 1)(h_1 x_1 + \dots + h_p x_p)$, where the numbers $h_0, \dots, h_p$, when
properly ordered, are those described in (10).

This brings out the rather unlooked for result that when $a \ge 3$, none of the
$h_1, \dots, h_p$ are equal unless they are zero. Thus the matrices M, of 9 (13)(b)
do not exist. However this is not true when a = 2, as the list preceding (10)
shows.

The conditions, finite in number, given in (10) do not replace the infinite
number mentioned in (9). They include the first of these, such as $x_i \ge 0$ and
$P_i \ge 0$, but they do not lead to $P_{ij} \ge 0$ [cf. (6)].

We now consider the characteristics $(\pi_0; \pi_1 \cdots \pi_p)$ mentioned in (7), examples
of which are given in (6). The first of these, (0; −1, 0, ..., 0), is somewhat
exceptional in that $\pi_0 = 0$ and a $\pi_i$ is negative. This is the only solution of the
equations $L(\pi) = 1$, $Q(\pi) = -1$ for which $\pi_0 = 0$. It is also clear that the
second example, $(1; a\ 0^{p-1})$, is the only solution of these equations for which
$\pi_2, \dots, \pi_p$ are zero. We now prove that
(12) Except for the two particular cases just mentioned, every characteristic $\pi$
which satisfies the equations $L(\pi) = 1$, $Q(\pi) = -1$, and for which $\pi_i \ge 0$
(i = 1, ..., p), when arranged so that $\pi_1 \ge \pi_2 \ge \cdots \ge \pi_p$, satisfies the inequalities:

$(a - 1)\pi_0 < \pi_1 < a\pi_0$;  $\pi_2 < (a - 1)\pi_0$ if $a \ge 3$.

For from the inequality $\pi_1(\pi_1 + \dots + \pi_p) \ge \pi_1^2 + \dots + \pi_p^2$, it follows immediately
from $L(\pi) = +1$ and $Q(\pi) = -1$ that $\pi_0\pi_1(a + 1) \ge (a^2 - 1)\pi_0^2 + \pi_1 + 1$,
or $\pi_0\pi_1(a + 1) > (a^2 - 1)\pi_0^2$, or $\pi_1 > (a - 1)\pi_0$. Let then $\pi_1 = (a - 1)\pi_0 + k$
(k > 0). Again from $Q(\pi) = -1$ we find that
$2(a - 1)\pi_0(\pi_0 - k) = k^2 - 1 + \pi_2^2 + \dots + \pi_p^2$.
Since k > 0 and $\pi_0 > 0$, then $k < \pi_0$, and $\pi_1 < a\pi_0$.
If now also $\pi_2 = (a - 1)\pi_0 + k_1$, then

$-(a - 1)(a - 3)\pi_0^2 - 2(a - 1)\pi_0 k - 2(a - 1)\pi_0 k_1 = k^2 - 1 + k_1^2 + \pi_3^2 + \dots + \pi_p^2$.

The right member is positive or zero. On the left the first term is zero or negative
if $a \ge 3$, the second is negative, whence $k_1 < 0$, or $\pi_2 < (a - 1)\pi_0$.
(13) With the exception mentioned in (12) every characteristic $\pi$, for which
$\pi_0 x_0 - (\pi_1 x_1 + \dots + \pi_p x_p)$ is a conjugate of an $x_i$ under $g_p(a)_x$ satisfies, when
$a \ge 3$, not only the inequalities of (12) but also the additional inequalities:

$(a^2 - 1)\pi_0 > \pi_1 + a\pi_2$;  $\pi_1 > \pi_2 > \cdots > \pi_k > \pi_{k+1} = 0 = \cdots = \pi_p$.

It is clear that this is true of the members in the list (6) and that further the
largest integer $\pi_1$ of the characteristic arises from the involution last applied.
Let us then assume that (13) is true for all cases $(\pi_n)$ which arise from $x_i$ by
applying n or fewer involutions. If then we apply an additional involution
(which is not $I_1$, since the transform would then be a $(\pi_{n-1})$ for which (13) is
true), the transformed characteristic is

$(\pi') = (a\pi_0 - \pi_j;\ (a^2 - 1)\pi_0 - a\pi_j,\ \pi_1, \dots, \pi_{j-1}, \pi_{j+1}, \dots, \pi_p)$  ($j \ge 2$).

If $(a^2 - 1)\pi_0 - a\pi_j > \pi_1$, i.e., if the first inequality is satisfied by $(\pi)$, then the
remaining inequalities are satisfied by $\pi'$. But this first inequality is a consequence
of $L(\pi) = 1$, $a \ge 3$, and $\pi_i \ge 0$. For $(a - 1)L(\pi) = (a - 1)$ can be
written as

$(a^2 - 1)\pi_0 - a\pi_2 - \pi_1 = (a - 1) + [(a - 3)\pi_1 + (\pi_1 - \pi_2) + (a - 1)(\pi_3 + \dots + \pi_p)]$.

On the right a − 1 > 0, and every other term is zero or positive, whence the
first inequality is satisfied by $\pi$ and also by $(\pi')$.

Some of the inequalities of (10) and (13) fail in the case of a = 2 because of 
the presence of periodic elements other than the generators. 

If $a \ne 2$, we have the theorem
(14) The groups $g_p(a)$, $a \ge 3$, contain no periodic elements other than the involutorial
generators.

For if $T = I_{k_1} I_{k_2} \cdots I_{k_n}$ is an element of $g_p(a)$ naturally so written that no
two successive subscripts k are alike, then it may happen that $k_1 = k_n$. Then
T is the transform of $T' = I_{k_2} \cdots I_{k_{n-1}}$ by $I_{k_1}$ and T' has the same period as T.
The determination of the period may again be simplified if $k_2 = k_{n-1}$. Proceeding
in this way, we must eventually find that T is the transform of a generator
of period two or that T is the transform of an element $T'' = I_{k_{s+1}} \cdots I_{k_{n-s}}$,
where $k_{s+1} \ne k_{n-s}$. Then $(T'')^j$ is a product of j(n − 2s) generators such
that no two successive factors are alike. But it is clear from (10) (III) that,
as these successive factors are adjoined, the values of $h_0$ in the characteristic
$x_0' = h_0 x_0 - (a^2 - 1)(h_1 x_1 + \dots + h_p x_p)$ steadily increase so that $(T'')^j$ can
never be the identity.


UNIVERSITY OF ILLINOIS. 











COMMUTATOR ALGEBRA OF A FINITE GROUP OF COLLINEATIONS 
By HERMANN WEYL 


1. Introduction. In Chapter V, §§2–4 of my book The Theory of Groups and
Quantum Mechanics (English edition, London, 1931) I gave an elementary account
of the decomposition of tensor space into subspaces invariant under the
algebra of “symmetric” transformations. The treatment was based upon the
reciprocity between the ring of a finite group $\gamma$ whose elements s induce certain
linear operators s in a given vector space $\mathfrak{R}$ and the algebra $\mathfrak{A}$ of those
transformations A in $\mathfrak{R}$ that commute with all operators s. It was immaterial that
$\mathfrak{R}$ was the manifold of all tensors of a certain rank f in an underlying vector space
and $\gamma$ the symmetric group of all f! permutations operating on the f indices or
arguments of the tensors. In many respects the group ring stands in a simpler
relationship to this commutator algebra $\mathfrak{A}$ than to the enveloping algebra $\mathfrak{B}$
of the operators s, and therefore it seems desirable to discuss both sides in a
direct way rather than to rely upon the general theory of a matric algebra and
its commutator algebra (cf. in this regard the observations in the concluding
section).

A number field k may be given in which all numbers which occur are supposed
to lie. We consider vectors

$f = (f_1, \dots, f_n)$

in an n-space $\mathfrak{R}$ whose components $f_i$ are numbers in k. When the result of a
linear transformation $U = \|u_{ik}\|$ on f is denoted by Uf, the components of f
are to be written in a column. A collineation $f \to f'$ in the projective (n − 1)-space
based upon the number field k is a linear transformation U of the
coördinates $f_i$ combined with an automorphism $s\colon \alpha \to \alpha^s$ of k:

$f_i' = \sum_k u_{ik} f_k^s$  or  $f' = U f^s$.


Representations of a finite group by operators of this generalized type involving
automorphisms of the reference field were studied with remarkable success in a
recent paper by Messrs. T. Nakayama and K. Shoda (Jap. Jour. Math., vol. 12
(1936), pp. 109-122). The same step will here be carried out with respect to the
commutator algebra.


2. The group ring. The situation we are concerned with may be described
thus. Given a finite group $\gamma$ of order h; if k is of prime characteristic, that prime
shall not be a divisor of h. To each element s of $\gamma$ corresponds an automorphism
$\alpha \to \alpha^s$ of k such that

$(\alpha^t)^s = \alpha^{st}$.

Received March 1, 1937. 



A collinear representation of degree n of $\gamma$ associates with each s an operator
$f \to f'$ in the n-space $\mathfrak{R}$ of the form

(2.1)  $f_i' = \sum_k u_{ik}(s) f_k^s$  (i, k = 1, ..., n)

in such a way that composition of collineations reflects the composition of group
elements. This is expressed by the following equation for the matrix $U(s) = \|u_{ik}(s)\|$:

(2.2)  $U(st) = U(s)U^s(t)$  (s, t in $\gamma$).

In particular,

(2.3)  $U(s)U^s(s^{-1}) = E$  or  $U^s(s^{-1}) = U^{-1}(s)$.

(2.1) shall be indicated briefly by

(2.4)  $f' = sf$.

We introduce the “quantities” of the group ring $\rho$ by the formal sum

$a = \sum_s a(s)\cdot s$

extending over all elements s of $\gamma$; the components a(s) are arbitrary numbers
in k. Such quantities are at first abstract elements forming a linear manifold
(or “vector space”) $\mathfrak{r}$ of h dimensions. At the same time they serve as operators$^1$
in $\mathfrak{R}$:

(2.5)  $af = \sum_s a(s)\cdot sf$.

This “realization” suggests how to perform multiplication: one has

$a_1(a_2 f) = af$

if $a = a_1a_2$ be defined by

(2.6)  $a(s) = \sum_{rt = s} a_1(r)\,a_2^r(t)$.

The multiplication is associative:

$(a_1a_2)a_3 = a_1(a_2a_3)$.

Indeed, the left side equals

$\sum_{rtu = s} a_1(r)\,a_2^r(t)\,a_3^{rt}(u)$,

whereas the right side equals

$\sum_{rtu = s} a_1(r)\,(a_2(t)\,a_3^t(u))^r$.

$^1$ These operators are not collineations in the same sense as the operators $f \to sf$.













Not even the multiplication with multiples $\beta 1$ of the unit element 1 of $\gamma$ ($\beta$ in k)
is commutative here: while $\beta a$ is to be interpreted as the quantity with the
components $\beta a(s)$, $a\beta$ has the components $a(s)\beta^s$.

In this section we concentrate on the abstract group ring $\rho$ and shove its
representation by operators (2.5) into the background after it has served its purpose
of suggesting the law of multiplication (2.6).
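The law of multiplication (2.6) can be tried out on a small example. The sketch below is an illustration under assumptions not made in the text: the group is taken to be the cyclic group of order 4, written additively; k is the field of complex numbers; and the automorphism attached to s is complex conjugation for odd s and the identity for even s. The names `act`, `mul` and `scalar` are hypothetical.

```python
N = 4  # order of the cyclic group Z/4; the element s is stored as s mod N


def act(s, z):
    """Automorphism attached to s (assumed: conjugation for odd s)."""
    return z.conjugate() if s % 2 else z


def mul(a1, a2):
    """Product in the group ring: a(s) = sum over r of a1(r) * a2^r(r^{-1} s),
    a quantity being the list of its components [a(0), ..., a(N-1)]."""
    return [sum(a1[r] * act(r, a2[(s - r) % N]) for r in range(N))
            for s in range(N)]


def scalar(beta):
    """The multiple beta * 1 of the unit element."""
    return [beta] + [0] * (N - 1)


a = [1 + 2j, 3 - 1j, 2j, -1 + 1j]
b = [2 - 1j, 1j, 1 + 1j, 4 + 0j]
c = [-1j, 2 + 2j, 3 + 0j, 1 - 3j]
beta = 2 + 5j

left = mul(scalar(beta), a)   # components beta * a(s)
right = mul(a, scalar(beta))  # components a(s) * beta^s
```

Here `mul(mul(a, b), c)` and `mul(a, mul(b, c))` agree, while `left` and `right` differ in the components with odd s, in line with the remark on $\beta a$ and $a\beta$.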

A part $\mathfrak{p}$ of $\mathfrak{r}$ closed with respect to addition and the operation

$x \to x' = ax$  ($x \to x' = xa$)

for any a in $\rho$ is called a left (right) invariant subspace of $\mathfrak{r}$. $\mathfrak{p}$ is a linear subspace
in the sense that it contains

$x + y$, $\alpha x$  ($x + y$, $x\alpha$)

along with x, y whatever the number $\alpha$ in k. Associating

(2.7)  (a): $x' = ax$

with the quantity a gives rise to the regular representation ($\rho$) of $\rho$ whose space
is $\mathfrak{r}$ itself:

$a(bx) = (ab)x$.

THEOREM 2A. A left invariant subspace $\mathfrak{p}$ possesses an idempotent generator
e (to the right), i.e., xe lies in $\mathfrak{p}$ for every x, and xe = x for every x in $\mathfrak{p}$. The same
is true for a right invariant subspace, but one must then write ex instead of xe.

The theorem implies that $e = 1e$ is in $\mathfrak{p}$ and hence $ee = e$.

Its proof is almost the same as given in my book, l.c., on pp. 291-292 for the
customary group ring. For a left invariant subspace $\mathfrak{p}$ it runs as follows.
We construct a projection $x \to \bar{x}$ of $\mathfrak{r}$ onto $\mathfrak{p}$, i.e., a substitution

(2.8)  $\bar{x}(s) = \sum_t d(s, t)\,x(t)$

with the two desired properties that it changes every x into an $\bar{x}$ in $\mathfrak{p}$ and is the
identity within $\mathfrak{p}$. From the expression

$y(s) = \sum_r a(r)\,x^r(r^{-1}s)$  for  $y = ax$

one concludes that the substitution leading from

$y(s) = x^r(r^{-1}s)$  to  $\bar{y}(s) = \bar{x}^r(r^{-1}s)$

is a projection as well:

$\bar{y}(s) = \sum_t d^r(r^{-1}s, r^{-1}t)\,y(t)$.

Hence the same holds for the “average”:

$e(s, t) = \frac{1}{h}\sum_r d^r(r^{-1}s, r^{-1}t)$.

As it satisfies

$e^r(r^{-1}s, r^{-1}t) = e(s, t)$,

we may write

$e(s, t) = e^t(t^{-1}s)$  with  $e(s) = e(s, 1)$,

and then the projection

$\bar{x}(s) = \sum_t e(s, t)\,x(t)$

can be abbreviated to $\bar{x} = xe$.

For a right invariant subspace a projection will be of the form

$\bar{x}(s) = \sum_t d(s, t)\,x^{st^{-1}}(t)$

rather than (2.8), because this kind of substitution changes $xa$ into $\bar{x}a$ along
with $x \to \bar{x}$, and the averaging process is to be defined by

$e(s, t) = \frac{1}{h}\sum_r d(sr, tr) = e(st^{-1})$.

Incidentally right- can be reduced to left-invariance by a simple process
exchanging the order of factors. With a quantity a we associate $\hat{a}$ as defined by

(2.9)  $\hat{a}(s) = a^s(s^{-1})$.

The relationship is involutorial because (2.9) entails

$a(s) = \hat{a}^s(s^{-1})$.

The reader is called upon to verify the

LEMMA 2B. If $a = a_1a_2$, then $\hat{a} = \hat{a}_2\hat{a}_1$.
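The verification asked of the reader in Lemma 2B can be spot-checked numerically. This is a sketch under the same illustrative assumptions as before, none of which come from the text: the cyclic group of order 4, complex components, and conjugation as the automorphism for odd s. The helper names are hypothetical.

```python
N = 4  # cyclic group Z/4, written additively


def act(s, z):
    """Automorphism attached to s (assumed: conjugation for odd s)."""
    return z.conjugate() if s % 2 else z


def mul(a1, a2):
    """a(s) = sum over r of a1(r) * a2^r(r^{-1} s), as in (2.6)."""
    return [sum(a1[r] * act(r, a2[(s - r) % N]) for r in range(N))
            for s in range(N)]


def roof(a):
    """The roof operation (2.9): component s is a^s(s^{-1})."""
    return [act(s, a[(-s) % N]) for s in range(N)]


a1 = [1 + 2j, 3 - 1j, 2j, -1 + 1j]
a2 = [2 - 1j, 1j, 1 + 1j, 4 + 0j]

product_roof = roof(mul(a1, a2))        # roof of a1 a2
reversed_roofs = mul(roof(a2), roof(a1))  # roof(a2) times roof(a1)
```

In this example `product_roof` equals `reversed_roofs`, and applying `roof` twice returns the original quantity.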

From now on only left invariant subspaces will be considered, and the specification
“left” will be dropped. We derive from the existence of the generating
idempotent these consequences (loc. cit.):

THEOREM 2C. An invariant subspace $\mathfrak{p}$ containing the invariant subspace
$\mathfrak{p}_1 \subset \mathfrak{p}$ can be split according to $\mathfrak{p} = \mathfrak{p}_1 + \mathfrak{p}_2$ into $\mathfrak{p}_1$ and a complementary invariant
subspace $\mathfrak{p}_2$.

Proof. $e_1$ being an idempotent generator of $\mathfrak{p}_1$, we have

$x = xe_1 + (x - xe_1) = x_1 + x_2$

for every element x of $\mathfrak{p}$. The first summand $x_1$ is in $\mathfrak{p}_1$, while the second satisfies
$x_2e_1 = 0$. Hence $\mathfrak{p}_2 = \{x_2\}$ is linearly independent of $\mathfrak{p}_1$.

A given idempotent generator e of $\mathfrak{p}$ splits like every x of $\mathfrak{p}$ into two parts

(2.10)  $e = e_1 + e_2$

lying in $\mathfrak{p}_1$, $\mathfrak{p}_2$ respectively. As (2.10) implies the following decomposition of
any x in $\mathfrak{p}$:

$x = xe = xe_1 + xe_2 = x_1 + x_2$,

we have in particular

$e_1e_1 = e_1$,  $e_1e_2 = 0$,
$e_2e_1 = 0$,  $e_2e_2 = e_2$.

THEOREM 2D. A similarity projection $x \to x'$ of an invariant subspace $\mathfrak{p}$ upon
$\mathfrak{p}'$ is generated by aft multiplication with a quantity b: $x' = xb$.

A correspondence $x \to x'$ is a similarity projection with respect to the algebra
($\rho$) of the operators (a), (2.7), if carrying x + y into x' + y' (linearity) and ax
into ax'. The proposition follows at once if b is taken as the image e' of the
idempotent generator e of $\mathfrak{p}$; one then has eb = b.

Invariant subspaces which can be put into a one-to-one similarity correspondence
are called similar or equivalent.


3. Formal lemmas. We now return to the representations (2.4), (2.5) of
$\gamma$ and $\rho$ by operators in $\mathfrak{R}$. Notice that

$s(f + f') = sf + sf'$,  $s(\alpha f) = \alpha^s\cdot sf$.

For any value i = 1, ..., n of the index i we denote by $\mathbf{f}_i$ the quantity in $\rho$
with the components

$\mathbf{f}_i(s) = sf_i = \sum_k u_{ik}(s) f_k^s$

and by $\mathbf{f}$ the column $(\mathbf{f}_1, \dots, \mathbf{f}_n)$ whose s-component is the vector $\mathbf{f}(s) = sf$.
The arguments used (l.c., §4) rest on the validity of the following formal lemmas.

LEMMA 3A.

(3.1)  $a\cdot\mathbf{f}_i = \sum_k a_{ik}\mathbf{f}_k$

with

$\|a_{ik}\| = \sum_r a(r)\,U^{-1}(r)$.

Proof. The s-component of $a\cdot\mathbf{f}$ is the vector

$g(s) = \sum_r a(r)\,(r^{-1}sf)^r$.

Apply

$f = r(r^{-1}f) = U(r)(r^{-1}f)^r$,  $(r^{-1}f)^r = U^{-1}(r)f$

to sf rather than f and thus verify the statement of the lemma.

LEMMA 3B. $\mathbf{f}_i\cdot a = \mathbf{g}_i$ where the vector g is defined by

$g = \sum_r a^r(r^{-1})\cdot rf = \hat{a}f$.

In other words, if $g = \hat{a}f$, then $\mathbf{g} = \mathbf{f}\cdot a$.

Proof. $\mathbf{f}_i\cdot a = x$ is indeed given by

$x(s) = \sum_r srf_i\cdot a^{sr}(r^{-1}) = sg_i$.

LEMMA 3C. An equation of the kind

(3.2)  $a(s) = \sum_{i=1}^{n} \varphi_i\cdot sf_i = \varphi U(s) f^s$,

where $\varphi = (\varphi_1, \dots, \varphi_n)$ is a row rather than a column of numbers (contravariant
vector) entails

$\hat{a}(s) = \sum_{i=1}^{n} f_i\cdot s\varphi_i$

when this is interpreted as meaning

(3.3)  $\hat{a}(s) = \varphi^s U^{-1}(s) f$.

The linear transformation

$\|a_{ik}\| = \|\sum_s sf_i\cdot s\varphi_k\|$,

that is

(3.4)  $A = \sum_s U(s)\,f^s\varphi^s\,U^{-1}(s)$

commutes with the operators s.

Proof. (3.3) follows at once from (3.2) by taking (2.3) into account. g being
an arbitrary vector in $\mathfrak{R}$, we compute from the explicit expression (3.4):

$tAt^{-1}g = U(t)(A(t^{-1}g))^t = U(t)A^tU^{-1}(t)g$,
$U(t)A^tU^{-1}(t) = \sum_s U(ts)\,f^{ts}\varphi^{ts}\,U^{-1}(ts) = A$.


4. Reciprocity between group ring and commutator algebra. The object 


of our investigation is the ring % of all linear transformations A = || ax || : 
(4.1) fi = LD airfi 
k=l 


in R which commute with the operators s induced in R by the elements s of y. 
For each A in &, (4.1) thus implies 


(4.2) sf; = Xi ain-Sfe or f; = Di aiefs. 


For each A in & the multiple aA lies in % provided the number a is self-con- 
jugate: a* = a for all group elements s. Hence & is an algebra in the subfield 
of self-conjugate numbers rather than in k itself; nevertheless we venture to 
speak of 9% as the commutator algebra. 

The complete reciprocity between r under the influence of (p) and ® under 
the influence of & can now be established in the same manner as l.c. §4. Terms 
like “invariant”, “irreducible”, “similar” or “equivalent”, when applied to r 
or ® refer to the algebra (p) of operators x — ax or to the algebra WI respectively. 
The term “linear subspace” is in $\mathfrak{r}$ to be interpreted as demanding closure with
respect to addition and the ordinary multiplication $x \to \alpha x$ (not the modified
aft multiplication x — xa) by numbers a ink. p being such a linear subspace of 
t we introduce the corresponding subspace 2 = *p of M as the set to which a 
vector f belongs if f; is in p fori = 1,---,n. (4.2) shows at once that § is in- 
variant. Vice versa, if 8 is a given linear subspace of R, we define p = 4% as 
the linear closure of all the quantities f; (¢ = 1, --- , n) arising from vectors f in $. 
If 


I? « ">, (a = 1,2, --- ,m) 
is a linear basis of the m-dimensional vector space {, then 4 consists of all 
quantities x of the form 

gr= Doh? = D (yet) (y{*’ arbitrary). 


ai 
In particular we set 
aR = To. 


According to Lemma 3A, 4&8 is an invariant subspace of t.. Moreover, by 
definition, 


5D <p, B< #58. 


Inclusion can here be replaced by equality: * and 4 are inverse operations pro- 
vided we limit ourselves within R to the invariant subspaces Y-and within t to the 
invariant subspaces » of t.. Besides, those operations are conservative as to 
reduction, decomposition and equivalence. We exhibit these facts in two 
theorems: 

TueoreM 4A. If p(p’, pi, pe) are any invariant subspaces of t. and $ = Xp, 
then 


p’ < p, p= pi + pe, Di ™ De 
imply 
Pp’ < X, P= .4+ 2, Zi ~ Be 
respectively, while conversely, 
p= 58. 
TueoreM 4B. Jf $ (B’, Bi, Be) are any invariant subspaces of R and p = 4¥, 
then 
B= #p 
and 
Pp’ < %, Z=+ 2, i ~ PB 
imply 
p’ <p, p= + pe, Pi ~ Pe 


respectively. 










Before proceeding to the almost literal repetition of the proofs, we make this 
remark. If é is the idempotent generator of an invariant p, then the correspond- 
ing % = #p consists of all vectors of the form ef. Indeed, g = ef lies in $ 
because g; = f;6 by Lemma 3B, and for each f in $ one has ef = f. 

1. We prove the first part of Theorem 4A by observing that the decomposition 
~» = pi + pe when applied to an idempotent generator é of p: 6 = €; + € leads 
to this decomposition 2 = B, + 2. of B = Kp: 


F=eF =e Ff +eF = F, + F; (F in §$). 


Lemma 2B allows us to shear all the roofs off the relations 


+ 


é:ég = &é; = 0 (€:é: = &:, dé: = &), 
thus warranting the independence of the parts 2; , Be : 
e, Ff: = 0, eF; = 0. 


2. The similarity correspondence between p; and pe 
’ 


X2 = x1b, x1 = xb’, 
gives rise to the mutually inverse transformations 
fr = bhi, fi=d’fr 
between the vectors fi , fo of B:1 = *p, and B. = #p2. By (4.2) these formulas 
establish a similarity correspondence: 
fi = Af, entails bf, = A(bf,) {A in %}. 


To secure the last part, p < 49, we construct 4 by means of the idempotent 
generator é of p as follows: if g“ (a = 1, ---, m) ranges over a basis of the 
complete vector space ®, all f°” = eg lie in 8 = #p, and hence 


y= Di eh sais > (op #™) 


in 8%. On introducing 
- im Ys (e@ g), 


we have y = xé. So xé lies in 43 if x lies in m = 59. But each x in p satisfies 
both conditions: x in t} and xé = x. 

The converse Theorem 4B exhibits the really important facts. Its assertion 
that p = ’B implies 2 = *#p for any subspace ¥ invariant under Y% is the back- 
bone of the whole theory. Let é@ be the idempotent generator of p = 4%. 
Like all elements of p it is of the form 


&s) = Diyi*-sfi”, 


a,t 
where 


| ae ne (§*", wee, (e)) 










ranges over a basis of 2. Hence by Lemma 3C, 


e(s) = > sei” -fi*’, 


and any vector g= ef of # p is given by 
(43) n= L (Lars) = De”, 


where 
(a) (a) 
Qik ™ dD sfi-sei* : 
s 


(a) 


Each term g‘® of the sum (4.3), g = 2g‘, arises from f‘” by a linear trans- 
formation A‘ = || a{f’ || which according to the same lemma commutes with 
all s. Hence §, being invariant with respect to the transformations A of the 
commutator algebra %, contains g‘ as well as f‘”’. This proves our statement: 
gin Bor #p < §&. 

The decomposition $2 = Y, + YP: implies by definition that each quantity 
xin p = 4 can be written as a sum x, + X2, X; in p, = 4B, Xe in po = BPe. 
It remains to prove that p, and p, are linearly independent or that the inter- 
section p* = p; ° pe is empty provided $* = E,° LB, be empty. But according 
to the part of Theorem 4B already proved, 


Kp* < Kp, = £1, ¥p* < 2; 


hence *p* < $* and by the last part of Theorem 4A: p* < §*. 

The transition from 2, ~ Be to p; ~ pe for p; = 421, pe = &P2, is to be based 
on the following statement, the proof of which is contained in Lemma 3B: 

Lema 4C. ft is right as well as left invariant. 

Therefore ro has an idempotent generator i to the left: i in m, ix = x for 
every X in ft. 

Let f* be a basis for 8, and let the given similar mapping of B, on PB. send 
f into g“. When we put 


x = D vi” -fi, 
a, 

(a) (a@) 

» fos Do!" “Si 
a, 


the correspondence x — y between an x and a y with the same coefficients ¢}* 
will establish a similarity mapping of p, = 48 on pp = 4P2, because by Lemma 
3A, we obtain 


(y;*’ arbitrary numbers) 


) 


ax = Dn”, ay = Te ae 


with 


LS oh 


ot le, 


a ee a ee oe 





# 
: 


n> aa A REE OE 


hn ee ere 7 







This definition of x — y, however, goes through only if x = 0 implies y = 0. 
We first prove that 
xf = 0 implies yf = 0 


y( a) 


(for any vector f). By Lemma 3C the vector F = xf is a sum of terms F’™, 
the a-th of which arises from f‘” by the transformation 


A® = || aie? || = | 2 sfi-ses”? |. 


Since A“ is in A, the given similarity mapping of B, on $2 sends F“ into the 
corresponding part G‘” of G = yf and hence F intoG. Therefore F = 0 implies 
G = 0, and more especially, when the numbers ¢g§*? satisfy the equation x = 0, 
we must have yf = 0 for every vector f, or by Lemma 3B, f;-y = 0. Hence 
the given quantity y satisfies zy = 0 for every z in 1, in particular for z = i. 
But as y itself lies in ro , the ensuing equation iy = 0 yields the desired result: 
y = 0. 

The complete reciprocity established by Theorems 4A and B involves the fact 
that the process #* not only changes p = 0 into *p = 0 and a part p’ < p into 
apart *p’ < #p, but alsoap + Ointoa #p ¥ 0 and a proper part into a proper 
part provided the p’s are invariant subspaces of t). The decomposition of to into 
irreducibly invariant subspaces » leads to a decomposition of R into subspaces 
trreducibly invariant under the algebra A, and both decompositions run absolutely 
parallel even as to the pertaining equivalences. 


5. Representations of the group ring. The roof operation. Not so simple 
is the relationship of the abstract group ring of our quantities a to its represen- 
tation by the operators 


J—af 


in ® which form a homomorphic operator algebra 8, although % also breaks up 
into irreducible parts similar to the irreducible parts of p. For a given vector f 
and a given invariant subspace p of rt we denote by p(f) the set of vectors xf 


arising from f by all thexinp. Let f‘” (a = 1, --- , 2) be a basis for the n-space 
® and 
(5.1) t=ptpt::: 


a decomposition of r into subspaces irreducibly invariant under (p). Our 
statement is proved in the well-known manner by going through the list 


nlf), tate! n(f), 
po(f), «++, pf), 


‘ema 2 Ce ao Oe ee 


and dropping a term each time it is contained in the sum of the preceding ones. 
The construction depends on the choice of the codrdinate system f‘ as well as 










the decomposition (5.1), and does not result in such a thoroughgoing parallelism 
as we encountered for the commutator group. Coincidence between the num- 
bers of equivalent parts is not to be expected. Whereas for the commutator 
algebra it was essential to restrict oneself to the two-sided invariant subspace
ty, t is here to be taken modulo that two-sided invariant subspace whose ele- 
ments a satisfy the equation af = 0 identically in the vector f. 

Considering all these circumstances, it seems to me inadequate to avail one- 
self of the reciprocity’ between a matric algebra B and its commutator algebra 
Y% for getting a hold on A through %, although the most essential point, the full 
reducibility of 4%, can be reached by this method applicable to any abstract 
semi-simple algebra p. One could visualize the situation as follows. One 
decomposes the “one’’ 1 of p into independent primitive idempotents: 


1=e@ + @+:::; 


e,e, = e; or 0 according asi = kori #k. (An idempotent e is primitive if not 
allowing of a decomposition e; + @: into two independent idempotents except 
the trivial ones e + Oand0+e.) It can be shown then that 


x=xe,+xe+---andf=ef+ef+-:: 
result in a decomposition of r and ¥ respectively, 
t=pAtmt---, R= Bth+---, 


into irreducibly invariant subspaces. Again, f — af is a given representation 
of the elements a of p by operators in ®, and invariance in r refers to the opera- 
tions x — ax in ® to the algebra % of linear transformations commuting with 
the operations f — af. However, the correspondence thus established between 
the parts p; of rand 8B; of R depends on the choice of the idempotent generators 
e; of p;. To make the correspondence independent, one must match the sub- 
space $B of the vectors ef against the subspace p of the quantities xé rather than 
xe. So one may say that the more elementary and complete reciprocity we 
expatiated on in the preceding sections is due to the existence of the operation $\hat{\ }$
in a group ring while missing in an arbitrary abstract algebra. The rôle of $\hat{\ }$ is
further clarified by the following.

THEOREM 5A. If the invariant subspaces $\mathfrak{p}_1$, $\mathfrak{p}_2$ generated by the idempotents
$e_1$, $e_2$ are equivalent to each other, so are the invariant subspaces $\hat{\mathfrak{p}}_1$, $\hat{\mathfrak{p}}_2$ generated by
$\hat{e}_1$, $\hat{e}_2$. $\mathfrak{p}$ and $\hat{\mathfrak{p}}$ are the substrata of contragredient representations.

Let the one-to-one similarity mapping $x_1 \to x_2$ of $\mathfrak{p}_1$ on $\mathfrak{p}_2$ carry $e_1$ into b and
in the inverse direction $e_2$ into a. We then have

(5.2) $x_2 = x_1 b, \qquad x_1 = x_2 a.$


2 See, for instance, Weyl, Ann. Math., vol. 37 (1936), p. 718, Th. (1.4-B). Even the above 
statement should be made with reservation, since 8 is not a matric algebra, and & not a 


matric algebra in the strict sense. 
* Compare (l.c.) pp. 352-354. 










COMMUTATOR ALGEBRA OF FINITE GROUP OF COLLINEATIONS 211 


b satisfies $e_1 b = b$, and like every other element of $\mathfrak{p}_2$, $b e_2 = b$. Hence

(5.3) $e_1 b e_2 = b, \qquad e_2 a e_1 = a.$

Moreover, if we put $x_1 = a$ in the first and $x_2 = b$ in the second of the equations
(5.2), we obtain

(5.4) $e_2 = ab, \qquad e_1 = ba.$

Conversely the relations (5.4) guarantee that (5.2) are reciprocal mappings
$\mathfrak{p}_1 \rightleftarrows \mathfrak{p}_2$. We need only to "roof" these equations (5.3), (5.4) in order to conclude
that $\hat{e}_1$, $\hat{e}_2$ are linked by $\hat{b}$, $\hat{a}$ in the same fashion as $e_1$, $e_2$ by a, b.

To prove the second part we have to introduce the notion of trace: the trace 
of a quantity a, tr(a), is its unit component a(1). The trace tr(zy) of the product 
of two variable quantities x, y is bilinear in the sense that it satisfies the dis- 
tributive law with respect to decomposition z = x, + 22 of the first as well as 
the second factor, and that it takes on the numerical factor a if one replaces 
zx by az or y by ya (distinguish between fore and aft multiplication). Moreover, 


(5.5) tr(zy) = Dd) 2(s)y"(s") 


is a non-degenerate bilinear form: each of the equations tr(az) = 0 or tr(za) = 0 
when holding identically in z leads toa = 0. (5.5) is not symmetric in z and y 
as is the case for the ordinary group ring. However, the obvious equation 
tr(@) = tr(a), together with Lemma 2B, establish the following modified law 
of symmetry: 

tr(yx) = tr(zy). 
One readily verifies 
(5.6) tr(srs*) = tr’(z). 

e being a given idempotent, let $\mathfrak{p}$ and $\mathfrak{q}$ be the left and the right invariant subspaces
consisting of the quantities xe and ex respectively. We assert that tr(xy)
is non-degenerate if x and y vary in $\mathfrak{p}$ and $\mathfrak{q}$ respectively. Indeed, if z is any
element whatever and a in $\mathfrak{p}$, then

$$az = ae\cdot z = a\cdot ez = ay,$$

where y = ez is in $\mathfrak{q}$. Hence the assumption tr(ay) = 0 for y in $\mathfrak{q}$ implies tr(az) = 0
for all z, whence a = 0. Similarly for the second factor. We now refer $\mathfrak{p}$
and $\mathfrak{q}$ each to a coördinate system $a_i$ and $b_i$ such that

(5.7) $x = \xi_1 a_1 + \cdots + \xi_g a_g, \qquad y = \eta_1 b_1 + \cdots + \eta_h b_h$

describe $\mathfrak{p}$ and $\mathfrak{q}$ if the numbers $\xi$ and $\eta$ vary freely in k. From the non-degeneracy
of tr(xy) for (5.7) follows readily the coincidence h = g of the dimensions of
$\mathfrak{p}$ and $\mathfrak{q}$ and the possibility of adapting the coördinate system $b_i$ in $\mathfrak{q}$ to the arbitrarily
chosen coördinate system $a_i$ in $\mathfrak{p}$ such that

$$\operatorname{tr}(xy) = \xi_1\eta_1 + \cdots + \xi_g\eta_g$$

for x in $\mathfrak{p}$ and y in $\mathfrak{q}$.
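The step from non-degeneracy to h = g and to the adapted coördinate system can be sketched as follows. This is only an illustration, assuming that the numbers $\xi$, $\eta$ lie in a commutative field k; the matrix $T$ and the adapted elements $b'_k$ are notation introduced here, not the author's.

```latex
% Sketch under the stated assumptions; T and b'_k are illustrative notation.
\[
  T_{ik} = \operatorname{tr}(a_i b_k) \quad (i = 1,\dots,g;\ k = 1,\dots,h),
  \qquad
  \operatorname{tr}(xy) = \sum_{i,k} \xi_i\, T_{ik}\, \eta_k .
\]
% Non-degeneracy in the first factor: \xi^{T} T = 0 forces \xi = 0, so rank T = g \le h.
% Non-degeneracy in the second factor: T \eta = 0 forces \eta = 0, so rank T = h \le g.
\[
  g = h, \qquad \det T \neq 0 .
\]
% Adapting the coordinate system in q to the one chosen in p:
\[
  b'_k = \sum_{j} b_j\,(T^{-1})_{jk}
  \quad\Longrightarrow\quad
  \operatorname{tr}(a_i b'_k) = \sum_{j} T_{ij}\,(T^{-1})_{jk} = \delta_{ik},
\]
% so that, in the new coordinates \eta'_k of y,
\[
  \operatorname{tr}(xy) = \xi_1 \eta'_1 + \cdots + \xi_g \eta'_g .
\]
```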









We now consider the simultaneous substitutions

(5.8) $x' = sx, \qquad y' = ys^{-1}$

in $\mathfrak{p}$ and $\mathfrak{q}$ respectively. The y-substitution may also be put into the form

(5.9) $\hat{y}' = s\hat{y},$

where

$$\hat{y} = \eta_1 \hat{b}_1 + \cdots + \eta_g \hat{b}_g$$

now varies in the left invariant subspace $\hat{\mathfrak{p}}$ generated by $\hat{e}$. We write (5.8), (5.9)
in terms of the coördinates as

$$\xi_i' = \sum_k u_{ik}(s)\,\xi_k, \qquad \eta_i' = \sum_k v_{ik}(s)\,\eta_k.$$

The relation (5.6) yielding

$$\operatorname{tr}(x'y') = \operatorname{tr}(xy) \quad\text{or}\quad \sum_i \xi_i'\eta_i' = \sum_i \xi_i\eta_i$$

proves the two matrices

$$\|u_{ik}(s)\|, \qquad \|v_{ik}(s)\|$$

to be contragredient. Hence the regular representation of $\mathfrak{r}$ induces in $\mathfrak{p}$ and $\hat{\mathfrak{p}}$
contragredient representations of $\gamma$ (assuming coördinate systems in $\mathfrak{p}$ and $\hat{\mathfrak{p}}$
properly adapted to each other). Observe that passage to the contragredient
matrices $\check{U}(s)$ in a collinear representation $s \to U(s)$ produces a representation
again, as is readily seen from equation (2.2).


THE INSTITUTE FOR ADVANCED STUDY.






















TAYLOR’S SERIES OF ENTIRE FUNCTIONS OF SMOOTH GROWTH 


By N. WIENER AND W. T. Martin 


1. Introduction.¹ Let $a_n \ge 0$ $(n = 1, 2, \cdots)$, and

(1) $\sum_{1}^{\infty} a_n z^n \sim sH(z) \qquad (z \to \infty).$

We may rewrite (1) in the form

(2) $\sum_{1}^{\infty} a_n e^{n\xi - F(\xi)} \to s \qquad (\xi \to \infty),$

where

(3) $H(z) = e^{F(\log z)}.$

We shall derive the following Tauberian theorem.

THEOREM 1. Let $a_n \ge 0$ $(n = 1, 2, \cdots)$, and let (2) hold, where

(4) F is four times continuously differentiable for $a \le x$ for some a;

(5) $F''(x) \ge \text{const.} > 0$ for $a \le x$;

(6) $F'''(x) = o([F''(x)]^{3/2}) \qquad (x \to \infty);$

(7) $F''''(x) = o([F''(x)]^{2}) \qquad (x \to \infty).$

Then for any positive $\lambda$,

(8) $\lim_{x\to\infty} (2\pi)^{1/2}\,\dfrac{1}{\lambda} \sum_{x \le \psi(n) \le x+\lambda} a_n e^{nG(n) - F(G(n))} = s,$

where $\psi(x)$ is defined by

(9) $\int_a^x [G'(t)]^{1/2}\,dt = \psi(x),$

and G(x) is the inverse function to F'(x) for $a \le x$.

As a converse to this theorem we shall prove

THEOREM 2. Let $a_n \ge 0$ $(n = 1, 2, \cdots)$, and let (8) hold for every positive
value of $\lambda$, where F fulfills the conditions (4), (5), (6) and (7) and $\psi$ and G are
defined as in Theorem 1. Then (2) holds.
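As an illustration of how Theorem 1 reads in a concrete case, one may take $a_n = 1/n!$. The example is not from the paper; it is a sketch that uses only Stirling's formula as an outside fact, takes the lower limit a in (9) to be any fixed positive number, and treats the count of integers in the window heuristically.

```latex
% Illustrative check of Theorem 1 with F(x) = e^x, a_n = 1/n!, s = 1.
\[
  \sum_{n=1}^{\infty} \frac{e^{n\xi}}{n!}\, e^{-e^{\xi}}
  = \bigl(e^{e^{\xi}} - 1\bigr)e^{-e^{\xi}} \to 1
  \qquad (\xi \to \infty),
  \qquad F(x) = e^{x}, \quad s = 1 .
\]
% Conditions (5)-(7):
\[
  F''(x) = e^{x} \ge e^{a} > 0, \qquad
  F'''(x) = e^{x} = o\bigl(e^{3x/2}\bigr), \qquad
  F''''(x) = e^{x} = o\bigl(e^{2x}\bigr).
\]
% Inverse function of F'(x) = e^x and the function \psi of (9):
\[
  G(w) = \log w, \qquad G'(w) = \frac{1}{w}, \qquad
  \psi(x) = \int_a^x t^{-1/2}\,dt = 2\sqrt{x} - 2\sqrt{a}.
\]
% Summand of (8), by Stirling's formula:
\[
  a_n e^{nG(n) - F(G(n))} = \frac{n^{n} e^{-n}}{n!} \sim \frac{1}{\sqrt{2\pi n}} .
\]
% The window x \le \psi(n) \le x + \lambda contains about \lambda x / 2 integers n,
% all with \sqrt{n} \sim x/2, so that
\[
  (2\pi)^{1/2}\,\frac{1}{\lambda} \sum_{x \le \psi(n) \le x+\lambda} \frac{n^{n}e^{-n}}{n!}
  \;\sim\; (2\pi)^{1/2}\,\frac{1}{\lambda}\cdot\frac{\lambda x}{2}\cdot\frac{1}{\sqrt{2\pi}\,(x/2)}
  \;=\; 1 \;=\; s .
\]
```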


Received January 19, 1937. 


¹ The authors' attention was directed to problems of this type by Professor Vijayaraghavan. It has come to our attention that a similar group of problems has been recently attacked by Mr. Kales of Brown University by quite different methods. While neither direction of work is reducible to the other, Mr. Kales' priority in entering the field is clear.







In the next theorem we place some additional restrictions on F and we obtain
certain inequalities related to $\psi$.

THEOREM 3. (i) Let F fulfill the conditions (4) and (5) and the further condition


(10) for some positive «, F(x) = O({F’(x)]'*5 (x — ~). 
Then for any positive $\epsilon$,

(11) $n^{-\frac12-\epsilon} < \psi'(n)$

and

(12) $n^{\frac12-\epsilon} < \psi(n),$

for sufficiently large values of n. (ii) Let F fulfill the conditions (4) and (5) and
the additional condition

(13) F(x) is ultimately greater than any power of x.

Then for any positive $\epsilon$,

(14) $\psi(n) < n^{\frac12+\epsilon}$

for sufficiently large values of n.

We shall obtain the following theorems on entire functions of smooth growth 
as immediate consequences of these Tauberian theorems. 

THEOREM 4. Let

(15) $f(z) = \sum_{0}^{\infty} b_n z^n$

be an integral function, and let

(16) $\dfrac{1}{2\pi}\int_0^{2\pi} |f(re^{i\theta})|^2\,d\theta \sim s\,e^{F(2\log r)} \qquad (r \to \infty),$

where F fulfills the conditions (4), (5), (6) and (7). Then for any positive $\lambda$,

(17) $\lim_{x\to\infty} (2\pi)^{1/2}\,\dfrac{1}{\lambda} \sum_{x \le \psi(n) \le x+\lambda} |b_n|^2 e^{nG(n) - F(G(n))} = s,$

where $\psi$ and G are defined as in Theorem 1.

The converse to Theorem 4 is 

THEOREM 5. Let (15) be an entire function and let (17) hold for every positive
value of $\lambda$, where F fulfills the conditions (4), (5), (6) and (7), and G and $\psi$ are
defined as in Theorem 1. Then (16) holds.

If we place additional conditions on F, we have the following gap theorem. 

THEOREM 6. Let (15) be an entire function, and let (16) hold, where $s \ne 0$ and
F satisfies the conditions (4), (5), (6), (7), (10) and (13). Then (17) holds where
$\psi$ and G are defined as in Theorem 1. Furthermore, for any positive $\epsilon$ we shall
always have (11) and

(18) $n^{\frac12-\epsilon} < \psi(n) < n^{\frac12+\epsilon}$

for sufficiently large values of n. Thus the function (15) can have only a finite
number of gaps of magnitude $(\nu, \nu + \nu^{\frac12+\epsilon})$, that is, for any positive $\epsilon$, the equations

$$a_n = a_{n+1} = \cdots = a_{n+[n^{\frac12+\epsilon}]} = 0$$

can hold for at most a finite number of values of n.
As a consequence of Theorems 1 and 2 we shall derive the following non-linear
Tauberian theorem.

THEOREM 7. Let $a_n \ge 0$ $(n = 1, 2, \cdots)$, and let

(19) $c_n = \sum_{m=1}^{n-1} a_m a_{n-m} \qquad (n = 1, 2, \cdots).$

If F fulfills the conditions (4), (5), (6) and (7) and if $\psi$ and G are defined as in
Theorem 1, then the following two statements are equivalent: (i) the relation (8)
holds for every positive value of $\lambda$; (ii) the relation

(20) $\lim_{x\to\infty} (2\pi)^{1/2}\,\dfrac{1}{\lambda} \sum_{x \le \sqrt{2}\,\psi(\frac12 n) \le x+\lambda} c_n e^{nG(\frac12 n) - 2F(G(\frac12 n))} = s^2$

holds for every positive value of $\lambda$.
For the proof of Theorem 1 we shall put

(21) $\xi = \varphi(u + \psi(n)),$

so that u occurs as the difference of two variables

(22) $u = \varphi^{-1}(\xi) - \psi(n).$

Here $\varphi$ and $\psi$ are functions to be determined with regard to the following considerations.
We shall develop the exponent $n\xi - F(\xi)$ in a Taylor's series with
remainder and we shall determine $\varphi$ and $\psi$ in such a manner that the coefficient
of u shall vanish and that the coefficient of $\frac12 u^2$ shall be negative unity. Then we
shall use the conditions on F to show that the remainder in the Taylor's series
approaches zero as $\xi \to \infty$; indeed, that it approaches zero in such a manner
that (2) shall imply that

(23) $\sum_{1}^{\infty} a_n e^{nG(n) - F(G(n))} e^{-\frac12 u^2} \to s \qquad (\xi \to \infty).$

We shall then show that the General Tauberian Theorem of Wiener² is applicable
to (23). Here the kernel K(u) is $e^{-\frac12 u^2}$, whose Fourier transform $e^{-\frac12 x^2}$ is non-vanishing,
and, as we have noted, u occurs as the difference of two variables.
The relation (8) will follow immediately.

Theorem 2, the converse of Theorem 1, will follow by an application of the
converse portion of the General Tauberian Theorem just referred to. The other
theorems will follow at once.


2. Let us put 
(21) $\xi = \varphi(u + \psi(n)).$
2 N. Wiener, Tauberian theorems, Annals of Mathematics, (2), vol. 33 (1932), pp. 1-100. 













We have, formally, on holding n fast and developing in powers of u,

(24)
$$\begin{aligned}
n\xi - F(\xi) &= n\varphi(u + \psi(n)) - F(\varphi(u + \psi(n)))\\
&= n\varphi(\psi(n)) - F(\varphi(\psi(n)))\\
&\quad + u\,[n - F'(\varphi(\psi(n)))]\,\varphi'(\psi(n))\\
&\quad + \frac{u^2}{2}\Bigl\{[n - F'(\varphi(\psi(n)))]\,\varphi''(\psi(n)) - F''(\varphi(\psi(n)))\,[\varphi'(\psi(n))]^2\Bigr\}\\
&\quad + \frac{u^3}{6}\,\frac{d^3}{dv^3}\bigl[n\varphi(v) - F(\varphi(v))\bigr]_{v = \theta u + \psi(n)},
\end{aligned}$$

where $0 \le \theta = \theta(u, n) \le 1$.

We desire that the coefficient of u shall be zero, and the coefficient of $\frac12 u^2$
negative unity, whether or not n is an integer. Necessary conditions for this
are, formally, that

(25) $F'(\varphi(\psi(w))) = w,$

(26) $\varphi'(\psi(w)) = \psi'(w),$

for, as regards the latter, the coefficient of $\frac12 u^2$ is

(27) $\Bigl\{\dfrac{d}{dw}\bigl[w\varphi'(\psi(w)) - F'(\varphi(\psi(w)))\,\varphi'(\psi(w))\bigr] - \varphi'(\psi(w))\Bigr\}\Big/\psi'(w).$

Denoting by G(w) the inverse function to F'(w), we see that (25) and (26)
become

(28) $\varphi(\psi(w)) = G(w),$

(29) $[\psi'(w)]^2 = G'(w).$

For our functions $\varphi$, $\psi$ we are thus led to take

(30) $\psi(w) = \int_a^w [G'(t)]^{1/2}\,dt, \qquad \varphi(w) = G(\psi^{-1}(w)),$

where a is so large that F is four times continuously differentiable for $w \ge a$
and $0 < \text{const.} \le F''(w)$. With $\psi$, $\varphi$ so defined, (24), with w in place of n,
is valid for w and $u + \psi(w)$ sufficiently large and (25), (26), (28) and (29) are
valid for $w \ge a$.
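A quick check of the choice (30) in the simplest case may be helpful. The following sketch, which is not part of the paper, takes $F(x) = \frac12 x^2$, for which the expansion (24) terminates and the remainder vanishes identically.

```latex
% Illustrative check of (25)-(30) with F(x) = x^2/2 (an assumed example).
\[
  F'(x) = x, \quad F''(x) = 1, \quad F''' = F'''' = 0,
  \qquad G(w) = w, \quad G'(w) = 1 .
\]
% The functions of (30):
\[
  \psi(w) = \int_a^w dt = w - a, \qquad
  \varphi(v) = G(\psi^{-1}(v)) = v + a, \qquad
  \xi = \varphi(u + \psi(n)) = u + n .
\]
% The exponent n\xi - F(\xi) in the variable u:
\[
  n\xi - F(\xi) = n(u + n) - \tfrac12 (u + n)^2
  = \tfrac12 n^2 - \tfrac12 u^2
  = \bigl[nG(n) - F(G(n))\bigr] - \tfrac12 u^2 ,
\]
% so the coefficient of u is zero, the coefficient of u^2/2 is -1,
% and there is no remainder term.
```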

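The last term of (24) may be written

(31)
$$\begin{aligned}
\frac{u^3}{6}\Bigl\{& w\,\varphi'''(\theta u + \psi(w)) - F'''(\varphi(\theta u + \psi(w)))\,[\varphi'(\theta u + \psi(w))]^3\\
& - 3F''(\varphi(\theta u + \psi(w)))\,\varphi'(\theta u + \psi(w))\,\varphi''(\theta u + \psi(w))\\
& - F'(\varphi(\theta u + \psi(w)))\,\varphi'''(\theta u + \psi(w))\Bigr\}.
\end{aligned}$$

Differentiating (25) and using (26), we have

(32) $F''(\varphi(\psi(w)))\,[\varphi'(\psi(w))]^2 = 1.$

For a general argument z, (32) becomes

(33) $F''(\varphi(z))\,[\varphi'(z)]^2 = 1,$

and yields on differentiation

(34) $F'''(\varphi(z))\,[\varphi'(z)]^3 + 2F''(\varphi(z))\,\varphi'(z)\,\varphi''(z) = 0.$

Thus if $\theta u + \psi(w) = z$, (31) becomes

(35)
$$\frac{u^3}{6}\bigl\{w\varphi'''(z) - F''(\varphi(z))\,\varphi'(z)\,\varphi''(z) - F'(\varphi(z))\,\varphi'''(z)\bigr\}
= \frac{u^3}{6}\Bigl\{[w - F'(\varphi(z))]\,\varphi'''(z) - \frac{\varphi''(z)}{\varphi'(z)}\Bigr\}.$$

Now, by the law of the mean, there will be some v in the interval between $\psi(w)$
and z such that

(36) $F'(\varphi(\psi(w))) - F'(\varphi(z)) = -\theta u\,F''(\varphi(v))\,\varphi'(v) = -\dfrac{\theta u}{\varphi'(v)}.$

Expression (35) thus becomes

(37) $-\dfrac{u^3}{6}\Bigl\{\theta u\,\dfrac{\varphi'''(z)}{\varphi'(v)} + \dfrac{\varphi''(z)}{\varphi'(z)}\Bigr\}$

and

(38) $e^{w\xi - F(\xi)} = \exp\Bigl\{w\varphi(\psi(w)) - F(\varphi(\psi(w))) - \tfrac12 u^2 - \dfrac{u^3}{6}\Bigl[\theta u\,\dfrac{\varphi'''(z)}{\varphi'(v)} + \dfrac{\varphi''(z)}{\varphi'(z)}\Bigr]\Bigr\}.$

We shall now prove two facts relating to the function

(39) $A(u, \xi) = w\xi - F(\xi) - wG(w) + F(G(w)),$

where

(40) $\xi = \varphi(u + \psi(w)).$

(A) For any positive U

(41) $\max_{|u| \le U} \bigl|A(u, \xi) + \tfrac12 u^2\bigr| \to 0 \qquad (\xi \to \infty);$

(B) there are constants $w_0$ and $\xi_0$ such that, if

(42) $2 \le |u|, \qquad u \le \varphi^{-1}(\xi) - \psi(w_0), \qquad \xi_0 \le \xi,$

then

(43) $A(u, \xi) \le -|u|/2.$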











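For the proof of (A) let us differentiate

(44) $\varphi'(\psi(x)) = [G'(x)]^{1/2}$

to obtain

(45) $\varphi''(\psi(x))\,\psi'(x) = \tfrac12 [G'(x)]^{-1/2}\,G''(x).$

Thus

(46) $\varphi''(\psi(x)) = \dfrac{G''(x)}{2G'(x)},$

(47) $\varphi'''(\psi(x))\,\psi'(x) = \dfrac{G'(x)G'''(x) - [G''(x)]^2}{2[G'(x)]^2},$

(48) $\dfrac{\varphi''(\psi(x))}{\varphi'(\psi(x))} = \dfrac{G''(x)}{2[G'(x)]^{3/2}},$

(49) $\dfrac{\varphi'''(\psi(x))}{\varphi'(\psi(x))} = \dfrac{G'(x)G'''(x) - [G''(x)]^2}{2[G'(x)]^3}.$

Translated into terms of G, (6) and (7) become

(50) $G''(x) = o([G'(x)]^{3/2}) \qquad (x \to \infty),$

(51) $G'''(x) = o([G'(x)]^{2}) \qquad (x \to \infty).$

In view of (50) and (51) it follows from (48) and (49) that

(52) $\dfrac{\varphi''(x)}{\varphi'(x)} \to 0 \qquad (x \to \infty),$

(53) $\dfrac{\varphi'''(x)}{\varphi'(x)} \to 0 \qquad (x \to \infty).$

Now if $|x - y| <$ const., then

(54) $\dfrac{\varphi'(x)}{\varphi'(y)} \to 1 \qquad (x \to \infty),$

since

(55) $\log \dfrac{\varphi'(x)}{\varphi'(y)} = \int_y^x \dfrac{\varphi''(t)}{\varphi'(t)}\,dt \to 0.$

The relations (52), (53) and (54) yield the result (A).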
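The translation of (6) and (7) into conditions on G can be spelled out as follows. This is a sketch of the omitted computation, using only that G is the inverse function of F', so that $F''(G(x))\,G'(x) = 1$; the arguments of F and its derivatives are understood to be G(x).

```latex
% Derivatives of the inverse function G of F' (F and its derivatives evaluated at G(x)).
\[
  G'(x) = \frac{1}{F''(G(x))}, \qquad
  G''(x) = -F'''\,[G'(x)]^{3}, \qquad
  G'''(x) = -F''''\,[G'(x)]^{4} + 3\,[F''']^{2}\,[G'(x)]^{5} .
\]
% Condition (6), F''' = o([F'']^{3/2}) = o([G']^{-3/2}), gives
\[
  G''(x) = o\bigl([G'(x)]^{-3/2}\bigr)\,[G'(x)]^{3} = o\bigl([G'(x)]^{3/2}\bigr).
\]
% Conditions (6) and (7), F'''' = o([G']^{-2}) and [F''']^2 = o([G']^{-3}), give
\[
  G'''(x) = o\bigl([G'(x)]^{-2}\bigr)\,[G'(x)]^{4} + o\bigl([G'(x)]^{-3}\bigr)\,[G'(x)]^{5}
          = o\bigl([G'(x)]^{2}\bigr).
\]
```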


For the proof³ of (B) let us differentiate $A(u, \xi)$ partially with respect to u,

(56) $\dfrac{\partial A}{\partial u} = \dfrac{G(w) - \xi}{\psi'(w)} = \dfrac{\varphi(\psi(w)) - \varphi(u + \psi(w))}{\varphi'(\psi(w))} = -u\,\dfrac{\varphi'(\theta u + \psi(w))}{\varphi'(\psi(w))}.$

³ This proof has been revised in accordance with a suggestion of J. J. Gergen.

Now by (54),

(57) $\dfrac{\partial A}{\partial u} \to -u \qquad (\xi \to \infty)$

uniformly for $|u| \le 2$. Next let $\xi_0$ be determined so that
(58) —u-4s4%e-u44; Awd s -Ww4+1, 


for $|u| \le 2$, $\xi_0 \le \xi$ and let $w_0$ be so determined that

(59) $|G''(w)| \le [G'(w)]^{3/2}$

for $\psi(w) \ge \psi(w_0)$. We shall now show that (B) holds with these values $\xi_0$, $w_0$.
In fact, let & < & and let a be any value of u such that 





(60) a <¢ (t) — ¥(w) 
and 
(61) kK <1. 
| OU _Jumi 
Then, since 
, GA G’(w) aA 
it follows that 
aA 
(63) -1s%4] 5-1. 


In view of (56) it follows that 

>0 ifu < 0, 
(64) = ‘ 
du <0 ifu > 0, 


and hence 


|aA 


(65) i | <1 for u on the open interval (0, @). 
Since 
aA aA 
66 — ee ee = > 
(66) du lL =-1, au i= =I, 
it follows that 
(67) |@| s 3. 
Accordingly, 
(68) =< <-1 forfoSt, 2sSusy '(é)—Vwo); 
- 0A " = 
(69) = > 1 fori: Si us —-2,us¢ (&) — Ww). 


(B) then follows. 
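The derivative in (56) can be checked directly. The sketch below is our own computation, not the authors' text; it assumes that $\xi$ is held fixed and that w is regarded as the function of u determined by (40), i.e. $\psi(w) = \varphi^{-1}(\xi) - u$.

```latex
% With \xi fixed, (40) gives \psi(w) = \varphi^{-1}(\xi) - u, hence
\[
  \psi'(w)\,\frac{\partial w}{\partial u} = -1 .
\]
% Since F'(G(w)) = w, the w-derivative of A = w\xi - F(\xi) - wG(w) + F(G(w)) is
\[
  \frac{\partial A}{\partial w} = \xi - G(w) - wG'(w) + F'(G(w))\,G'(w) = \xi - G(w),
\]
% and therefore
\[
  \frac{\partial A}{\partial u}
  = \frac{\partial A}{\partial w}\,\frac{\partial w}{\partial u}
  = \frac{G(w) - \xi}{\psi'(w)} .
\]
% One further differentiation under the same convention, with \psi' = [G']^{1/2}:
\[
  \frac{\partial^2 A}{\partial u^2}
  = -\frac{G'(w)}{[\psi'(w)]^{2}} + \frac{G(w) - \xi}{\psi'(w)}\cdot\frac{\psi''(w)}{[\psi'(w)]^{2}}
  = -1 + \frac{G''(w)}{2\,[G'(w)]^{3/2}}\;\frac{\partial A}{\partial u} .
\]
```

Under (59) the last factor multiplying $\partial A/\partial u$ is at most $\frac12$ in absolute value, which is how the bound on $G''$ enters the argument.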



















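Setting

(70) $\rho(\xi) = \varphi^{-1}(\xi) - \psi(w_0),$

we see that (A) and (B) imply not only that for any positive $\lambda$

(71) $\sum_{\nu = -\infty}^{[\rho(\xi)]}\ \max_{\nu - \lambda \le u \le \nu} e^{A(u,\xi)} < M_\lambda < \infty$

for all sufficiently large $\xi$, but also that for any positive $\lambda$

(72) $\lim_{\xi\to\infty} \sum_{\nu = -\infty}^{[\rho(\xi)]}\ \max_{\nu - \lambda \le u \le \nu} \bigl|e^{A(u,\xi)} - e^{-\frac12 u^2}\bigr| = 0,$

for the ends of each function run down, as we have seen, faster than $e^{-\frac12 |u|}$ and
any finite portion of

(73) $\exp\,[w\xi - F(\xi) - wG(w) + F(G(w))]$

approaches $e^{-\frac12 u^2}$ as $\xi \to \infty$.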
Let us consider 
(74) > a e"* -F(§) - > a eX 2 (n)—P(G(m)) nb F()—n@(n) +7 (G(a)) 
ve wo 
Since this approaches s as § > ©, it follows that the sum is < s + 1 foré 2 & 
for some &. But exp {né — F(E) — nG(n) + F(G(n))} is greater than some 
constant h, > 0 for 0 S u S 4, & = &, forsome & = &, where & = g(u + ¥(n)). 
Hence 


~~ nG -_ ; 8 1 
(75) 7 a,e” (n)—F (G(n)) < be ok ; E > §3. 


OSusa at 


Now 0 S u S&S dis equivalent to 





(76) ¢ '() -A S vn) So '() 
ry 
and hence : 
(77) eee ae (v = 0,1,2, ---). j 
rS0(m) Srtr : 
From these facts we prove immediately that : 
(78) > a, 672 F (an) he? —>@ ° (é salt 2); j 

n=wo 
indeed 


| 
i Gp eX GF (a(a)) {ei i ff Pmraten er | 


we 
=| (nto! ju? f—F ({)—nG (n)+F(G(n)) 
(79) $2 max |e" —e” ilies '] 
y= r—lsSusy 






a,err-reotan} —»@ (E-+ @), 


y—-lsgus 








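and this proves (78). If we define

(80) $L(y) = \begin{cases} (2\pi)^{1/2} \displaystyle\sum_{\psi(w_0) \le \psi(n) < y} a_n e^{nG(n) - F(G(n))} & \text{for } y > \psi(w_0),\\[1ex] 0 & \text{for } y \le \psi(w_0), \end{cases}$

then (78) is equivalent to

(81) $\int_{-\infty}^{\infty} e^{-\frac12 (x-y)^2}\,dL(y) \to s\int_{-\infty}^{\infty} e^{-\frac12 u^2}\,du \qquad (x \to \infty).$

Now let us apply the General Tauberian Theorem of Wiener (loc. cit., pp. 27, 28)
to (81). The kernel $e^{-\frac12 u^2}$ has the non-vanishing Fourier transform $e^{-\frac12 x^2}$; the
function L(y) is a monotone function such that

(82) $\int_y^{y+\lambda} dL(y) = (2\pi)^{1/2} \sum_{y \le \psi(n) \le y+\lambda} a_n e^{nG(n) - F(G(n))} < (2\pi)^{1/2} M_\lambda$

and hence if

(83) $K_2(u) = \begin{cases} 1 & \text{for } 0 \le -u \le \lambda,\\ 0 & \text{otherwise,} \end{cases}$

then

(84) $\int_{-\infty}^{\infty} K_2(x - y)\,dL(y) \to s\int_{-\infty}^{\infty} K_2(u)\,du = \lambda s.$

That is,

(85) $\int_x^{x+\lambda} dL(y) \to \lambda s \qquad (x \to \infty),$

or

(86) $(2\pi)^{1/2}\,\dfrac{1}{\lambda} \sum_{x \le \psi(n) \le x+\lambda} a_n e^{nG(n) - F(G(n))} \to s \qquad (x \to \infty).$

This concludes the proof of Theorem 1.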


3. For the proof of Theorem 2 we note that the class $\Sigma$ of functions of the
form (83) is such that there is no x for which for every function $K_2(u)$ of $\Sigma$

(87) $\dfrac{1}{(2\pi)^{1/2}} \int_{-\infty}^{\infty} K_2(u)\,e^{iux}\,du = 0.$

Thus (84) implies⁴ (81) and this proves Theorem 2. This argument shows the
validity of Theorem 2 even if (8) holds only for two values of $\lambda$ whose ratio is
irrational.
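The remark about two values of $\lambda$ with irrational ratio can be made explicit by computing the transform in (87) for the kernel (83). The following is a sketch; the sign convention in the exponent is an assumption and does not affect the location of the zeros.

```latex
% Fourier transform of the kernel (83), K_2(u) = 1 on -\lambda \le u \le 0:
\[
  \frac{1}{(2\pi)^{1/2}} \int_{-\lambda}^{0} e^{iux}\,du
  = \frac{1}{(2\pi)^{1/2}}\cdot\frac{1 - e^{-i\lambda x}}{ix} \quad (x \ne 0),
  \qquad
  = \frac{\lambda}{(2\pi)^{1/2}} \quad (x = 0).
\]
% Zeros for a single \lambda:
\[
  x = \frac{2\pi k}{\lambda}, \qquad k = \pm 1, \pm 2, \cdots .
\]
% A common zero for \lambda_1 and \lambda_2 would require nonzero integers k_1, k_2 with
\[
  \frac{2\pi k_1}{\lambda_1} = \frac{2\pi k_2}{\lambda_2}
  \quad\Longleftrightarrow\quad
  \frac{\lambda_1}{\lambda_2} = \frac{k_1}{k_2},
\]
% which is impossible when \lambda_1 / \lambda_2 is irrational.
```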


⁴ See Wiener, loc. cit., p. 26.










4. For the proof of Theorem 3 let (4), (5) and (10) hold. Then for some 
positive 


1 1 


(88) G'(F(x)) = FG) > (FP'@ i > Ss 
and hence for every positive $\epsilon$

(89) $x^{-1-\epsilon} < G'(x);$

(90) $x^{-\frac12-\epsilon} < \psi'(x),$

for sufficiently large values of x. From (90) it follows that for every positive $\epsilon$,

(91) $x^{\frac12-\epsilon} < \psi(x)$

for sufficiently large values of x. This proves part (i) of Theorem 3.

For part (ii) let (4), (5) and (13) hold. In view of (5) and (13) it follows that
G(x) is $O(x^{\epsilon})$ for every $\epsilon > 0$. Applying the Schwarz inequality to (30), we
have

(92)
$$\begin{aligned}
\psi(x) &\le \Bigl[\int_a^x dt\Bigr]^{1/2} \Bigl[\int_a^x G'(t)\,dt\Bigr]^{1/2}\\
&= O\bigl(x^{1/2}\,[G(x)]^{1/2}\bigr)\\
&= O\bigl(x^{\frac12+\epsilon}\bigr),
\end{aligned}$$

which yields (14).


5. Let (15) be an integral function. The condition (16) gives

(93) $\dfrac{1}{2\pi}\int_0^{2\pi} |f(re^{i\theta})|^2\,d\theta = \sum_{0}^{\infty} |b_n|^2 r^{2n} \sim s\,e^{F(2\log r)} \qquad (r \to \infty).$

If we set

(94) $a_n = |b_n|^2, \qquad \xi = \log r^2,$

(93) becomes

(95) $\sum a_n e^{n\xi - F(\xi)} \to s \qquad (\xi \to \infty).$

Applying Theorem 1 to this case, we see that the conclusions of Theorem 4
follow. Theorem 5 is an immediate consequence of Theorem 2. In order to
obtain the result on gaps stated in Theorem 6, let us assume that (11), (12) and
(14) hold for sufficiently large values of n and let us consider the large values of
n for which

(96) $x \le \psi(n) \le x + \lambda.$

Let g(x) be the inverse function to $\psi(x)$. Then the values of n for which (96)
holds are those for which

(97) $g(x) \le n \le g(x + \lambda).$

But

(98)
$$\begin{aligned}
g(x + \lambda) &= g(x) + \lambda g'(x + \theta\lambda) \qquad (0 < \theta \le 1)\\
&= g(x) + \frac{\lambda}{\psi'(g(x + \theta\lambda))}\\
&= g(x) + O\bigl([g(x + \lambda)]^{\frac12+\epsilon}\bigr)\\
&= g(x) + \lambda\,O\bigl([g(x)]^{\frac12+\epsilon}\bigr).
\end{aligned}$$

Thus the values of n for which (96) holds are those for which

(99) $g(x) \le n \le g(x) + \lambda\,O\bigl([g(x)]^{\frac12+\epsilon}\bigr)$

and hence if $s \ne 0$, the function (15) can have only a finite number of gaps of
magnitude $(\nu, \nu + \nu^{\frac12+\epsilon})$.
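6. Let $a_n \ge 0$ $(n = 1, 2, \cdots)$, and let

(100) $c_n = \sum_{m=1}^{n-1} a_m a_{n-m} \qquad (n = 1, 2, \cdots).$

If (8) holds for every positive value of $\lambda$, in view of Theorem 2 it follows that
(2) holds and hence

(101) $\Bigl[\sum_{1}^{\infty} a_n e^{n\xi}\Bigr]^2 e^{-2F(\xi)} \to s^2 \qquad (\xi \to \infty);$

(102) $\sum_{1}^{\infty} c_n e^{n\xi - 2F(\xi)} \to s^2 \qquad (\xi \to \infty).$

In view of Theorem 1, (102) implies that

(103) $\lim_{x\to\infty} (2\pi)^{1/2}\,\dfrac{1}{\lambda} \sum_{x \le \psi_1(n) \le x+\lambda} c_n e^{nG_1(n) - F_1(G_1(n))} = s^2,$

where

(104)
$$\begin{gathered}
F_1(x) = 2F(x); \qquad G_1(x) = G(\tfrac12 x);\\
G_1'(x) = \tfrac12 G'(\tfrac12 x); \qquad \psi_1'(x) = \tfrac{1}{\sqrt{2}}\,\psi'(\tfrac12 x);\\
\psi_1(x) = \sqrt{2}\,\psi(\tfrac12 x) + \text{const.}
\end{gathered}$$

Thus (103) is equivalent to (20). Similarly, if (20) holds for every positive
value of $\lambda$, (102) holds and hence (2) holds. This shows the complete equivalence
of the two statements contained in Theorem 7.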
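The relations between $F_1$, $G_1$, $\psi_1$ and F, G, $\psi$ used above can be verified in a few lines. This is a sketch of the routine computation; $a_1$ denotes the lower limit of integration belonging to $F_1$ and is notation introduced here.

```latex
% Squaring the series: with c_n = \sum_{m=1}^{n-1} a_m a_{n-m},
\[
  \Bigl[\sum_{n \ge 1} a_n e^{n\xi}\Bigr]^{2} = \sum_{n \ge 1} c_n e^{n\xi},
  \qquad\text{so (2) for } (a_n, F, s) \text{ gives (2) for } (c_n, 2F, s^{2}).
\]
% Inverse function of F_1' = 2F':
\[
  2F'(G_1(x)) = x \;\Longrightarrow\; G_1(x) = G(\tfrac12 x), \qquad
  G_1'(x) = \tfrac12\,G'(\tfrac12 x).
\]
% The function \psi_1 of (9), with the substitution t = 2\tau:
\[
  \psi_1(x) = \int_{a_1}^{x} \bigl[\tfrac12 G'(\tfrac12 t)\bigr]^{1/2} dt
  = \sqrt{2}\int_{a_1/2}^{x/2} [G'(\tau)]^{1/2}\,d\tau
  = \sqrt{2}\,\psi(\tfrac12 x) + \text{const.}
\]
% The exponent in (103) is therefore the one appearing in (20):
\[
  nG_1(n) - F_1(G_1(n)) = nG(\tfrac12 n) - 2F(G(\tfrac12 n)).
\]
```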


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 








SUMMABILITY OF CONJUGATE DERIVED SERIES 
By W. C. RANDELS 


We consider a function f(x) with period $2\pi$ and integrable over $(-\pi, \pi)$.
The Fourier series of such a function is

$$\tfrac12 a_0 + \sum_{n=1}^{\infty} (a_n \cos nx + b_n \sin nx),$$

where

$$a_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x) \cos nx\,dx, \qquad b_n = \frac{1}{\pi}\int_{-\pi}^{\pi} f(x) \sin nx\,dx.$$

The conjugate derived series of f(x) is

$$\sum_{n=1}^{\infty} n(a_n \cos nx + b_n \sin nx).$$

We define 
e® = f(x +t) + f(z — t) — f(z) 


and 
o => [ ely) csc ty dy. 
T t 


Throughout this paper we shall suppose that g(t)/t C L on (—7z, x). This 
implies that &(f) C L, since 


® © r r y 
[isoiast [ wf I oty) ese dy| dy = [dy | oly) ese*au| dt 
0 wr Jo t 4r Jo 0 


- of [" ew") 
- of [ |e) |=). 


bec | (y)(t—y)*"*dy—>S as t— +40, 


Ifa > 1 and 


{e-1 


we say that 
conj. der. lim g(t) = S(R’, a). 
If 0 < a S 1, and for some 8 > 1 


am % F me , 
fea I &(y)(t — y)’ * dy —S as t—+40, 
Received June 5, 1935; in revised form, February 7, 1937. The author is indebted to the 
referee for suggestions, which materially simplified the proof. 
224 










we say that 
conj. der. lim g(t) = S(R’, a) 
if 
(1) ‘ [ e(y)y (tt —y)**dy—+0 as t— +0. 
The series 


is said to be (C, a) summable to the sum S if 





l m 
42 2, 42.208 as m-—->®, 
«im n=0 
where 


~ Ta + 1)T(n + 1) 


The theorems which we propose to prove are:

THEOREM I.¹ If

(2) conj. der. lim $\varphi(t) = S(R', \alpha) \qquad (\alpha > 0),$

then the conjugate derived series of f(x) is (C, $\beta$) summable to the sum S for
$\beta > \alpha + 1$.

THEOREM II. If the conjugate derived series of f(x) is (C, $\alpha$) summable, $\alpha \ge 1$, to the sum
S, then for $\beta > \alpha$

conj. der. lim $\varphi(t) = S(R', \beta).$

These theorems are analogous to those proved for Fourier series by Paley²
and Bosanquet,³ for the conjugate series by Paley⁴ and for the derived series by
Takahashi.⁵ These proofs used Riesz summability instead of Cesàro, but the
two methods are known to be equivalent.

We may assume without loss of generality that the point under consideration 
is the point z = 0 and that the function is even. We also shall assume that 


¹ Theorem I for $\alpha = 0$ has been proved by Gupta in a paper of a similar title, Proc. Acad. Sci. Allahabad, vol. 1 (1936), pp. 7-17. A generalization of this has been given by Moursund, Amer. Jour. Math., vol. 57 (1935), pp. 854-860. It might be mentioned that Bosanquet has considered the problem of the boundedness of the (C, $\alpha$) transforms of the sequence $n(a_n \cos nx + b_n \sin nx)$ by similar methods, Trans. Amer. Math. Soc., vol. 39 (1936), pp. 189-205.

² R. E. A. C. Paley, Proc. Cam. Phil. Soc., vol. 26 (1930), pp. 173-203.

³ L. S. Bosanquet, Proc. Lon. Math. Soc., vol. 31 (1930), pp. 144-164.

⁴ L.c.

⁵ T. Takahashi, Tôhoku Math. Jour., vol. 38 (1933), pp. 265-278.










a = Oand f(0) = 0. This is permissible, for otherwise we would deal with the 
function 


fo(t) = fi) — ao — [f(0) + ag] cos t. 
Then at z = 0, g(t) = 2f(t). We have 


¥y 1 [* 
#(t) + i | gly) eot* sy dy = -i | e(y)dy — —ta=0 as t+ +0. 
t t 


Hence, if a > 1, 


a-l 


fet 


t 
(y(t — y)**dy—'S as t—>+0 
0 


implies that 
t r 
= [ {[ g(x) cot 4x az} (t—y)*"*dy>S as t— +40. 
0 v 


Moreover it is clear that for a > 0, 


t 
tf @a-wtay—o as t—+0 
“jo y 


implies that 
t 
| oly) cot by (t _ y)** dy —0 as t>4+40. 
0 


We set g(t) = ¢(t) cot }t and, since g(t)/t CL, 9) CL. Ify) =g(e +8 - 
g(x — t), then in the notation of Paley (2) implies that 

(3) conj. lim y(t) = 4S(R’, a). 

We can easily see that conversely (2) implies (1). The Fourier coefficients of 


g(t) are a, = 0 and 


bn 


2 [ g(t) cot 4¢ sin nt dt 
0 


7 . 1 r 
: [ g(t) Sain + Hy _ 2 [ v(t) cos nt dt. 
wT Jo T Jo 


sin 3¢ 
We are now in a position to prove our theorems. First by a theorem of Paley*® 
(3) implies that for y > a > 0, 





-_1 2 
= = Ay_.b,—~4S as m— 2. 
im n=0 

But 
>? g(t) cos nt dt = ¢(0) = 0, 
n=1 7 JO 


⁶ L.c., Theorem II.













so that (3) implies that, fory > a > 0, 


-1< 1< 
Gi 2s AtnSs = Gy Ls Adnan > S as n— o, 
m n=0 m n=0 


By a theorem of Andersen’ this in turn implies that 


1 m 
ge 2 Amana, S as no (@=y+1>a+1). 


This completes the proof of Theorem I. 
Now let us suppose that 


IV 


1). 


1i¢< , 
— > At_,na, > S=0 as n> o@ (a 
mn=0 


By the theorem of Andersen’ used before, this implies that the series 
LU Sn 
n=l 


is (C, a — 1) summable to 0. By a theorem of Paley’ this implies that if a = 1, 


conj. lim y(t) = 0(R’, 8) (8 > a), 
and as we have seen, this implies that 
conj. der. lim g(t) = 0(R’, 8) (8 > a), 


and completes the proof of Theorem II. 

It is necessary to use the condition a 2 1 in Theorem II in order to apply the 
theorem of Paley. As a matter of fact, Paley’s results hold for one lower index. 
Compare Bosanquet’s result for the analogous problem in Fourier series.” I 
have a direct proof of Theorem II for a > 0. 

It might be mentioned that Theorem I is true for $\alpha = 0$ if


conj. der. lim g(t) = S(R’, 0) 
is defined by replacing condition (1) by 


o(t) 


; —0 as t—-+40. 


NORTHWESTERN UNIVERSITY. 


⁷ A. F. Andersen, Proc. Lon. Math. Soc., vol. 27 (1927), pp. 39-71; see Theorem 3.

⁸ L.c., Theorem 3.

⁹ L.c., Theorem IV.

¹⁰ L.c., Theorem 4.











ORTHOGONAL POLYNOMIALS ON A PLANE CURVE 
By DunHamM JACKSON 


1. Introduction. Polynomials in two real variables x and y orthogonal with
respect to integration along a curve in the (x, y)-plane can be constructed by
the usual process for building up a system of orthogonal functions. If the
curve is not algebraic, they have formal properties closely corresponding to
those of polynomials orthogonal over a two-dimensional region.¹ For an algebraic
curve the relations are different in important respects, reverting in some
degree toward those which are familiar in the case of orthogonal functions of a
single variable. This is to be pointed out in detail below, under hypotheses
which, though not of the utmost generality, are still sufficiently illustrative.


2. Orthogonal polynomials on a non-algebraic curve. Let $\varphi(t)$, $\psi(t)$ be continuous
functions of t, of period A. If they are not both constant, the equations
$x = \varphi(t)$, $y = \psi(t)$ may be regarded as defining a closed curve C (not necessarily
of simple character). A relation of linear dependence connecting any finite
number of the functions $[\varphi(t)]^h[\psi(t)]^k$, $h = 0, 1, 2, \cdots$, $k = 0, 1, 2, \cdots$, would
mean that a polynomial in x and y vanishes identically on the curve, and so
that the curve is the locus or a part of the locus of an algebraic equation. Let it
be assumed for the present that no such relation of linear dependence exists.

Let $\rho(t)$ be a non-negative integrable function of period A, which, if not everywhere
positive, is at any rate such that its product with any polynomial in $\varphi(t)$
and $\psi(t)$ (having a non-vanishing coefficient) is different from zero for a set of
values of t of positive measure in a period. This condition will be satisfied,
for example, if there is an interval in which $\rho(t)$ is almost everywhere different
from zero and for which the corresponding points (x, y) do not belong to an
algebraic locus.

Under the hypotheses that have been formulated any finite number of the
quantities $\rho^{1/2}$, $\rho^{1/2}x$, $\rho^{1/2}y$, $\rho^{1/2}x^2$, $\rho^{1/2}xy$, $\rho^{1/2}y^2$, $\cdots$, regarded as functions of t for
$0 \le t \le A$, are linearly independent. It is possible by "Schmidt's process of orthogonalization"
to construct from them a sequence of functions which are
orthogonal and normalized over the interval. When the functions are
taken in the order indicated, the members of the orthogonal set are of the form
$[\rho(t)]^{1/2} q_{nm}(x, y)$, $n = 0, 1, 2, \cdots$, $m = 0, 1, \cdots, n$, where $x = \varphi(t)$, $y = \psi(t)$,
and $q_{nm}$ is a polynomial of degree n in x and y together, while m is the exponent

Received November 16, 1936; presented to the American Mathematical Society, September 3, 1936.

¹ Cf. D. Jackson, Formal properties of orthogonal polynomials in two variables, this Journal, vol. 2 (1936), pp. 423-434; referred to hereafter as paper A.




















of the highest power of y occurring in a term of the n-th degree.² The q's will
be said to constitute a set of normalized orthogonal polynomials on the curve C,
with respect to t as parameter and $\rho(t)$ as weight function, satisfying the conditions
that

$$\int_0^A \rho(t)\,q_{kl}(x, y)\,q_{nm}(x, y)\,dt = 0, \qquad |n - k| + |m - l| \ne 0,$$

$$\int_0^A \rho(t)\,[q_{nm}(x, y)]^2\,dt = 1.$$

Certain properties of these polynomials can be stated briefly, the proofs being
so similar to those for the case of polynomials orthogonal over a region³ that
it is unnecessary to give them in detail; it is to be borne in mind that under the
present hypotheses, if the product of $\rho$ by a polynomial in x and y vanishes
almost everywhere as a function of t on the interval (0, A), the coefficients of
the polynomial must all be zero. Any polynomial of the n-th degree which is
orthogonal (in the sense under consideration) to every polynomial of lower
degree must be a linear combination of $q_{n0}, \cdots, q_{nn}$. Any n + 1 polynomials
$p_{n0}, \cdots, p_{nn}$ which are expressible in terms of $q_{n0}, \cdots, q_{nn}$ by an orthogonal
linear transformation are orthogonal to each other, normalized, and orthogonal
to every polynomial of lower degree. And any n + 1 polynomials of the n-th
degree which are normalized, orthogonal to each other, and orthogonal to every
polynomial of lower degree are related to any other such set, and in particular
to $q_{n0}, \cdots, q_{nn}$, by an orthogonal transformation.

Suppose there is a number $\mu$ between 0 and A such that

(1)
$$\begin{aligned}
\varphi(t + \mu) &= \alpha\varphi(t) + \beta\psi(t),\\
\psi(t + \mu) &= \gamma\varphi(t) + \delta\psi(t),\\
\rho(t + \mu) &= \rho(t)
\end{aligned}$$

for all values of t, where $\alpha$, $\beta$, $\gamma$, $\delta$ are constants. The determinant $\alpha\delta - \beta\gamma$ is
certainly different from zero; otherwise there would be a linear relation connecting
$\varphi(t + \mu)$ and $\psi(t + \mu)$, in contradiction with the hypothesis that the
curve is not algebraic. If (x, y) is a point of the curve, the point (x', y') with
coördinates given by

(2) $x' = \alpha x + \beta y, \qquad y' = \gamma x + \delta y$

is also a point of the curve, and vice versa; the transformation (2) carries the
curve into itself.


² In a corresponding passage in the paper A, p. 424, "the degree with respect to y" should be amended to read "the degree of the leading homogeneous aggregate of terms with respect to y".

³ See the paper A, pp. 424-425.








If P(x, y) is any polynomial in x and y,

$$\begin{aligned}
\int_0^A \rho(t)\,P[\alpha\varphi(t) + \beta\psi(t),\ \gamma\varphi(t) + \delta\psi(t)]\,dt
&= \int_0^A \rho(t + \mu)\,P[\varphi(t + \mu),\ \psi(t + \mu)]\,d(t + \mu)\\
&= \int_\mu^{A+\mu} \rho(t)\,P[\varphi(t),\ \psi(t)]\,dt\\
&= \int_0^A \rho(t)\,P[\varphi(t),\ \psi(t)]\,dt,
\end{aligned}$$

the last equality resulting from the periodicity of the functions involved. In
summary, if P(x', y') as a polynomial in x and y is denoted by $\Pi(x, y)$,

(3) $\int_0^A \rho(t)\,\Pi(x, y)\,dt = \int_0^A \rho(t)\,P(x', y')\,dt = \int_0^A \rho(t)\,P(x, y)\,dt.$

For any polynomial p(x, y) which is normalized on C, let $p(x', y') = \pi(x, y)$.
Then the last relation, with $P(x, y) = [p(x, y)]^2$, says that $\pi(x, y)$ also is normalized.
If p(x, y) and s(x, y) are two polynomials which are orthogonal to
each other, and if $p(x', y') = \pi(x, y)$, $s(x', y') = \sigma(x, y)$, the relation (3) with
P = ps means that $\pi(x, y)$ and $\sigma(x, y)$ are orthogonal. If p(x, y) is of the n-th
degree, and the condition of orthogonality is satisfied for every polynomial
s(x, y) of lower degree, then $\sigma(x, y)$ also is an arbitrary polynomial of degree
lower than the n-th, and $\pi(x, y)$ is orthogonal to every such polynomial. Applied
in particular to any set of n + 1 polynomials $p_{n0}, \cdots, p_{nn}$ of the n-th degree
which are normalized, orthogonal to each other, and orthogonal to every polynomial
of lower degree, this reasoning shows that $p_{n0}(x', y'), \cdots, p_{nn}(x', y')$
are expressible in terms of $p_{n0}(x, y), \cdots, p_{nn}(x, y)$ by an orthogonal linear
transformation.

The work of the last three paragraphs applies also, with obvious modifications
in detail, if the equations (1) are replaced by

(4)
$$\begin{aligned}
\varphi(\mu - t) &= \alpha\varphi(t) + \beta\psi(t),\\
\psi(\mu - t) &= \gamma\varphi(t) + \delta\psi(t),\\
\rho(\mu - t) &= \rho(t).
\end{aligned}$$

Furthermore, the entire discussion remains valid if the functions $\varphi$, $\psi$, $\rho$ are
defined merely for $0 \le t \le A$, the hypothesis of periodicity being dropped,
and the closed curve C replaced by a non-algebraic arc, except that in place of
(1) and (4) only the equations

$$\begin{aligned}
\varphi(A - t) &= \alpha\varphi(t) + \beta\psi(t),\\
\psi(A - t) &= \gamma\varphi(t) + \delta\psi(t),\\
\rho(A - t) &= \rho(t)
\end{aligned}$$

come into consideration. Other extensions readily suggest themselves, though
it is not so clear what the most general possible formulation would be.

Under conditions of some generality the transformation (2) must itself be orthogonal. Let the curve $C$ be rectifiable, and let it be represented by the equations

$$x = \varphi(s), \qquad y = \psi(s)$$

in terms of the arc-length as parameter. For simplicity, let it be made up of a finite number of arcs, on each of which $\varphi(s)$ and $\psi(s)$ have continuous derivatives. Then

$$[\varphi'(s)]^2 + [\psi'(s)]^2 = 1 \tag{5}$$

except for a finite number of points at most. Let $\varphi$ and $\psi$ satisfy identically the equations

$$\begin{aligned}
\varphi(\mu + s) &= \alpha\varphi(s) + \beta\psi(s),\\
\psi(\mu + s) &= \gamma\varphi(s) + \delta\psi(s),
\end{aligned}$$

or the alternative equations with $\mu + s$ replaced by $\mu - s$. Then

$$[\varphi'(\mu + s)]^2 + [\psi'(\mu + s)]^2 = (\alpha^2 + \gamma^2)[\varphi'(s)]^2 + 2(\alpha\beta + \gamma\delta)\varphi'(s)\psi'(s) + (\beta^2 + \delta^2)[\psi'(s)]^2,$$

which with (5) means that

$$(\alpha^2 + \gamma^2 - 1)[\varphi'(s)]^2 + 2(\alpha\beta + \gamma\delta)\varphi'(s)\psi'(s) + (\beta^2 + \delta^2 - 1)[\psi'(s)]^2 = 0,$$

except at a finite number of points. Let the slope $dy/dx = \psi'(s)/\varphi'(s)$ be denoted by $\lambda$. If $\lambda$ takes on three or more distinct finite values, the quadratic equation

$$(\alpha^2 + \gamma^2 - 1) + 2(\alpha\beta + \gamma\delta)\lambda + (\beta^2 + \delta^2 - 1)\lambda^2 = 0$$

has more than two distinct roots, and its coefficients must vanish; the equations

$$\alpha^2 + \gamma^2 = 1, \qquad \beta^2 + \delta^2 = 1, \qquad \alpha\beta + \gamma\delta = 0$$

are the conditions of orthogonality. The same conclusion holds if $\varphi'(s)/\psi'(s)$ takes on three distinct finite values, and so if the slope takes on the value $\infty$ and two finite values different from zero; and again if there are regular points of the curve for which respectively $\varphi'(s) = 0$, $\psi'(s) = 0$, and $\varphi'(s)\psi'(s) \ne 0$, i.e., if the slope takes on the values $0$ and $\infty$ and a finite value different from $0$.

These brief observations raise numerous questions which are left unanswered. 
It may be pointed out, however, that the hypothesis that the curve has at least 
three different directions is not entirely irrelevant. For the parallelogram with 
vertices at the points $(\pm 2, 0)$ and $(0, \pm 1)$ (an algebraic curve, to be sure) is carried over into itself by the non-orthogonal transformation $x' = -2y$, $y' = \tfrac{1}{2}x$, arising from the substitution of $s + 5^{1/2}$ for $s$.
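
The parallelogram example can be checked by direct computation. The following is a minimal sketch, not part of the original argument; the names `transform`, `quadratic`, and `side_slopes` are introduced here for illustration only. It confirms that the transformation permutes the four vertices, that it is not orthogonal, and that the two slopes actually present on the curve are roots of the quadratic in the slope, which is why two directions are not enough to force the coefficients to vanish.

```python
from fractions import Fraction as F

# Transformation (2) written as x' = a*x + b*y, y' = c*x + d*y.
# Here x' = -2y, y' = x/2, i.e. alpha = 0, beta = -2, gamma = 1/2, delta = 0.
a, b, c, d = F(0), F(-2), F(1, 2), F(0)

def transform(point):
    x, y = point
    return (a * x + b * y, c * x + d * y)

vertices = [(F(2), F(0)), (F(0), F(1)), (F(-2), F(0)), (F(0), F(-1))]
images = [transform(v) for v in vertices]

# Coefficients of q0 + 2*q1*lam + q2*lam**2 = 0, the quadratic in the slope.
q0 = a * a + c * c - 1
q1 = a * b + c * d
q2 = b * b + d * d - 1

def quadratic(lam):
    return q0 + 2 * q1 * lam + q2 * lam * lam

# The sides of the parallelogram have only these two slopes.
side_slopes = [F(-1, 2), F(1, 2)]
is_orthogonal = (q0 == 0 and q1 == 0 and q2 == 0)
```

Since a linear map carries segments into segments, a permutation of the vertices that preserves adjacency carries the whole parallelogram into itself.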

The pertinent parts of the discussion in the paper A relating to particular transformations (2) and the orthogonal transformations of $p_{n0}, \dots, p_{nn}$ induced by them, specifically in the case of the transformations $x' = -x$, $y' = -y$ (see footnote 6 of paper A) and $x' = y$, $y' = x$, can be carried over to the present situation. Also the reasoning of the second section of that paper, which is concerned with a recursion formula and a Christoffel-Darboux identity for the orthogonal polynomials, is applicable here with the obvious formal adaptations.


3. Definition and orthogonal transformation of orthogonal polynomials on an 
algebraic curve. The foregoing conclusions have to be modified if the curve 
to which the discussion relates is algebraic. Let the weight function $\rho(t)$ for simplicity be taken as positive everywhere or almost everywhere on its range of definition, so that questions of linear dependence shall not be complicated by the possibility of vanishing of $\rho$. Let $\varphi(t)$ and $\psi(t)$ once more be continuous for $0 \le t \le A$, or continuous everywhere and of period $A$, and not both constant, but let it be supposed now that there is a polynomial in $x$ and $y$ which vanishes identically for $0 \le t \le A$ when $x$ and $y$ are replaced by $\varphi(t)$ and $\psi(t)$.
Any multiple of such a polynomial will naturally have the same property. If a 
factor of a polynomial of this sort vanishes at only a finite number of points 
of the curve C, the quotient obtained by dividing out this factor will vanish at 
points arbitrarily near the finite number of points in question, and will vanish 
at those points by continuity and so at all points of the curve. There is there- 
fore a polynomial vanishing identically on C, and composed of irreducible factors, 
each of which vanishes at infinitely many points of C. 

If $Q_1(x, y)$ is a particular polynomial meeting these specifications, any irreducible polynomial vanishing at infinitely many points of $C$ must vanish at infinitely many points simultaneously with one of the irreducible factors of $Q_1(x, y)$, and must be identical with that factor except for a constant multiplier.⁴ There can be only a finite number of essentially distinct irreducible polynomials vanishing at infinitely many points of $C$; their product, determined except for a constant factor, may be characterized as the polynomial of lowest degree vanishing identically on $C$. Let this polynomial be denoted by $Q(x, y)$, and let its degree in the two variables together be $N$; the choice of the constant multiplier is immaterial. (The irreducible factors are essentially real; if an irreducible polynomial with complex coefficients vanishes at infinitely many real points, its conjugate, vanishing at the same real points, must be a constant multiple of it.) Any polynomial which vanishes identically on $C$ is a multiple of $Q(x, y)$.
The curve C is by no means necessarily the complete locus of the equation 
Q(x, y) = 0; it may be a triangle or a square, or otherwise composed of a number 
of algebraic arcs. 

⁴ See for example, M. Bôcher, Introduction to Higher Algebra, pp. 210-211.

Let the rank of a monomial $x^h y^k$ for the moment denote the index of its position in the sequence $1, x, y, x^2, xy, y^2, \dots$, so that if two terms are of different degrees in the two variables together, the one of higher degree has the higher rank, while if two terms are of the same degree the one which is of higher degree in $y$ has the higher rank. If the terms of two polynomials are arranged according to rank in descending order, the leading term of the product of the two polynomials is the product of their leading terms. If the leading term of $Q(x, y)$ is $x^p y^q$, $p + q = N$, the leading term of any polynomial which vanishes identically on $C$ must be divisible by $x^p y^q$. And if $h$ and $k$ are any two exponents such that $h \ge p$, $k \ge q$, multiplication of $Q(x, y)$ by $x^{h-p} y^{k-q}$ gives a polynomial vanishing identically on $C$, and having $x^h y^k$ for its leading term. In other words, among the monomials $x^n, x^{n-1}y, \dots, y^n$ of the $n$-th degree, when $n \ge N$, each member of the set $x^{n-q}y^q, x^{n-q-1}y^{q+1}, \dots, x^p y^{n-p}$ is linearly expressible on the curve $C$ in terms of monomials of lower rank than its own, while none of the remaining $N$ monomials of the $n$-th degree, in which the first exponent is less than $p$ or the second exponent less than $q$, is connected with terms of lower rank than its own by any relation of linear dependence on $C$. If all terms $x^h y^k$ in which simultaneously $h \ge p$ and $k \ge q$ are omitted from the sequence $1, x, y, x^2, xy, y^2, \dots$, Schmidt's process can be applied to the remaining terms, taken in the order indicated, and yields a set of normalized orthogonal polynomials comprising just $N$ polynomials of the $n$-th degree for each value of $n \ge N$ (and $n + 1$ polynomials of degree $n$ for $n < N$).
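
The count of retained monomials can be illustrated by a short enumeration. This is only a sketch under stated assumptions: the function names `retained_monomials` and `count_by_degree` are hypothetical, and the sample curve is the unit circle, for which $Q = x^2 + y^2 - 1$ has leading term $y^2$ in the ordering by rank, so that $p = 0$, $q = 2$, $N = 2$.

```python
def retained_monomials(p, q, max_degree):
    """Exponent pairs (h, k) of x^h y^k in rank order, omitting those
    with h >= p and k >= q simultaneously."""
    kept = []
    for n in range(max_degree + 1):
        for k in range(n + 1):          # within a degree, rank grows with the power of y
            h = n - k
            if not (h >= p and k >= q):
                kept.append((h, k))
    return kept

def count_by_degree(monomials):
    counts = {}
    for h, k in monomials:
        counts[h + k] = counts.get(h + k, 0) + 1
    return counts

# Unit circle (assumed example): leading term of Q is y^2, so p = 0, q = 2.
circle = retained_monomials(0, 2, 5)
circle_counts = count_by_degree(circle)
```

For the circle the retained terms are $1, x, y, x^2, xy, x^3, x^2y, \dots$, that is, $n + 1$ terms for $n < 2$ and exactly $N = 2$ terms of each degree $n \ge 2$.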

The preceding argument as it stands is of course based essentially on a particular arbitrary arrangement of the monomials $x^h y^k$ in serial order. It remains to be seen to what extent it possesses more general significance.

In the orthogonal system that has been described let the polynomials of the $n$-th degree be denoted by $q_{n1}, q_{n2}, \dots, q_{nN}$, for $n \ge N$, where the first subscript indicates the degree in the two variables together, and for fixed $n$ the second subscript increases with the exponent of the highest power of $y$ occurring in a term of the $n$-th degree, but is not in general equal to that exponent. The modifications required for values of $n < N$ will be obvious. Let the leading term of a polynomial be characterized still in terms of the notion of rank already employed. Each of the monomials $x^n, x^{n-1}y, \dots, y^n$, with an appropriate coefficient, is leading term either of one of the polynomials $q_{ni}$ or of a polynomial which vanishes identically on $C$. If $p(x, y)$ is an arbitrary polynomial of the $n$-th degree orthogonal to every polynomial of lower degree, suitable constant multiples of the $n + 1$ polynomials just mentioned can be subtracted from $p(x, y)$ to remove successively all the terms of the $n$-th degree in order of decreasing rank, leaving a polynomial which, being orthogonal to every polynomial of degree lower than the $n$-th, must be orthogonal to itself and so identically zero on the curve. This means that $p(x, y)$ is equal on the curve to a linear combination of $q_{n1}, \dots, q_{nN}$, or as a polynomial in $x$ and $y$ is the sum of such a linear combination and a polynomial which contains $Q(x, y)$ as a factor.

It follows that any set of $N$ polynomials $p_{n1}, \dots, p_{nN}$ of the $n$-th degree which are normalized, orthogonal to each other, and orthogonal to every polynomial of lower degree, can be expressed on $C$ in terms of $q_{n1}, \dots, q_{nN}$ by an orthogonal transformation. Clearly any set of $N$ polynomials thus expressible in terms of the $q$'s will have the properties mentioned. And since for points on the curve the $q$'s are given in terms of the $p$'s by the inverse orthogonal transformation, any polynomial of the $n$-th degree orthogonal to every polynomial of lower degree can be linearly represented on the curve in terms of the $p$'s, and any two sets of $p$'s are related to each other on the curve by an orthogonal transformation. It is perhaps not superfluous to emphasize that all these relations, which for the polynomials in $x$ and $y$ have the character of congruences with respect to $Q(x, y)$ as modulus, are identities in the functions of $t$ which are obtained on replacement of $x$ and $y$ by $\varphi(t)$ and $\psi(t)$. In terms of $x$ and $y$ the individual polynomials of the $n$-th degree in the orthogonal system are subject not merely to the indeterminacy involved in the admissibility of orthogonal transformation, but also to the addition of arbitrary polynomials of the $n$-th degree containing $Q(x, y)$ as a factor.

If the curve $C$ is a straight line segment, $N = 1$, and the orthogonal system
contains just one polynomial of each degree. With s as parameter and weight 
function unity the orthogonal polynomials are essentially the Legendre poly- 
nomials in s, and introduction of a weight function p(s) gives rise to the cor- 
responding set of orthogonal polynomials in a single variable. 

If $C$ is the unit circle, with the parametric representation

$$x = \cos t, \qquad y = \sin t, \qquad 0 \le t \le 2\pi,$$

and $\rho(t) = 1/\pi$, a polynomial in $x$ and $y$ is a trigonometric sum in $t$, the orthogonal polynomials of the $n$-th degree ($N = 2$), chosen with the exercise of a particular option as regards orthogonal transformation, reduce on the curve to $\cos nt$ and $\sin nt$ for $n \ge 1$, and in terms of $x$ and $y$ they are any polynomial representations of $\cos nt$ and $\sin nt$ in terms of $\cos t$ and $\sin t$, e.g., a polynomial of the $n$-th degree in $x$ and the product of $y$ by a polynomial of degree $n - 1$ in $x$, or, alternatively, the real and pure imaginary parts of $(x + iy)^n$. A general orthogonal transformation on $\cos nt$ and $\sin nt$ has the effect merely of replacing them by $\cos(nt + k)$ and $\pm\sin(nt + k)$. Admission of a non-constant weight function $\rho(t)$ leads to more general orthogonal trigonometric sums of the type discussed in a recent paper by the writer.⁵
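
The statement about the unit circle lends itself to a numerical check. The sketch below is illustrative only; the names `circle_pair` and `inner` are not from the paper, and the integral is approximated by a uniform sum, which is accurate to rounding for trigonometric sums of degree below the number of sample points. It verifies that the real and imaginary parts of $(x + iy)^n$ are normalized and mutually orthogonal with $\rho(t) = 1/\pi$, and orthogonal to a polynomial of lower degree.

```python
import math

def circle_pair(n, t):
    """Real and imaginary parts of (x + i*y)**n at x = cos t, y = sin t."""
    z = complex(math.cos(t), math.sin(t)) ** n
    return z.real, z.imag

def inner(f, g, samples=256):
    """Approximate integral of rho(t) f(t) g(t) over [0, 2*pi], with rho = 1/pi."""
    h = 2 * math.pi / samples
    return sum(f(k * h) * g(k * h) for k in range(samples)) * h / math.pi

def c3(t):
    return circle_pair(3, t)[0]   # equals cos 3t on the curve

def s3(t):
    return circle_pair(3, t)[1]   # equals sin 3t on the curve

def c2(t):
    return circle_pair(2, t)[0]   # a polynomial of lower degree
```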

Substitution of the ellipse $x = a\cos t$, $y = b\sin t$ for the circle, with a given $\rho(t)$, does not change the orthogonal functions as to their dependence on $t$, but changes the coefficients of their representation in terms of $x$ and $y$, by the substitution of $x/a$ and $y/b$ for $x$ and $y$. Solution of the problem for the ellipse with $s$ as parameter and constant weight function would correspond to the construction of orthogonal trigonometric sums in $t$ (not trigonometric sums in $s$) with $\rho(t) = ds/dt$.

The rather obvious remarks of the last three paragraphs are inserted merely for the sake of showing more clearly how the theory under discussion may be regarded as a generalization of that of the most familiar orthogonal systems in one dimension.

⁵ D. Jackson, Orthogonal trigonometric sums, Annals of Mathematics, vol. 34 (1933), pp. 799-814.

If $\varphi$, $\psi$, $\rho$ satisfy a set of relations (1) or (4), and if $x'$, $y'$ are defined by (2), the reasoning carried through for a non-algebraic curve can be used now to show that $p_{n1}(x', y'), \dots, p_{nN}(x', y')$ are expressible on the curve in terms of $p_{n1}(x, y), \dots, p_{nN}(x, y)$ by an orthogonal transformation, provided that $C$ is not a straight line segment. In the excluded case the inference as to the non-vanishing of $\alpha\delta - \beta\gamma$ is inadmissible; there is perhaps no need of dwelling further on this case in the two-dimensional setting, the facts with regard to orthogonal polynomials in a single variable being well known. In particular, if (2) is the transformation $x' = -x$, $y' = -y$, $p_{ni}(x', y')$ is identically equal on the curve to $(-1)^n p_{ni}(x, y)$, for $i = 1, 2, \dots, N$. But the transformation $x' = y$, $y' = x$ requires further consideration, which will not be undertaken here; for in the two-dimensional case,⁶ or in the case of a non-algebraic curve, it is possible to say that the polynomial $q_{n0}(x, y)$, for example, is neither symmetric nor skew-symmetric, while on the circle $x^2 + y^2 = 1$, with constant weight function, the corresponding polynomial for $n = 2$ satisfies the congruence-identity

$$2x^2 - 1 \equiv -(2y^2 - 1).$$


4. Recursion formula and Christoffel-Darboux identity on an algebraic curve. 
By reason of the fact that the number of polynomials of the n-th degree in the 
orthogonal system does not increase indefinitely with n, the recursion formula 
and the Christoffel-Darboux identity have a somewhat closer resemblance to 
those found in the case of a single variable than when the domain of orthogo- 
nality is a non-algebraic curve or a two-dimensional region. It is to be kept in 
mind, however, that the various identities hold in general only on the curve. 

Let $p_{ni}(x, y)$ be an arbitrary one of the polynomials of the $n$-th degree. The product $x\,p_{ni}(x, y)$, as a polynomial of degree $n + 1$, is expressible on the curve in the form

$$x\,p_{ni}(x, y) = \sum_j A_{nij}\,p_{n+1,j}(x, y) + \sum_j B_{nij}\,p_{nj}(x, y) + \sum_j C_{nij}\,p_{n-1,j}(x, y), \tag{6}$$

with

$$\begin{aligned}
A_{nij} &= \int_0^A \rho(t)\,x\,p_{ni}(x, y)\,p_{n+1,j}(x, y)\,dt,\\
B_{nij} &= \int_0^A \rho(t)\,x\,p_{ni}(x, y)\,p_{nj}(x, y)\,dt = B_{nji},\\
C_{nij} &= \int_0^A \rho(t)\,x\,p_{ni}(x, y)\,p_{n-1,j}(x, y)\,dt = A_{n-1,ji}.
\end{aligned}$$

Terms of lower degree are absent from the right-hand member because the corresponding coefficients vanish, by reason of the property of orthogonality.
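
As an illustration, the recursion (6) can be verified numerically on the unit circle, where $N = 2$. The following sketch is an assumption-laden example rather than anything taken from the paper: it uses the orthonormal system $1/\sqrt{2}$, $\cos nt$, $\sin nt$ with $\rho(t) = 1/\pi$, the convention $p_{02} = 0$, and hypothetical helper names `p`, `integral`, `coeff`, `rhs`; the coefficients are computed by a uniform sum in place of the integral.

```python
import math

def p(n, i, t):
    """Orthonormal system on the unit circle, rho = 1/pi (i = 1: cosine, i = 2: sine)."""
    if n == 0:
        return 1 / math.sqrt(2) if i == 1 else 0.0   # convention p_{02} = 0
    return math.cos(n * t) if i == 1 else math.sin(n * t)

def integral(f, samples=512):
    """Approximate integral of rho(t) f(t) over [0, 2*pi], with rho = 1/pi."""
    h = 2 * math.pi / samples
    return sum(f(k * h) for k in range(samples)) * h / math.pi

def coeff(n, i, m, j):
    """Integral of rho(t) * x * p_{ni} * p_{mj}, with x = cos t."""
    return integral(lambda t: math.cos(t) * p(n, i, t) * p(m, j, t))

def rhs(n, i, t):
    """Right-hand member of (6) for n >= 1, coefficients taken as integrals."""
    total = 0.0
    for j in (1, 2):
        total += coeff(n, i, n + 1, j) * p(n + 1, j, t)   # A_{nij}
        total += coeff(n, i, n, j) * p(n, j, t)           # B_{nij}
        total += coeff(n, i, n - 1, j) * p(n - 1, j, t)   # C_{nij}
    return total
```

On the circle the recursion reduces to the familiar identities $\cos t \cos nt = \tfrac12\cos(n+1)t + \tfrac12\cos(n-1)t$ and $\cos t \sin nt = \tfrac12\sin(n+1)t + \tfrac12\sin(n-1)t$.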


⁶ See paper A, p. 427.





In this section, in the absence of express indication to the contrary, the sign $\sum$ always denotes summation over the designated index from $1$ to $N$. The expression (6) is then appropriate in the first instance for values of $n \ge N$; it can be made formally valid for $0 \le n \le N - 1$, however, by adoption of the convention, in the integral formulas for the coefficients as well as in the identity itself, that $p_{nj}(x, y) = 0$ when $0 \le n + 1 < j \le N$. With this convention, which will be maintained henceforth, the identity holds trivially when $1 \le n + 1 < i \le N$ in the left-hand member, as well as significantly when $p_{ni}(x, y)$ is not identically zero. The recursion formula (6) resembles that for a single variable in that the number of its terms does not increase with $n$.

Multiplication of (6) by $p_{ni}(u, v)$, followed by summation over the index $i$ from $1$ to $N$, with replacement of the coefficient $C_{nij}$ by its equal $A_{n-1,ji}$, gives

$$\begin{aligned}
x \sum_i p_{ni}(x, y)\,p_{ni}(u, v) ={}& \sum_i \sum_j A_{nij}\,p_{n+1,j}(x, y)\,p_{ni}(u, v)\\
&+ \sum_i \sum_j B_{nij}\,p_{nj}(x, y)\,p_{ni}(u, v) + \sum_i \sum_j A_{n-1,ji}\,p_{n-1,j}(x, y)\,p_{ni}(u, v).
\end{aligned} \tag{7}$$

On interchange of $(x, y)$ with $(u, v)$ this becomes

$$\begin{aligned}
u \sum_i p_{ni}(x, y)\,p_{ni}(u, v) ={}& \sum_i \sum_j A_{nij}\,p_{n+1,j}(u, v)\,p_{ni}(x, y)\\
&+ \sum_i \sum_j B_{nij}\,p_{nj}(u, v)\,p_{ni}(x, y) + \sum_i \sum_j A_{n-1,ji}\,p_{n-1,j}(u, v)\,p_{ni}(x, y).
\end{aligned} \tag{8}$$

In the second summation on the right-hand side of (8) let the index symbols $i$ and $j$ be interchanged, the coefficient $B_{nij}$ being restored, however, by virtue of the fact that $B_{nij} = B_{nji}$; the sum in question then becomes identical with the corresponding sum in (7). Subtraction of (7) from (8), with interchange of $i$ and $j$ in the last summation of each identity, gives

$$\begin{aligned}
(u - x) \sum_i p_{ni}(x, y)\,p_{ni}(u, v) ={}& \sum_i \sum_j A_{nij}\,[p_{n+1,j}(u, v)\,p_{ni}(x, y) - p_{n+1,j}(x, y)\,p_{ni}(u, v)]\\
&- \sum_i \sum_j A_{n-1,ij}\,[p_{nj}(u, v)\,p_{n-1,i}(x, y) - p_{nj}(x, y)\,p_{n-1,i}(u, v)].
\end{aligned} \tag{9}$$


For the evaluation of

$$K_n(x, y, u, v) = \sum_{k=0}^{n} \sum_{i=1}^{N} p_{ki}(x, y)\,p_{ki}(u, v)$$

let (9) be written with $k$ in place of $n$ for each value of $k$ from $0$ to $n$. The result of summation with respect to $k$, when due account is taken of the fact that $p_{-1,i} = 0$ for all values of $i$, is

$$(u - x)\,K_n(x, y, u, v) = \sum_i \sum_j A_{nij}\,[p_{n+1,j}(u, v)\,p_{ni}(x, y) - p_{n+1,j}(x, y)\,p_{ni}(u, v)].$$

In this relation, which has the character of a Christoffel-Darboux identity, the number of terms again remains constant as $n$ increases. There is naturally a corresponding formula for $(v - y)\,K_n(x, y, u, v)$, differing from this only in the values of the coefficients $A_{nij}$.
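
The identity just obtained can be tested numerically on the unit circle. This is a sketch under assumptions not made in the paper: the system is $1/\sqrt{2}$, $\cos kt$, $\sin kt$ with $\rho(t) = 1/\pi$ and $p_{02} = 0$, and the coefficients $A_{nij}$ are entered in the closed form that elementary trigonometric integrals give for this system ($A_{nij} = \tfrac12$ for $i = j$, $n \ge 1$; $A_{011} = 1/\sqrt{2}$; all others zero). The names `kernel`, `A`, and `cd_right` are illustrative.

```python
import math

def p(k, i, t):
    """Orthonormal system on the unit circle, rho = 1/pi, with p_{02} = 0."""
    if k == 0:
        return 1 / math.sqrt(2) if i == 1 else 0.0
    return math.cos(k * t) if i == 1 else math.sin(k * t)

def kernel(n, t, tau):
    """K_n evaluated at the points of parameter t and tau."""
    return sum(p(k, i, t) * p(k, i, tau) for k in range(n + 1) for i in (1, 2))

def A(n, i, j):
    """Assumed closed form of the coefficients A_{nij} for this system."""
    if n == 0:
        return 1 / math.sqrt(2) if (i == 1 and j == 1) else 0.0
    return 0.5 if i == j else 0.0

def cd_right(n, t, tau):
    """Right-hand member of the Christoffel-Darboux identity."""
    return sum(
        A(n, i, j) * (p(n + 1, j, tau) * p(n, i, t) - p(n + 1, j, t) * p(n, i, tau))
        for i in (1, 2) for j in (1, 2)
    )
```

Here $x = \cos t$ and $u = \cos\tau$, so the left-hand member is $(\cos\tau - \cos t)\,K_n$.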


THE UNIVERSITY OF MINNESOTA.









































THE CLASSES OF INTEGRAL SETS IN A QUATERNION ALGEBRA 
By CLAIBORNE G. LATIMER


1. Introduction. Let $\mathfrak{A}$ be a rational generalized quaternion algebra with the fundamental number $d$.¹ A set of integral elements in $\mathfrak{A}$, or more briefly an integral set, is one with certain properties R, C, U, M as defined by Dickson.² Two integral sets are said to be equivalent, or of the same type, if there is a one-to-one correspondence between the elements of the sets which is preserved under addition and multiplication. All the sets equivalent to a given set will be said to form a class. Two integral sets, $\mathfrak{G}$ and $\mathfrak{G}_1$, belong to the same class if and only if there is a non-singular element $a$ in $\mathfrak{A}$ such that $\mathfrak{G}_1 = a\mathfrak{G}a^{-1}$.³

By a result due to Artin,⁴ the number $H$ of classes of integral sets in $\mathfrak{A}$ is equal to the number of classes of equivalent right ideals in an arbitrarily chosen integral set $\mathfrak{G}$ of $\mathfrak{A}$, Artin's definition of equivalent ideals being broader than the usual definition.

The principal purpose of this paper is to show that there is a one-to-one correspondence between the classes of integral sets in $\mathfrak{A}$ and certain classes of ternary quadratic forms. These classes of forms are the non-negative classes or the improperly primitive non-negative classes in a certain genus $G$, according as $d$ is even or odd. $G$ is uniquely determined by $d$. If $d < 0$, by a known theorem there is a single class of forms in $G$ and therefore $H = 1$.⁵

We shall also determine a relatively simple basis of an arbitrarily chosen integral set in $\mathfrak{A}$.


2. A normal basis of an integral set. If $\lambda_0, \dots, \lambda_3$ form a basis of an integral set $\mathfrak{G}$, then $\lambda_i\lambda_j = \sum_k c_{ijk}\lambda_k$ $(i, j = 0, \dots, 3)$. A set $\mathfrak{G}_1$ is equivalent to $\mathfrak{G}$ if and only if it has a basis $\xi_0, \dots, \xi_3$ such that $\xi_i\xi_j = \sum_k c_{ijk}\xi_k$ $(i, j = 0, \dots, 3)$. The following theorem is a consequence of certain results due to Brandt.⁶

THEOREM 1. Let $\mathfrak{A}$ be a generalized quaternion algebra with the fundamental number $d$, and let $\lambda_0 = 1, \dots, \lambda_3$ be a basis of $\mathfrak{A}$. The $\lambda$'s form a basis of an integral set if and only if $N(\sum x_i\lambda_i) = \Phi(x_0, \dots, x_3) = \tfrac{1}{2}\sum g_{ij}x_ix_j$, where
(a) the coefficients of $\Phi$ are integers, $g_{ij} = g_{ji}$, $g_{00} = 2$;
(b) the determinant $|g_{ij}| = d^2$;
(c) every third order minor in the matrix $(g_{ij})$ is divisible by $d$ and every principal third order minor is divisible by $2d$.

Two integral sets in $\mathfrak{A}$ are equivalent if and only if they have basal elements $\lambda_0, \dots, \lambda_3$ and $\xi_0, \dots, \xi_3$ respectively, such that $N(\sum x_i\lambda_i) = N(\sum x_i\xi_i)$.

Received November 16, 1936.

¹ For the definition of $d$, see Brandt, Idealtheorie in Quaternionenalgebren, Mathematische Annalen, vol. 99 (1929), p. 9.

² Algebras and their Arithmetics, pp. 141, 2. It may be shown that our definition of an integral set is equivalent to Brandt's definition of a maximaler Integritätsbereich, loc. cit., p. 11.

³ Deuring, Algebren, p. 89.

⁴ Abhandlungen aus dem Mathematischen Seminar der Hamburgischen Universität, vol. 5 (1927), p. 288, Theorem 20.

⁵ In another paper, it was shown that if $d < 0$, then every one-sided ideal in an integral set is principal. (Transactions of the American Mathematical Society, vol. 40 (1936), p. 322.) From this and Artin's result, cited above, it again follows that if $d < 0$, then $H = 1$.

⁶ Loc. cit., pp. 8-11.

Let $\mathfrak{G}$ be an integral set in $\mathfrak{A}$ with the basis $\lambda_0 = 1, \dots, \lambda_3$ and let $N(\sum x_i\lambda_i) = \tfrac{1}{2}\sum g_{ij}x_ix_j$ as in Theorem 1. Suppose every $g_{0i}$ were even. Then $d$ would be even and, since every $g_{ii} = 2g'_{ii}$ is even, by (c) of Theorem 1 every $g_{ij}$ would be even. Then $d$ would be divisible by 4, whereas $d$ contains no square factor $> 1$.⁷ Therefore one of the $g_{0i}$ is odd. We may then assume, after an integral transformation of determinant unity, that $g_{01} = 1$, $g_{02} = g_{03} = 0$.

The trace, or double the scalar part, of $X = \sum x_i\lambda_i$ is $T(X) = 2x_0 + x_1$. Then $T(\lambda_1) = 1$, $T(\lambda_2) = T(\lambda_3) = 0$. Such a basis of $\mathfrak{G}$ will be called a normal basis. For a normal basis, the matrix $(g_{ij})$ of $2\Phi$ is in the form

$$M = \begin{pmatrix}
2 & 1 & 0 & 0\\
1 & g_{11} & g_{12} & g_{13}\\
0 & g_{21} & g_{22} & g_{23}\\
0 & g_{31} & g_{32} & g_{33}
\end{pmatrix}. \tag{1}$$

Since the determinant $|M| = d^2$, and every $g_{ii}$ is even, it follows that $g_{23} \equiv d \pmod 2$.
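
The conditions of Theorem 1 can be checked on a concrete matrix. The example below is not taken from the paper; it assumes that the Hamilton quaternions over the rationals have fundamental number $d = 2$ and that $1$, $\tfrac12(1 + i + j + k)$, $i$, $j$ is a normal basis of an integral set (the Hurwitz integers), for which a direct expansion of the norm gives the matrix `M` below. The helper names `det` and `third_order_minors` are illustrative.

```python
from itertools import combinations

def det(m):
    """Determinant by Laplace expansion along the first row (integers)."""
    if len(m) == 1:
        return m[0][0]
    return sum(
        (-1) ** j * m[0][j] * det([row[:j] + row[j + 1:] for row in m[1:]])
        for j in range(len(m))
    )

# Assumed example: matrix of 2*Phi for the basis 1, (1+i+j+k)/2, i, j,
# where Phi = x0^2 + x0*x1 + x1^2 + x1*x2 + x1*x3 + x2^2 + x3^2.
M = [[2, 1, 0, 0],
     [1, 2, 1, 1],
     [0, 1, 2, 0],
     [0, 1, 0, 2]]
d = 2  # assumed fundamental number of the Hamilton quaternion algebra

def third_order_minors(m):
    """List of (is_principal, value) over all third order minors of m."""
    out = []
    for rows in combinations(range(4), 3):
        for cols in combinations(range(4), 3):
            sub = [[m[r][c] for c in cols] for r in rows]
            out.append((rows == cols, det(sub)))
    return out

minors = third_order_minors(M)
```

With these values $|M| = 4 = d^2$, the matrix has the normal form (1), and $g_{23} = 0 \equiv d \pmod 2$.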


3. The class of ternary forms corresponding to a class of integral sets. Let $\lambda_0 = 1, \dots, \lambda_3$ be a normal basis of the integral set $\mathfrak{G}$, and let $\mathfrak{S}$ be the module consisting of all elements in $\mathfrak{G}$ of trace zero. Since $T(\sum x_i\lambda_i) = 2x_0 + x_1$, an element is in $\mathfrak{S}$ if and only if it may be written in the form $X = x_1(2\lambda_1 - 1) + x_2\lambda_2 + x_3\lambda_3$, where the $x$'s are integers. The norm of the general element in $\mathfrak{S}$ is $N(X) = f(x_1, x_2, x_3)$ where $f$ is the ternary quadratic form with the matrix

$$\Gamma = \begin{pmatrix}
2g_{11} - 1 & g_{12} & g_{13}\\
g_{21} & g'_{22} & \tfrac{1}{2}g_{23}\\
g_{31} & \tfrac{1}{2}g_{32} & g'_{33}
\end{pmatrix}.$$

$f$ is a classic form, i.e., the coefficients of all cross-product terms are even, if and only if $d$ is even. Since $|M| = d^2$, it may be shown that $|\Gamma| = d^2/4$.
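
The passage from $M$ to $\Gamma$ may be illustrated on the same kind of example. The sketch below again assumes, outside the paper, the matrix $M$ belonging to the basis $1$, $\tfrac12(1 + i + j + k)$, $i$, $j$ of the Hurwitz integers with $d = 2$; the names `phi`, `gamma`, `f`, and `det3` are illustrative. It checks that $f(x_1, x_2, x_3)$ agrees with the norm $\Phi(-x_1, 2x_1, x_2, x_3)$ of the trace-zero element $x_1(2\lambda_1 - 1) + x_2\lambda_2 + x_3\lambda_3$ and that $|\Gamma| = d^2/4$.

```python
from fractions import Fraction as F

# Assumed example: matrix of 2*Phi for the Hurwitz basis, d = 2.
M = [[2, 1, 0, 0],
     [1, 2, 1, 1],
     [0, 1, 2, 0],
     [0, 1, 0, 2]]
d = 2

def phi(x):
    """Norm form Phi = (1/2) * sum g_ij x_i x_j."""
    return F(1, 2) * sum(M[i][j] * x[i] * x[j] for i in range(4) for j in range(4))

# Matrix Gamma built from M as in the text (g'_ii = g_ii / 2).
gamma = [[F(2 * M[1][1] - 1), F(M[1][2]), F(M[1][3])],
         [F(M[2][1]), F(M[2][2], 2), F(M[2][3], 2)],
         [F(M[3][1]), F(M[3][2], 2), F(M[3][3], 2)]]

def f(x1, x2, x3):
    """Ternary form with matrix Gamma."""
    x = (x1, x2, x3)
    return sum(gamma[i][j] * x[i] * x[j] for i in range(3) for j in range(3))

def det3(m):
    return (m[0][0] * (m[1][1] * m[2][2] - m[1][2] * m[2][1])
            - m[0][1] * (m[1][0] * m[2][2] - m[1][2] * m[2][0])
            + m[0][2] * (m[1][0] * m[2][1] - m[1][1] * m[2][0]))
```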
Suppose $\xi_0, \dots, \xi_3$ form a normal basis of $\mathfrak{G}$. Then $2\xi_1 - 1$, $\xi_2$, $\xi_3$ form a basis of $\mathfrak{S}$ and therefore these elements are obtained from $2\lambda_1 - 1$, $\lambda_2$, $\lambda_3$ by an integral transformation of determinant $\pm 1$. Hence the form $N[y_1(2\xi_1 - 1) + y_2\xi_2 + y_3\xi_3]$ is equivalent to $f$. Hence the class $\mathfrak{C}$ of forms equivalent to $f$ is independent of the particular normal basis employed and is uniquely determined by $\mathfrak{G}$.

⁷ Brandt, loc. cit., p. 12.

⁸ Brandt, loc. cit., p. 10.

More generally, suppose $\mathfrak{G}_1$ is an integral set in the same class as $\mathfrak{G}$. By Theorem 1, $\mathfrak{G}_1$ has a basis $\eta_0, \dots, \eta_3$ such that $N(\sum x_i\eta_i) = \Phi$. Hence $\mathfrak{C}$ is uniquely determined by the class of integral sets containing $\mathfrak{G}$.

Suppose $d = 2d_1$ is even. It will be found that every second order minor of $\Gamma$ is equal to one half or one fourth of one of the third order minors of $M$. Therefore by Theorem 1 every such second order minor is divisible by $d_1$. Since $|\Gamma| = d_1^2$, it follows that $\pm d_1$ is the g.c.d. of these second order minors. By the definition of $d$, $\Phi$ is positive or indefinite according as $d > 0$ or $d < 0$. Hence the same is true of $f$ and the invariants of $f$ are $\Omega = d_1$, $\Delta = 1$.⁹ Since $d_1$ is odd and contains no square factor $> 1$, $f$ is a properly primitive form.

If $d$ is odd, it may be shown that the invariants of the improperly primitive form $2f$ are $\Omega = d$, $\Delta = 2$.

The class of forms containing $f$ or $2f$, according as $d$ is even or odd, will be said to correspond to the class of integral sets containing $\mathfrak{G}$.

Let $F$ be a quadratic field which is imaginary if $d > 0$. It is known that $\mathfrak{A}$ contains a field equivalent to $F$ if and only if no prime factor of $d$ is the product of two distinct prime ideals in the set of all integral numbers of $F$.¹⁰ If $d$ is even, $f$ is a properly primitive form and if $d$ is odd, $2f$ is improperly primitive. Therefore, in either case, $f$ represents a positive integer $k$, prime to $2d$.¹¹ Then $\mathfrak{S}$ contains an element $\eta$, such that $\eta^2 = -k$. Since $\mathfrak{A}$ contains the quadratic field $F(\eta)$, it follows that $-k$ is a quadratic non-residue of every odd prime factor, $q_1, q_2, \dots, q_n$ of $d$. Hence if $d$ is even, the characters of $f$ are¹²

$$\chi_i = \left(\frac{k}{q_i}\right) = (-1)^{(q_i + 1)/2} \qquad (i = 1, 2, \dots, n). \tag{2}$$

If $d$ is odd, the invariants of $2f$ are $\Omega = d$, $\Delta = 2$ and therefore by the last reference, the characters of $2f$ are

$$\chi_i = \left(\frac{2k}{q_i}\right) = (-1)^{(q_i + 1)/2 + (q_i^2 - 1)/8} \qquad (i = 1, 2, \dots, n). \tag{3}$$

⁹ For the definitions of these invariants, see Dickson's Studies in the Theory of Numbers, p. 10. This book will be referred to hereafter as Studies.

¹⁰ This is a consequence of a more general theorem by Hasse, Die Struktur der R. Brauerschen Algebrenklassengruppen über einem algebraischen Zahlkörper, Mathematische Annalen, vol. 107 (1933), p. 731; Deuring, Algebren, p. 118. See also The quadratic subfields of a generalized quaternion algebra, this Journal, vol. 2, p. 681.

¹¹ Dickson, Studies, Theorems 6, 7, p. 8.

¹² Dickson, Studies, p. 52.

We have then all but the last sentence of

THEOREM 2. Let $\mathfrak{A}$ be a rational generalized quaternion algebra with the fundamental number $d$. For every class of integral sets in $\mathfrak{A}$ there is a uniquely determined corresponding class $\mathfrak{C}$ of non-negative ternary quadratic forms. If $d = 2d_1$ is even, the invariants of the forms in $\mathfrak{C}$ are $\Omega = d_1$, $\Delta = 1$ and their characters are the $\chi_i$ in (2). If $d$ is odd, the forms in $\mathfrak{C}$ are improperly primitive and have the invariants $\Omega = d$, $\Delta = 2$ and the characters $\chi_i$ of (3). No class of forms corresponds to two classes of integral sets.
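
The values of the characters depend only on the fact that $-k$ is a quadratic non-residue of each odd prime factor $q$ of $d$. The sketch below, which is an illustration and not part of the paper, checks by Euler's criterion over a few small primes that this hypothesis gives $(k/q) = (-1)^{(q+1)/2}$ and $(2k/q) = (-1)^{(q+1)/2 + (q^2-1)/8}$; the names `legendre`, `odd_primes`, and `cases` are introduced here.

```python
def legendre(a, q):
    """Legendre symbol (a/q) for an odd prime q, by Euler's criterion."""
    r = pow(a % q, (q - 1) // 2, q)
    return -1 if r == q - 1 else r

odd_primes = [3, 5, 7, 11, 13, 17, 19, 23]

# All pairs (q, k) with -k a quadratic non-residue of q.
cases = [(q, k) for q in odd_primes for k in range(1, q) if legendre(-k, q) == -1]

def character_even_d(q):
    """Value (k/q) when -k is a non-residue of q."""
    return (-1) ** ((q + 1) // 2)

def character_odd_d(q):
    """Value (2k/q) when -k is a non-residue of q."""
    return (-1) ** ((q + 1) // 2 + (q * q - 1) // 8)
```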

To prove the last sentence of the theorem, we employ the following which will be proved in §6.

LEMMA 1. Let $\mathfrak{G}$, $\mathfrak{G}_1$ be sets of integral elements in $\mathfrak{A}$ and let $\mathfrak{S}$, $\mathfrak{S}_1$ be the sets of elements in $\mathfrak{G}$, $\mathfrak{G}_1$, respectively, of trace zero. If $\mathfrak{S} = \mathfrak{S}_1$, then $\mathfrak{G} = \mathfrak{G}_1$.

Let $\lambda_0, \dots, \lambda_3$ and $\xi_0, \dots, \xi_3$ be normal bases of the integral sets $\mathfrak{G}$ and $\mathfrak{G}_1$ respectively. Suppose the form $N[x_1(2\lambda_1 - 1) + x_2\lambda_2 + x_3\lambda_3] = f(x_1, x_2, x_3)$ is transformed into $N[y_1(2\xi_1 - 1) + y_2\xi_2 + y_3\xi_3] = f_1(y_1, y_2, y_3)$ by a transformation $x_i = \sum_j t_{ij}y_j$ $(i = 1, 2, 3)$, where the $t$'s are integers, $|t_{ij}| = \pm 1$. If $u$ is rational and $T(Z) = 0$, then $N(u + Z) = u^2 + N(Z)$. Hence $N(\sum y_i\xi_i) = y_0^2 + y_0y_1 + y_1^2/4 + f_1(y_1/2, y_2, y_3) = \Phi_1(y_0, \dots, y_3)$. Let $\eta_0 = 1$ and let $\eta_1$, $\eta_2$, $\eta_3$ be defined by

$$\begin{aligned}
2\eta_1 - 1 &= t_{11}(2\lambda_1 - 1) + t_{21}\lambda_2 + t_{31}\lambda_3,\\
\eta_2 &= t_{12}(2\lambda_1 - 1) + t_{22}\lambda_2 + t_{32}\lambda_3,\\
\eta_3 &= t_{13}(2\lambda_1 - 1) + t_{23}\lambda_2 + t_{33}\lambda_3.
\end{aligned} \tag{4}$$

Then $Y_1 = y_1(2\eta_1 - 1) + y_2\eta_2 + y_3\eta_3 = x_1(2\lambda_1 - 1) + x_2\lambda_2 + x_3\lambda_3$ and $N(Y_1) = f_1(y_1, y_2, y_3)$. Therefore if $Y = \sum y_i\eta_i$, then $N(Y) = \Phi_1(y_0, y_1, y_2, y_3) = N(\sum y_i\xi_i)$. Since the $\eta$'s form a basis of $\mathfrak{A}$ and the $\xi$'s form a basis of $\mathfrak{G}_1$, by Theorem 1, the $\eta$'s form a normal basis of an integral set $\mathfrak{G}'$ equivalent to $\mathfrak{G}_1$. But by (4) and Lemma 1, $\mathfrak{G}' = \mathfrak{G}$. Hence $\mathfrak{G}$ and $\mathfrak{G}_1$ belong to the same class. This completes the proof of Theorem 2.

If $d$ is even, the forms in $\mathfrak{C}$ and also their reciprocals have odd determinants and therefore are properly primitive. Therefore by Theorem 2 and a known result,¹³ we have the

COROLLARY. If $d < 0$, any two integral sets in $\mathfrak{A}$ are equivalent.


4. The correspondence between classes of integral sets and classes of forms. 
We shall now prove 

THEOREM 3. Let $\mathfrak{A}$ be a generalized quaternion algebra with the fundamental number $d$. If $d$ is even, let $G$ be the genus of forms with the invariants $\Omega = d/2$, $\Delta = 1$ and the characters $\chi_i$ of (2). If $d$ is odd, let $G$ be the genus of forms with the invariants $\Omega = d$, $\Delta = 2$ and the characters $\chi_i$ of (3). There is a one-to-one correspondence between the classes of integral sets in $\mathfrak{A}$ and the non-negative classes in $G$ or the non-negative improperly primitive classes in $G$, according as $d$ is even or odd.

If $d < 0$, the theorem follows from Theorem 2 and the theorem cited in the proof of the corollary. Hence we shall assume that $d > 0$. By Theorem 2, it will be sufficient to show that every positive class or every improperly primitive positive class in $G$ corresponds to a class of integral sets.

¹³ Dickson, Studies, Theorem 47, p. 54.

Suppose $d = 2d_1$ is even. Let $\mathfrak{C}$ be a class of positive forms in $G$. Let $\psi = \sum h_{ij}x_ix_j$ be the reciprocal of a form in $\mathfrak{C}$. We have seen that $\psi$ is a properly primitive form and therefore it represents an integer prime to $d$.¹⁴ We may then assume that $h_{33}$ is prime to $d$. Let $f = \sum a_{ij}x_ix_j$ be the form in $\mathfrak{C}$ reciprocal to $\psi$, and let $A_{ij}$ be the cofactor of $a_{ij}$ in the matrix $(a_{ij})$. Then $a_{11}a_{22} - a_{12}^2 = A_{33} = d_1h$, where $h = h_{33}$ is prime to $d$. Consider the binary form $f(x_1, x_2, 0)$. Since $d_1$ contains no square factor $> 1$ and is prime to $h$, the g.c.d. of the coefficients of this form is prime to $d_1$. Therefore $f(x_1, x_2, 0)$ represents an integer prime to $d_1$. Hence we may also assume that $a_{11}$ is prime to $d_1$. Let $h = ak^2$, $a_{11} = \alpha t^2$, where $a$, $\alpha$ contain no square factor $> 1$. In $f(x_1, x_2, x_3)$, set

$$\begin{aligned}
x_1 &= x - \frac{a_{12}}{a_{11}}\,y + \frac{A_{13}}{A_{33}}\,z,\\
x_2 &= y + \frac{A_{23}}{A_{33}}\,z,\\
x_3 &= z,
\end{aligned}$$

and in the resulting form, replace $x$, $y$, $z$ by $x/t$, $\alpha tz/k$, $aky$ respectively. We obtain

$$\Psi(x, y, z) = \alpha x^2 + \beta y^2 + \alpha\beta z^2,$$

where $\beta = ad_1$.
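
The first substitution above is the usual completion of squares, and its effect can be verified in exact arithmetic. The following sketch is illustrative only: the sample matrix `a` is an arbitrary positive ternary form chosen here, not a form of the genus $G$, and the names `cofactor` and `congruence` are hypothetical. It confirms that the substitution reduces $\sum a_{ij}x_ix_j$ to $a_{11}x^2 + (A_{33}/a_{11})y^2 + (|a_{ij}|/A_{33})z^2$, from which $\Psi$ follows by the stated rescaling when $A_{33} = d_1h$ and $|a_{ij}| = d_1^2$.

```python
from fractions import Fraction as F

def cofactor(a, i, j):
    """Cofactor of a[i][j] in a 3x3 matrix."""
    rows = [r for r in range(3) if r != i]
    cols = [c for c in range(3) if c != j]
    minor = (a[rows[0]][cols[0]] * a[rows[1]][cols[1]]
             - a[rows[0]][cols[1]] * a[rows[1]][cols[0]])
    return (-1) ** (i + j) * minor

def congruence(a, s):
    """Matrix s^T a s of the form after the substitution (x1, x2, x3) = s (x, y, z)."""
    a_s = [[sum(a[i][k] * s[k][j] for k in range(3)) for j in range(3)] for i in range(3)]
    return [[sum(s[k][i] * a_s[k][j] for k in range(3)) for j in range(3)] for i in range(3)]

# Arbitrary sample form (an assumption of this example).
a = [[F(2), F(1), F(0)],
     [F(1), F(3), F(1)],
     [F(0), F(1), F(2)]]

A13, A23, A33 = cofactor(a, 0, 2), cofactor(a, 1, 2), cofactor(a, 2, 2)
det_a = sum(a[0][j] * cofactor(a, 0, j) for j in range(3))

# Columns give x1, x2, x3 in terms of x, y, z as in the displayed substitution.
s = [[F(1), -a[0][1] / a[0][0], A13 / A33],
     [F(0), F(1), A23 / A33],
     [F(0), F(0), F(1)]]

diagonal_form = congruence(a, s)
```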

Since $h$ is prime to $d_1$, $\beta$ contains no square factor $> 1$. Let $\alpha = \alpha_1\delta$, $\beta = \beta_1\delta$, where $\delta$ is the g.c.d. of $\alpha$, $\beta$. Since $a_{11}$ is prime to $d_1$, $\beta_1$ is divisible by $d_1$. Let $B$ be the least positive divisor of $\beta_1$ such that $y^2 + \alpha \equiv 0 \pmod{\beta_1/B}$ has a solution. Since $f$ represents $a_{11} = \alpha t^2$ and the characters of $f$ are the $\chi_i$ of (2), it follows that $-\alpha$ is a quadratic non-residue of every prime factor of $d_1$. Hence $d_1$ divides $B$.

Let $\mathfrak{A}'$ be an algebra with the basis $1, i, j, ij$, where $i^2 = -\alpha$, $j^2 = -\beta$, $ij = -ji$. The fundamental number $d'$ of $\mathfrak{A}'$ is divisible by $B$.¹⁵ Hence $d'$ is divisible by $d_1$.

Since $\psi = \sum h_{ij}x_ix_j$ is properly primitive, after an interchange of variables if necessary, we may assume that $h_{11}$ is odd. Replace $x_1$ by $x_1 + \xi x_2 + \eta x_3$, where $\xi = 0$ or $1$ according as $h_{22}$ is even or odd and $\eta = 0$ or $1$ according as $h_{33}$ is even or odd. In the resulting form, $\psi_1 = \sum k_{ij}x_ix_j$, $k_{11} + 1 \equiv k_{22} \equiv k_{33} \equiv 0 \pmod 2$. Since $|k_{ij}| = d_1$ is odd, it follows that $k_{23}$ is odd. Let $f_1 = \sum c_{ij}x_ix_j$ be the form in $\mathfrak{C}$ reciprocal to $\psi_1$. Then $c_{11} = k_{22}k_{33} - k_{23}^2$, $c_{11}c_{22} - c_{12}^2 = d_1k_{33}$, $c_{11}c_{33} - c_{13}^2 = d_1k_{22}$, and therefore

$$c_{11} \equiv -1 \pmod 4, \qquad c_{22} \equiv c_{12}, \quad c_{33} \equiv c_{13} \pmod 2. \tag{5}$$


¹⁴ Dickson, Studies, p. 14, Theorem 16.

¹⁵ This Journal, vol. 1 (1935), p. 435.
















Let $g_{ij}$, $g'_{ii}$ be the integers defined by equating the matrix $\Gamma$ of §3 to $(c_{ij})$ and let $g_{22} = 2g'_{22}$, $g_{33} = 2g'_{33}$. Consider the matrix $M = (g_{ij})$ of §2 with the elements $g_{ij}$ as thus defined. Since $|c_{ij}| = d_1^2$, it may be shown that $|M| = 4d_1^2 = d^2$. Let $G_{00} = |g_{ij}|$ $(i, j = 1, 2, 3)$. We have $d^2 = 2G_{00} - (g_{22}g_{33} - g_{23}^2)$. Since $g_{22}g_{33} - g_{23}^2 = 4(c_{22}c_{33} - c_{23}^2) = 4d_1k_{11}$ and $k_{11}$ is odd, it follows that $G_{00}$ is divisible by $2d$. Noting that by (5), $g_{22} + 2g_{12} \equiv g_{33} + 2g_{13} \equiv 0 \pmod 4$, and employing the fact that every second order minor of $(c_{ij})$ is divisible by $d_1$, we may show that $M = (g_{ij})$ satisfies the remaining conditions of (c) in Theorem 1.

We have seen that $f$ is transformed into $\Psi$ by a non-singular transformation with rational coefficients. Hence $\Psi(y_1, y_2, y_3)$ is transformed into $f_1(x_1, x_2, x_3)$ by such a transformation, $y_i = \sum_j s_{ij}x_j$ $(i = 1, 2, 3)$. Let $\omega_1$, $\omega_2$, $\omega_3$ be elements of $\mathfrak{A}'$ defined by

$$\begin{aligned}
2\omega_1 - 1 &= s_{11}i + s_{21}j + s_{31}ij,\\
\omega_2 &= s_{12}i + s_{22}j + s_{32}ij,\\
\omega_3 &= s_{13}i + s_{23}j + s_{33}ij.
\end{aligned}$$

Then $\omega_0 = 1, \dots, \omega_3$ form a basis of $\mathfrak{A}'$. We have $X_1 = x_1(2\omega_1 - 1) + x_2\omega_2 + x_3\omega_3 = y_1i + y_2j + y_3ij$ and therefore $N(X_1) = f_1(x_1, x_2, x_3)$. Since $T(X_1) = 0$, it follows that if $X = \sum x_i\omega_i$, then $N(X) = x_0^2 + x_0x_1 + x_1^2/4 + f_1(x_1/2, x_2, x_3) = \Phi(x_0, x_1, x_2, x_3)$, where $M$ is the matrix of $2\Phi$.

We may set $1 = i_0$, $i = \alpha^{1/2}i_1$, $j = \beta^{1/2}i_2$, $ij = (\alpha\beta)^{1/2}i_3$, where the $i_k$ are the hamiltonian quaternion units. Then $\omega_r = \sum_k t_{rk}i_k$, where the $t$'s are real. Since the determinant of $N(\sum x_i\omega_i)$ is $d^2/16$, it follows that, after changing the sign of $\omega_3$ if necessary, $d = 4\,|t_{rk}|$.

Fueter considered an integral set in a quaternion algebra with the fundamental number $d$.¹⁶ If $\omega_0 = 1, \dots, \omega_3$ form a basis of this set, he determined explicitly the integers $c_{ijk}$, defined by $\omega_i\omega_j = \sum_k c_{ijk}\omega_k$, in terms of the coefficients of the form $N(\sum x_i\omega_i) = \tfrac{1}{2}\sum g_{ij}x_ix_j$ and the $q$'s defined by certain relations between the $\omega$'s. Since his $\omega$'s form a basis of an integral set, it follows from his definitions of the $q$'s that they are integers.

Consider Fueter's argument, beginning on p. 651 and leading to equations (11) on p. 654, as applied to the present $\omega$'s. The argument holds without modification except that the definitions of the $q$'s now imply only that they are rational. However, since they satisfy his equations (7), p. 653, it follows that every $dq_{ij}$ is equal to the cofactor of one of the elements of $M$. Since $M$ satisfies (c) of Theorem 1, it follows that every $q_{ij}$ is an integer. Hence by Fueter's equations (11), p. 654, $\omega_i\omega_j = \sum_k c_{ijk}\omega_k$, where each $c$ is either an integer or half an odd integer. By a result due to Brandt, the $c$'s are all integral.¹⁷

Let $\mathfrak{G}$ be the set of all elements $X = \sum x_i\omega_i$ with integral $x$'s. By definition





16 Zur Theorie der Brandtschen Quaternionenalgebren, Mathematische Annalen, vol. 110
(1934-5), pp. 651-44. 

17 Der Kompositionsbegriff bei den quaternären quadratischen Formen, Mathematische
Annalen, vol. 91 (1924), p. 303.






























CLASSES OF INTEGRAL SETS IN QUATERNION ALGEBRA 243 


of the $\omega$'s, $T(X) = 2x_0 + x_1$. Since $N(X) = \Phi(x_0, \cdots, x_3)$, it follows that $\mathfrak{G}$
has the property R used in our definition of an integral set. By the preceding
paragraph, $\mathfrak{G}$ has the property C. It obviously has the property U. Let $\mathfrak{G}'$
be an integral set in $\mathfrak{A}'$ containing $\mathfrak{G}$, i.e., a maximal set with the properties
R, C, U and containing $\mathfrak{G}$. The norm of the general element in $\mathfrak{G}'$ is a form
$\Phi'$ and by Theorem 1, the determinant of $2\Phi'$ is $d'^2$. Since $\Phi'$ is transformed into
$\Phi$ by an integral transformation, it follows that $d'$ divides $d$. But we have seen
that $d_1$ divides $d'$. Hence $d' = d_1$ or $d' = d$. Since every positive fundamental
number is the product of an odd number of prime factors, it follows that $d' = d$
and hence $\mathfrak{A}'$ is equivalent to $\mathfrak{A}$.¹⁸ We may therefore identify $\mathfrak{A}'$ with $\mathfrak{A}$.
Then $\mathfrak{G}' = \mathfrak{G}$ is an integral set in $\mathfrak{A}$. Since $N[x_1(2\omega_1 - 1) + x_2\omega_2 + x_3\omega_3] =
f_1(x_1, x_2, x_3)$, it follows that $\mathfrak{C}$ is the class of forms corresponding to the class of
integral sets containing $\mathfrak{G}$. This proves the theorem for the case where $d$
is even.

Suppose $d$ is odd. Let $\mathfrak{C}$ be a class of positive improperly primitive forms in
G and let $2f = \sum a_{ij}x_ix_j$ be a form in $\mathfrak{C}$. Since $|a_{ij}| = 2d^2$, one of the $a_{ij}$,
say $a_{23}$, is odd. If $a_{12}$ is odd, replace $x_3$ by $x_1 + x_3$ and if $a_{13}$ is odd, replace
$x_2$ by $x_1 + x_2$. We may then assume $a_{12} \equiv a_{13} \equiv a_{23} + 1 \equiv 0 \pmod 2$. Every
term in the expansion of $|a_{ij}|$ is divisible by 8 except $-a_{11}a_{23}^2$. Hence $a_{11} \equiv -2$
(mod 8). Let gi;, g:; be the integers defined by equating the matrix T of §3 
to the matrix $(a_{ij}/2)$ of $f$, let $g_{22} = 2q_{22}$, $g_{33} = 2q_{33}$ and let $M$ be the matrix of §2
with the elements $g_{ij}$ as thus defined. Since the invariants of $2f$ are $\Omega = d$,
$\Delta = 2$, it may be shown that $M = (g_{ij})$ satisfies all the conditions of Theorem 1.

We may then show, in the same manner as for the case where $d$ is even, that
there is an integral set $\mathfrak{G}$ in $\mathfrak{A}$ with a normal basis $\omega_0, \cdots, \omega_3$ such that $N(\sum x_i\omega_i)
= \Phi(x_0, \cdots, x_3)$, where $M$ is the matrix of $2\Phi$ and $N[x_1(2\omega_1 - 1) + x_2\omega_2 + x_3\omega_3]
= f(x_1, x_2, x_3)$. Then $\mathfrak{C}$ is the class of forms corresponding to the class of
integral sets containing $\mathfrak{G}$ and the theorem is proved.


5. A basis of an integral set. Let $F$ be a quadratic field with the discriminant
$-\tau$, where $\tau$ is a prime not a divisor of $d$. By a result previously referred to, $\mathfrak{A}$
contains a field equivalent to $F$ if and only if no prime factor of $d$ is the product
of two prime ideals in $F$. This condition is satisfied if and only if $-\tau$ is a quadratic
non-residue of every odd prime factor of $d$ and $\tau \equiv 3 \pmod 8$ if $d$ is even.
It is known that for every such field, called a canonical splitting field, $\mathfrak{A}$ has a
basis $1, i, j, ij$, where $i^2 = -\tau$, $j^2 = -d$, $ij = -ji$.¹⁹ Such a basis of $\mathfrak{A}$ will be
called a canonical basis.

Let $\mathfrak{G}$ be an integral set in $\mathfrak{A}$ and let $F$ be a canonical splitting field of $\mathfrak{A}$
with the discriminant $-\tau$. The intersection of $\mathfrak{G}$ and $F$ is a ring with a basis $1$,
$c\omega$, where $1, \omega$ form a basis of all the integral elements of $F$ and $c$ is a positive
integer. Hull determined a basis $v_0, v_1, v_2, v_3$ of $\mathfrak{G}$ such that $v_0 = 1$, $v_1 = c\omega$.²⁰


18 Brandt, loc. cit., pp. 12, 13. 
19 This Journal, vol. 1 (1935), p. 435. 
20 Transactions of the American Mathematical Society, vol. 40 (1936), p. 8. 













$c$ is not in general equal to unity. We shall show that there is an infinitude of
canonical splitting fields such that for each of them, $c = 1$.²¹ Choosing $F$ as
such a field and setting $c = 1$ in the $v_i$, we obtain from them a relatively simple
normal basis of $\mathfrak{G}$.

Let $\omega_0, \cdots, \omega_3$ be a normal basis of $\mathfrak{G}$ and let $f(x_1, x_2, x_3)$ be as defined in §3.
We shall first prove

LEMMA 2. $f(x_1, 2x_2, 2x_3)$ represents an infinitude of primes.

Let $d' = d/2$ and $f'(x_1, x_2, x_3) = f(x_1, x_2, x_3)$ or let $d' = d$, $f'(x_1, x_2, x_3) =
f(x_1, x_2, 2x_3)$ according as $d$ is even or odd. If $d$ is even, we have seen that the
invariants of $f'$ are $\Omega = d'$, $\Delta = 1$ and the reciprocal $\psi'$ of $f'$ is properly primitive.
It may be shown that $f'$ and $\psi'$ have these properties if $d$ is odd.

For properly chosen integers $C_i$, $\psi'(C_1, C_2, C_3) = C > 0$, where $C$ is prime
to $d'$.²² After adding properly chosen multiples of $d'$ to the $C_i$, we may assume
$C_1 \equiv 0$, $C_2 \equiv C_3 \equiv +1 \pmod 4$. Finally, without disturbing the preceding
conditions, we may assume that the g.c.d. of the $C_i$ is unity. Let $a_1 =
(C_2 + C_3)/\delta$, $a_2 = a_3 = -C_1/\delta$, where $\delta$ is the g.c.d. of $C_1$ and $C_2 + C_3$. Then
$a_1$ is odd, $a_2$ and $a_3$ are even, the g.c.d. of the $a_i$ is unity and $\sum a_iC_i = 0$. We
may then determine integers $b_i$ such that²³

$C_1 = \begin{vmatrix} a_2 & b_2 \\ a_3 & b_3 \end{vmatrix}$, $C_2 = \begin{vmatrix} a_3 & b_3 \\ a_1 & b_1 \end{vmatrix}$, $C_3 = \begin{vmatrix} a_1 & b_1 \\ a_2 & b_2 \end{vmatrix}$.

In $f'(x_1, x_2, x_3)$, set $x_i = a_iy_1 + b_iy_2$ $(i = 1, 2, 3)$. We obtain a primitive
form $\phi(y_1, y_2) = ay_1^2 + 2ty_1y_2 + by_2^2$, where $ab - t^2 = Cd'$.²⁴ We have $a =
f'(a_1, a_2, a_3)$. Since the leading coefficient of $f'$ is $\equiv 3 \pmod 4$ and $a_1$ is odd, $a_2$
and $a_3$ are even, it follows that $a \equiv 3 \pmod 4$. Then $\phi(y_1, 2y_2)$ is a properly
primitive form whose discriminant is not a perfect square. Hence it represents
an infinitude of primes. Since $a_2$, $a_3$ are even, the lemma follows.

Let $\tau$ be one of the infinitude of primes represented by $f(x_1, 2x_2, 2x_3)$ and
not a divisor of $d$. Then there is an element $i = x_1(2\omega_1 - 1) + 2x_2\omega_2 + 2x_3\omega_3$
in $\mathfrak{G}$ such that $i^2 = -\tau$. Since $x_1$ is odd, $\mathfrak{G}$ contains $\lambda_1 = (1 + i)/2$. The field
$F(i)$ is a canonical splitting field of $\mathfrak{A}$ and $\mathfrak{A}$ has a basis $1, i, j, ij$, where $j^2 = -d$,
$ij = -ji$.

Identify Hull's Q, M, p, u, P, ao²⁵ with our $\mathfrak{A}$, $\mathfrak{G}$, $i$, $j$, $F(i)$, $-d$ respectively.
Then $\mathfrak{G}$ has a basis in the form below,²⁶ if we note that $\mathfrak{G}$ contains $\lambda_1$, and hence
Hull's $c = 1$.


m= 1, m= (—7r +1)/2, 2 = g(X+ jd), w= g&+)) E me us i! 


21 Cf. Hull, loc. cit., p. 11, lines 9-13.

22 Dickson, Studies, p. 4, Theorem 16.

23 Dickson, Studies, p. 11, Theorem 9.

24 Dickson, Studies, Theorem 27, p. 25 and Theorem 37, p. 32.
25 Loc. cit., Theorem 6, p. 7.

26 Hull, loc. cit., p. 8.





















Employing Hull’s equations (16) and (17), p. 7, loc. cit., and setting b = 1 — 2h, 
we find 

3= ko(1 — bi) /2 + es — xi = = ij. 
Since 6 is odd and g) is an integral element in F (7), it follows that G has a basis 
ho, -** , As as below. N(A;) is an integer and therefore the congruences (5) 
are satisfied. 

To prove the last sentence of the following theorem, let $\mathfrak{G}$ be the set of all
elements $\sum x_i\lambda_i$ with integral $x$'s. It may be verified that $\mathfrak{G}$ has the properties
R, C, U used in our definition of an integral set. Since the form $\Phi = N(\sum x_i\lambda_i)$
is obtained from $N(x + yi + zj + wij)$ by a transformation of determinant
$(4\tau)^{-1}$, it follows that the determinant of $2\Phi$ is $d^2$. Hence $\mathfrak{G}$ is maximal. We
have then

THEOREM 4. Let $\mathfrak{G}$ be an integral set in $\mathfrak{A}$. There is an infinitude of primes $\tau$
such that for every $\tau$ there is an element $i$ in $\mathfrak{G}$ such that (a) $i^2 = -\tau$, (b) the field
$F(i)$ is a canonical splitting field of $\mathfrak{A}$, (c) $\mathfrak{G}$ contains $(1 + i)/2$. For every such
$i$, there is a canonical basis $1, i, j, ij$ of $\mathfrak{A}$ and $\mathfrak{G}$ has a basis in the form

(4) $\lambda_0 = 1$, $\lambda_1 = (1 + i)/2$, $\lambda_2 = gj$, $\lambda_3 = \dfrac{a}{\tau}i + \dfrac{b}{2g}j + \dfrac{1}{2g\tau}ij$,

where $a$, $b$, $g > 0$ are integers such that

(5) $4a^2g^2 + d \equiv 0 \pmod{\tau}$, $\tau b^2 + 1 \equiv 0 \pmod{4g^2}$.

Conversely, if $1, i, j, ij$ form a canonical basis of $\mathfrak{A}$, $i^2 = -\tau$, and if $a$, $b$, $g$ are
integers which satisfy (5), then $\lambda_0, \cdots, \lambda_3$ of (4) form a normal basis of an integral
set.
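The determinant statement used in the proof of Theorem 4 can be illustrated numerically. The sketch below assumes the reading $\lambda_3 = (a/\tau)i + (b/2g)j + (1/2g\tau)ij$ of (4) and uses $N(x + yi + zj + wij) = x^2 + \tau y^2 + dz^2 + d\tau w^2$; the values $d = 2$, $\tau = 3$, $a = b = g = 1$ are hypothetical, chosen only so that the congruences (5) hold, and the function names are not from the paper.

```python
from fractions import Fraction
from itertools import permutations


def det(m):
    """Exact determinant via the Leibniz permutation expansion."""
    n = len(m)
    total = Fraction(0)
    for perm in permutations(range(n)):
        inv = sum(1 for a in range(n) for b in range(a + 1, n) if perm[a] > perm[b])
        term = Fraction(-1 if inv % 2 else 1)
        for row, col in enumerate(perm):
            term *= m[row][col]
        total += term
    return total


def basis_coordinates(tau, a, b, g):
    """Coordinates of lambda_0..lambda_3 of (4) relative to 1, i, j, ij (assumed reading)."""
    F = Fraction
    return [
        [F(1), F(0), F(0), F(0)],
        [F(1, 2), F(1, 2), F(0), F(0)],
        [F(0), F(0), F(g), F(0)],
        [F(0), F(a, tau), F(b, 2 * g), F(1, 2 * g * tau)],
    ]


def gram_matrix(d, tau, a, b, g):
    """Matrix of the form 2*N(sum x_k lambda_k)."""
    weights = [1, tau, d, d * tau]  # norms of 1, i, j, ij
    lam = basis_coordinates(tau, a, b, g)
    return [[2 * sum(w * p * q for w, p, q in zip(weights, lam[r], lam[s]))
             for s in range(4)] for r in range(4)]


# Illustrative values only: they satisfy the two congruences (5).
d, tau, a, b, g = 2, 3, 1, 1, 1
assert (4 * a * a * g * g + d) % tau == 0
assert (tau * b * b + 1) % (4 * g * g) == 0

M = gram_matrix(d, tau, a, b, g)
print(M, det(M))
```

For these values every entry of the matrix is an integer and its determinant equals $d^2$, in agreement with the transformation determinant $(4\tau)^{-1}$ quoted in the text. The sketch does not test the closure property C.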

We shall show by an example that a canonical splitting field $F(i)$ cannot
always be chosen so that an integral set has a basis in the form (4) with $g = 1$.
Let $\mathfrak{A}$ be the algebra with a basis $1, I, J, IJ$, where $I^2 = -83$, $J^2 = -10$, $IJ =
-JI$. The fundamental number of $\mathfrak{A}$ is $d = 2\cdot5\cdot83$.²⁷ It may be verified
that the following elements form a basis of an integral set $\mathfrak{G}$ in $\mathfrak{A}$.


wo = l, w, = #14 J), w = 3J, ws = (-5 + I)J. 


It may also be verified that $\mathfrak{G}$ contains no element of trace zero and norm $d$.
Hence there is no canonical splitting field $F(i)$ such that $\mathfrak{G}$ has a basis in the
form (4) with $g = 1$.


6. Proof of Lemma 1. Let $\lambda_0, \cdots, \lambda_3$ of (4) be a basis of $\mathfrak{G}$. Since $2\lambda_1 - 1
= i$ is in $\mathfrak{G}_1$, the intersection of $\mathfrak{G}_1$ with the set of all integral elements of $F(i)$
is a ring with the conductor $c = 1$ or $c = 2$. If $c = 1$, $\mathfrak{G}_1$ contains $\lambda_1$ and, since
$T(\lambda_2) = T(\lambda_3) = 0$, $\mathfrak{G}_1$ also contains $\lambda_2$, $\lambda_3$. Hence $\mathfrak{G}_1$ contains $\mathfrak{G}$ and the
lemma follows. We shall show that the case $c = 2$ cannot occur.


27 This Journal, vol. 1 (1935), p. 435.


























Suppose c = 2. By Hull’s result,” G, has a basis in the form 


th—7r+ ‘) 
291 P 





v% = 1, my = t, vo = (A + J)g:/2, vy = (A+ a( 


where \ is a certain element in F(z) and g; is a positive integer. If fo, --- , & 
is a normal basis of @,, the trace of each of the elements 2¢; — 1, fs, &3 is zero. 
It follows that the double of every element of @, is in G. In particular, 2v is 
in @. Hence g divides g,. Since dz» is in G,, g; divides 2g. But by Hull’s 
Theorem 9 (loc. cit., p. 10), g:isodd. Hence g = g:. By Hull’s Theorems 7, 8, 
the lemma on p. 9 and (16) on p. 7, d and h are odd. Then in his (17), ki = 
ke + 1 (mod 2) and we find 


een __ he _f , ey. , &-1 +8, 
V2 5) wR Tae U3 Bt ma (w+ 2g toi te, 





where z, w are certain rational numbers. Since k; or kz is even and T(A3) = 0, 
the right member of one of these equations is in @ and hence is expressible as a 
linear function of Ao, A1, Az with integral coefficients. But this is obviously 
impossible in the first case and, since h and b are odd, it is impossible in the 
second case. Therefore c # 2 and the lemma follows. 





7. Certain integral sets. Let $I$, $J$ be elements of $\mathfrak{A}$ such that
(a) $1, I, J, IJ$ form a basis of $\mathfrak{A}$;

(b) $I^2 = -\alpha$, $J^2 = -\beta$, $IJ = -JI$, where $\alpha$, $\beta$ are integers, neither divisible
by a square $> 1$ and $\alpha \equiv \beta \pmod 2$.

All the integral sets, finite in number, which contain $I$ and $J$ were determined
by Darkow²⁹ and the writer,³⁰ the former treating the case where $\alpha$ and $\beta$ are
even. Of course, $\alpha$ and $\beta$ are not uniquely determined by $\mathfrak{A}$. Let $\mathfrak{G}$ be an
arbitrarily chosen integral set in $\mathfrak{A}$. The question arises as to whether or not
$\mathfrak{G}$ may be obtained from the above mentioned results by proper choice of $I$
and $J$, i.e., whether or not $\mathfrak{G}$ contains elements $I$ and $J$ which satisfy the conditions
(a) and (b). We shall show that it does contain such elements.

Let $\lambda_0, \cdots, \lambda_3$ of Theorem 4 be a basis of $\mathfrak{G}$. $\mathfrak{G}$ contains $i = 2\lambda_1 - 1$,

$J = -axi + y\lambda_2 + \tau x\lambda_3 = (\tau bx + 2g^2y + xi)j/2g$

and $iJ$, where $x$ and $y$ are rational integers to be determined later. We have
$J^2 = -d\phi(x, y)$, where $\phi(x, y) = \tau cx^2 + \tau bxy + g^2y^2$, $c = (\tau b^2 + 1)/4g^2$. The








28 Loc. cit., p. 8. 
29 Determination of a basis for the integral elements of certain generalized quaternion
algebras, Annals of Mathematics, vol. 28 (1926-27), pp. 263-270.

30 Arithmetics of generalized quaternion algebras, American Journal of Mathematics,
vol. 48 (1926), pp. 57-63.













discriminant of $\phi$ is $-\tau$ and hence it represents an infinitude of primes. Therefore,
for properly chosen integers $x$ and $y$, $J^2 = -dp$, where $p$ is a prime not a
divisor of $2d\tau$. Then $dp$ has no square factor $> 1$ and is prime to $\tau$. It may be
shown that $1, i, J, iJ$ are linearly independent. Hence they form a basis of $\mathfrak{A}$;
furthermore, $iJ = -Ji$.

Let $I = i$ or $I = iJ$ according as $d$ is odd or even. Then $I$ and $J$ satisfy
all the conditions (a) and (b). Since $\mathfrak{G}$ contains $I$ and $J$, it follows that it is
one of the sets obtained by properly specializing the above mentioned results
by Darkow or by the writer, according as $d$ is even or odd.
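The relations $J^2 = -d\phi(x, y)$ and $iJ = -Ji$ used in §7 can be confirmed by direct multiplication. The following is a minimal sketch, assuming only the rules $i^2 = -\tau$, $j^2 = -d$, $ij = -ji$; elements are stored as coordinate 4-tuples relative to $1, i, j, ij$, the helper names are illustrative, and the values $\tau = 3$, $d = 2$, $b = g = 1$ are hypothetical choices satisfying $\tau b^2 + 1 \equiv 0 \pmod{4g^2}$.

```python
from fractions import Fraction


def qmul(p, q, tau, d):
    """Product in the algebra with basis 1, i, j, k = ij (i^2 = -tau, j^2 = -d, ij = -ji)."""
    a0, a1, a2, a3 = p
    b0, b1, b2, b3 = q
    return (
        a0 * b0 - tau * a1 * b1 - d * a2 * b2 - tau * d * a3 * b3,
        a0 * b1 + a1 * b0 + d * (a2 * b3 - a3 * b2),
        a0 * b2 + a2 * b0 + tau * (a3 * b1 - a1 * b3),
        a0 * b3 + a3 * b0 + a1 * b2 - a2 * b1,
    )


def element_J(x, y, tau, b, g):
    """Coordinates of J = (tau*b*x + 2*g^2*y + x*i) j / (2g)."""
    return (Fraction(0), Fraction(0),
            Fraction(tau * b * x + 2 * g * g * y, 2 * g),
            Fraction(x, 2 * g))


def phi(x, y, tau, b, g):
    """phi(x, y) = tau*c*x^2 + tau*b*x*y + g^2*y^2 with c = (tau*b^2 + 1)/(4 g^2)."""
    c = Fraction(tau * b * b + 1, 4 * g * g)
    return tau * c * x * x + tau * b * x * y + g * g * y * y


tau, d, b, g = 3, 2, 1, 1           # illustrative values only
J = element_J(1, 1, tau, b, g)
print(qmul(J, J, tau, d)[0], phi(1, 1, tau, b, g))
```

With $x = y = 1$ these values give $\phi = 7$, a prime not dividing $2d\tau$, so that $J^2 = -14$ has the shape $-dp$ required in the text.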


UNIVERSITY OF KENTUCKY. 








THE ERGODIC FUNCTION OF BIRKHOFF 
By Monroe H. Martin 


Introduction. In his New Orleans lecture, Professor Birkhoff¹ introduced
the concept of the ergodic function $T(\epsilon)$ as the least time $T$ which elapses before
the point $P$ of some motion can come within a distance $\epsilon$ of every point of the
phase space. He is led to conjecture that in the general closed recurrent case
possessing no stable periodic motions the ergodic function is of the order of
$\epsilon^{-(n-1)}$, $n$ being the number of dimensions of the phase space. The purpose of
this paper is to consider the closed, transitive dynamical systems provided
by the geodesics on topologically closed surfaces of constant negative curvature.
The principal result is the establishment of upper bounds for the ergodic functions
of these dynamical systems.

The metric chosen for the phase space is patterned after that used by Morse.²
The “time” $T$ along a motion is taken as the “H-length” measured along a
geodesic on the surface (see §9 of the present paper).

If $\epsilon$ be chosen in compliance with (67), (68), we find that the ergodic function
$T(\epsilon)$ satisfies an inequality of the form

$T(\epsilon) < \epsilon^{-\omega}\left[A \log \dfrac{B}{\epsilon} + C\right]$,

where $A$, $B$, $C$ and $\omega$ depend on the genus $p$ of the surface as set forth in (77),
(78), (79) and (80). All of these constants tend to $+\infty$ for $p \to +\infty$, the constant
$\omega$, in particular, being bounded by the inequalities (81), so that $\omega > 4 > 2$
for $p = 2, 3, \cdots$. Since $n - 1 = 2$ in the case under consideration, the order
of $T(\epsilon)$ as conjectured by Birkhoff lies well below that of the upper bound we
have found.

It may not be amiss to point out that the upper bound we have secured can 
doubtless be sharpened considerably, for in deriving it we compute an upper 
bound for the magnitude of the interval of time during which a point P of a 
certain motion comes within a distance $\epsilon$ of every point of the phase space at
least once, the point $P$ perchance coming repeatedly within a distance $\epsilon$ of some
or all of the points of the phase space. In addition, the upper bound, (4p)°”, 
given in §10 for the number of sets Sy can in all probability be improved upon, 
with a resultant improvement in the upper bound for the ergodic function. 


Received August 23, 1935. This paper had its inception while the author was National 
Research Fellow at Harvard University in 1933. 
1 G. D. Birkhoff, Bull. Amer. Math. Soc., vol. 38 (1932), pp. 375-377.
2 M. Morse, Jour. de Math., vol. 14 (1935), pp. 52-53. This paper will be referred to as
Morse I.

















ERGODIC FUNCTION OF BIRKHOFF 249 


1. Preliminary notions. Let $(x, y)$ denote ordinary Cartesian coördinates
in the plane. Let $\Psi$ denote the unit circle $x^2 + y^2 = 1$, and let the interior of $\Psi$,
provided with the metric

(H) $ds^2 = 4(1 - x^2 - y^2)^{-2}(dx^2 + dy^2)$,

be denoted by $\Phi$. The Gaussian curvature of this metric equals $-1$, so that
$\Phi$ may be regarded as the hyperbolic plane of non-euclidean geometry. Linear
fractional transformations of hyperbolic type on the complex variable $x + iy$
which take the interior of $\Psi$ into itself leave (H) invariant, and are called
H-transformations (hyperbolic transformations). The geodesics on $\Phi$ are circular
arcs orthogonal to $\Psi$ and ending upon it. They are called H-lines. Through
two points on $\Phi$ only one H-line can be drawn. Consequently if two H-lines
intersect in more than one point, they coincide. Two H-lines which have no
common point either on $\Phi$ or on $\Psi$ are said to be non-intersecting. A segment
of an H-line which has both end-points $A$, $B$ in $\Phi$ is an H-line segment, and is
denoted by $AB$; a segment which has one end-point in $\Phi$ and one on $\Psi$ is an
H-ray. The H-line of which a given H-line segment is a segment is, at times,
conveniently referred to as the H-line segment produced. If $A$, $B$ are two distinct
points on $\Psi$ ($\Phi$), there is exactly one H-line (H-line segment) joining them.
On directing this H-line (H-line segment) from $A$ to $B$, we obtain a directed
H-line $AB$ (directed H-line segment $AB$) for which $A$ is the $\alpha$-end-point and $B$
the $\omega$-end-point. If an H-line (H-line segment) and a segment of itself are
directed in the same sense, the (latter) directed H-line segment is a segment
of the (former) directed H-line (directed H-line segment). A directed H-line
(directed H-line segment) intersects a set of directed H-line segments if one
of the directed H-line segments in the set is a segment of the directed H-line
(directed H-line segment). A set $S_1$ of directed H-line segments is a subset
of a set $S$ of directed H-line segments (directed H-lines) if every member of $S_1$
is a segment of some member of $S$.

The length of an H-line segment $AB$ calculated in the metric (H) is its H-length,
and is denoted by $|AB|$. The H-distance between two points $A$, $B$ of
$\Phi$ is then defined to be $|AB|$. When an H-line segment is designated by a single
letter, say $n$, its H-length is designated³ by $|n|$. The H-mid-point of an H-line
segment is defined to be that point of the H-line segment which divides it into
two H-line segments of equal H-lengths. Angular magnitudes measured in
the metric (H) equal the ordinary Euclidean magnitudes. No distinction will
therefore be made between the two. If $A$ is a point not lying on a given H-line
$l$ and $B$ the point of $l$ such that $AB$ meets $l$ at right angles, $AB$ is the H-perpendicular
let fall from $A$ to $l$. $|AB|$ is the H-distance between $A$ and $l$ and is,
as a matter of fact, the minimum H-distance between $A$ and the points of $l$.
If two H-lines l, m are non-intersecting, they possess a unique mutual H- 





3 At times, we also employ the symbol | | in its customary usage, that of denoting
absolute values. No confusion need arise, since the proper interpretation will be clear from
the context.














250 MONROE H. MARTIN 


perpendicular, the H-length of which is denoted by $|lm|$, and is called the
H-distance between $l$ and $m$. $|lm|$ is then the minimum H-distance between the
points of $l$ and $m$.

If $A$, $B$, $C$ are three non-coincident points of $\Phi$ such that two of the H-line
segments $AB$, $BC$, $CA$ meet at right angles, say $BC$ and $CA$, the following
relations⁴ hold:

(1) $\sin A = \dfrac{\sinh|BC|}{\sinh|AB|}$, $\cos A = \dfrac{\cosh|BC|\,\sinh|AC|}{\sinh|AB|}$, $\cosh|AB| = \cosh|AC|\cosh|BC|$,

where $A$ denotes the angle $CAB$. If $A$, $B$, $C$ are three arbitrary points of $\Phi$,
the following “triangle” inequality holds:

$|AB| \le |BC| + |CA|$.
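The relations (1) can be tested numerically in the metric (H). The sketch below is illustrative only: it assumes the standard closed form $2\,\mathrm{artanh}\,|z - w|/|1 - \bar z w|$ for the H-distance belonging to (H), places the right angle $C$ at the center of the disk, and measures the angle $CAB$ after moving $A$ to the center by an H-transformation. The names `h_distance` and `to_center` are hypothetical helpers.

```python
import cmath
import math


def h_distance(z, w):
    """H-distance in the unit disk for ds^2 = 4(1 - x^2 - y^2)^(-2)(dx^2 + dy^2)."""
    return 2.0 * math.atanh(abs(z - w) / abs(1 - z.conjugate() * w))


def to_center(a, z):
    """Linear fractional map of the disk onto itself carrying a to 0."""
    return (z - a) / (1 - a.conjugate() * z)


# Right angle at C: the H-lines through the center are Euclidean diameters.
C = complex(0.0, 0.0)
A = complex(0.6, 0.0)
B = complex(0.0, 0.35)

ab, ac, bc = h_distance(A, B), h_distance(A, C), h_distance(B, C)

# The map is conformal, so the angle CAB equals the angle at 0 between the images.
angle_A = abs(cmath.phase(to_center(A, B) / to_center(A, C)))

print(math.cosh(ab), math.cosh(ac) * math.cosh(bc))
print(math.sin(angle_A), math.sinh(bc) / math.sinh(ab))
```

Each printed pair agrees to rounding error for this choice of points; other legs may be tried by changing `A` and `B` along the two axes.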


2. Gaussian geodesic, and geodesic polar coördinates. Let $n$ be a given
directed H-line segment. By a Gaussian geodesic coördinate system $[u_n, v_n]$ on $\Phi$,
we mean a coördinate system on $\Phi$ in which the coördinate lines $v_n =$ const.
are H-lines orthogonal to the directed H-line of which $n$ is a segment, and the
coördinate lines $u_n =$ const. are the orthogonal trajectories of the coördinate
lines $v_n =$ const. The origin $[0, 0]$ is taken as the $\alpha$-end-point of $n$. The positive
sense on $u_n = 0$ is chosen to coincide with that on $n$ and the sense on $v_n =$
const. is then affixed in the usual way, i.e., so that an observer moving in the
positive sense along $u_n = 0$ finds the positive sense of $v_n =$ const. directed to
his right. A unique coördinate system $[u_n, v_n]$ is thereby associated with any
directed H-line segment $n$. The coördinate lines $u_n =$ const. are circular arcs
in $\Phi$ which connect the end-points of $u_n = 0$. They cut off equal H-lengths
upon the coördinate lines $v_n =$ const. If we take $u_n$ as the H-length of arc
measured along $v_n =$ const. from $u_n = 0$, and $v_n$ as the H-length of arc measured
along $u_n = 0$ from $[0, 0]$, the differential form (H) becomes⁵

(G) $ds^2 = du_n^2 + \cosh^2 u_n\, dv_n^2$

in the $[u_n, v_n]$ coördinate system. Hence the H-length $\kappa$ of the segment of
$u_n = k$ comprehended between $v_n = 0$ and $v_n = |n|$ is given by

(2) $\kappa = \displaystyle\int_0^{|n|} \cosh k\, dv_n = |n| \cosh k$.

Let $P$, $P'$ be two points of $u_n = a$. Denote by $\sigma$ the H-length of the segment
$PP'$ of $u_n = a$. Let $\alpha$, $\alpha'$ denote the directions (measured in the usual way in

4 See, for example, H. S. Carslaw, The Elements of Non-Euclidean Plane Geometry and
Trigonometry, London, 1916, p. 109.

5 See, for example, W. Blaschke, Vorlesungen über Differentialgeometrie, vol. 1, 1930,
pp. 155-156.







the $x, y$-plane) of the coördinate lines $v_n =$ const. at $P$, $P'$ respectively. We
proceed to calculate an upper bound for $|\alpha - \alpha'|$ when $P$, $P'$ both lie in the
region


of $\Phi$. Since the coördinate lines $v_n =$ const. are orthogonal to $u_n = a$, it is
sufficient to obtain an upper bound for the difference $|\beta - \beta'|$ in the directions
$\beta$, $\beta'$ of $u_n = a$ at $P$ and $P'$ respectively.

Let us assume (with no loss of generality) that $u_n = 0$ is drawn orthogonal
to the positive $x$-axis of the $x, y$-coördinate system at a point which lies at an
H-distance $\xi$ from the origin of the $x, y$-coördinate system. The equation of
$u_n = a$ in $x, y$-coördinates is then

$x^2 + y^2 + \dfrac{2\cosh\xi}{\sinh a - \sinh\xi}\,x + \dfrac{\sinh\xi + \sinh a}{\sinh\xi - \sinh a} = 0$.

Letting $K$ denote the ordinary Euclidean curvature of this circle, we find

$|K| = \dfrac{|\sinh a - \sinh\xi|}{\cosh a}$.

Now

$\left|\dfrac{d\beta}{ds}\right| = \left|\dfrac{d\beta}{ds'}\right|\left|\dfrac{ds'}{ds}\right| = |K|\left|\dfrac{ds'}{ds}\right|$,

where $s$, $s'$ denote the H-length and Euclidean length of arc respectively, measured
along $u_n = a$. From (H), we have $|ds'/ds| < \frac{1}{2}$, so that $|d\beta/ds| <
\frac{1}{2}|K|$. Therefore

$|\beta - \beta'| \le \displaystyle\int |d\beta| < \tfrac{1}{2}|K|\sigma$,

and hence we have from (2), since $\sigma \le |n| \cosh a$,

$|\beta - \beta'| < \tfrac{1}{2}|n|\,|\sinh a - \sinh\xi|$,

$|K|$ being replaced by its value given above. Since $-\delta \le a \le \delta$, $\xi \ge 0$ and
$|\alpha - \alpha'| = |\beta - \beta'|$, we find that

(3) $|\alpha - \alpha'| < \dfrac{|n|}{2}(\sinh\delta + \sinh\xi)$,

which is the required upper bound.
The set of H-rays concurring at a point $O$ of $\Phi$ combined with their orthogonal
trajectories form a geodesic polar coördinate system $[r, \psi]$ on $\Phi$. $r$ is the H-length
The set of H-rays concurring at a point O of @ combined with their orthogonal 
trajectories form a geodesic polar coérdinate system [r, ¥] on ®. ris the H-length 










measured along an H-ray from $O$, and $\psi$ is the angle at $O$ measured from a fixed
direction through $O$. In these coördinates (H) becomes⁶

(P) $ds^2 = dr^2 + \sinh^2 r\, d\psi^2$.

The point $O$ is called the pole.

A coördinate line $r =$ const. will be called an H-circle and $O$ is its H-center.
The H-line segments connecting the points of an H-circle to its H-center are its
H-radii. The two halves into which an H-circle is divided by an H-line through
its H-center are H-semicircles. An H-circle is an ordinary Euclidean circle.
However, its Euclidean center coincides with its H-center only when the H-center
lies at the center of $\Psi$.


3. Preliminary lemmas. In this section we prove a number of elementary 
lemmas. 

Lemma 1. If $BC$ and $AD$ are two H-line segments perpendicular to the H-line
segment $CD$ at its end-points, and if the H-line segment $AB$ meets $BC$ at right
angles,⁷ then

(4) $\sinh|AB| = \sinh|CD| \cosh|AD|$,

(5) $\cosh|CD| = \sin\angle BAD\, \cosh|AB|$.

Draw the H-line segment $AC$. (4) follows from (1) and the relation $\cos\angle ACD
= \sin\angle ACB$.

From (1) we have $\cosh|AC| = \cosh|AB| \cosh|BC| = \cosh|AD| \cosh|CD|$,
and in addition, since $\sin\angle ACD = \cos\angle ACB$, there results $\sinh|AD|
= \cosh|AB| \sinh|BC|$. (5) follows when these relations are used
together with (1) and (4) to simplify the expression for $\sin\angle BAD$, obtained
when $\sin\angle BAD$ is expressed in terms of sines and cosines of $\angle CAD$ and
$\angle CAB$.

Let n be a given directed H-line segment and [u,, v,] be the associated Gaus- 
sian geodesic codrdinate system. A, p denote positive numbers and c.& 
are two H-semicircles of H-radius p drawn in the region of ® for which v, 2 0 
about the points [+(A: + p), 0] as H-centers. 

LemMa 2. If C is an H-semicircle of H-radius p drawn in the region v, 2 0 
of & about a point on the segment —(\y + p) S un S A + pofv, = 0 as H-center, 
it is intersected by any H-line | which intersects both Ci, CY. 

First, suppose [0, 0] coincides with the center of ¥. The codrdinate lines 
u, = 0, v, = 0 are then perpendicular diameters of ¥. Consider the H-line t; 





6 See, for example, L. P. Eisenhart, A Treatise on the Differential Geometry of Curves and
Surfaces, New York, 1909, pp. 207-209. Formula (P) above is not given explicitly here,
but may be readily obtained from the formulas to be found on the pages indicated. As a
matter of fact, (P) is a well-known formula for the arc length in non-Euclidean geometry.

7 The H-line segments BC, AD, CD, AB form a figure which is sometimes called the
trirectangular quadrilateral. See, for example, D. M. Y. Sommerville, Non-Euclidean
Geometry, Chicago, 1919, p. 70.




drawn tangent to both C;, Cy. In order to prove the lemma, it is sufficient to 
prove it when | coincides with t;. The H-distance from a point of the segment 
—(A. + p) S uz S A + p Of v, = O to t, is seen from (4) to be a monotone 
increasing function of its H-distance from [0, 0], and as such takes its greatest 
value, namely p, at the end-points of this segment. Since the H-center of C 
lies on this segment, C meets ¢, . 

When [0, 0] does not coincide with the center of VY, an H-transformation 
carrying [0, 0] into the center of V reduces this case to the one above. 

Lemma 3. If C’, C” are two H-semicircles of H-radii p drawn in the region 
v, 2 0 of ® about points on the segment —(\. + p) S un S A + pofv, = Vas 
H-centers, the set L of H-lines intersecting C’, C’’ contains the set L, of H-lines 
intersecting two arbitrarily chosen H-radii of C Rd 

An H-line of LZ, intersects both c. c. According to Lemma 1, it intersects 
both C’, C’’ and consequently belongs to L. 

Lemma 4. The interior of the region of © enclosed by v, = 0, two arbitrary radit 
of Ci, Cy, and the H-line segment connecting their end-points on Cj, Cy cannot 
contain an H-circle of H-radius p/2. 

The proof of this lemma is simple and is omitted. 

Lemma 5. If $C'$, $C''$ are two H-semicircles of H-radius $\rho$ drawn in the region
$v_n \ge 0$ of $\Phi$ about points $[\pm(\lambda + \rho), 0]$ $(\lambda > 0)$ as H-centers, and if $n$ is such that
$v_n = |n|$ is tangent to both $C'$, $C''$, we have $|n|$ bounded by the inequalities

(6) $\rho e^{-(\lambda + \rho)} < |n| < 2e^{-(\lambda + \rho)} \sinh\rho$.

Fixing our attention on one of the H-semicircles, we let $A$ denote its H-center,
$B$ its point of tangency with $v_n = |n|$, $C$ the point $[0, |n|]$, and $D$ the point
$[0, 0]$. According to (4), we have

$\sinh\rho = \sinh|n| \cosh(\lambda + \rho)$,

since $|AB| = \rho$, $|CD| = |n|$, $|AD| = \lambda + \rho$, and the upper bound in (6)
follows at once from this equation, since

$\sinh|n| > |n|$, $\cosh(\lambda + \rho) > \frac{1}{2}e^{\lambda + \rho}$.

To obtain the lower bound in (6), consider the segment of $u_n = \lambda + \rho$ taken
between $v_n = 0$ and $v_n = |n|$. According to (2), the H-length of this segment
is $|n| \cosh(\lambda + \rho)$. Now

$\rho = |AB| < |n| \cosh(\lambda + \rho)$,

since $AB$ is the H-perpendicular let fall from $A$ to $v_n = |n|$. The lower bound
given in (6) is then obtained from this inequality, inasmuch as $|n| \cosh(\lambda + \rho)
< |n| e^{\lambda + \rho}$.
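The two bounds in (6) may be illustrated by solving $\sinh\rho = \sinh|n|\cosh(\lambda + \rho)$ for $|n|$ and comparing. The sketch below is only a numerical spot check under that relation; the sample pairs $(\lambda, \rho)$ are arbitrary and the function names are not from the paper.

```python
import math


def tangent_segment_length(lam, rho):
    """|n| determined by sinh(rho) = sinh|n| * cosh(lam + rho)."""
    return math.asinh(math.sinh(rho) / math.cosh(lam + rho))


def bounds(lam, rho):
    """Lower and upper bounds for |n| as stated in (6)."""
    lower = rho * math.exp(-(lam + rho))
    upper = 2.0 * math.exp(-(lam + rho)) * math.sinh(rho)
    return lower, upper


samples = [(0.1, 0.05), (0.5, 0.5), (1.0, 2.0), (3.0, 0.2), (5.0, 4.0)]
results = []
for lam, rho in samples:
    n = tangent_segment_length(lam, rho)
    lower, upper = bounds(lam, rho)
    results.append(lower < n < upper)
print(results)
```

For large $\lambda + \rho$ the upper bound is nearly attained, which reflects how little is lost in replacing $\cosh(\lambda + \rho)$ by $\frac{1}{2}e^{\lambda + \rho}$.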
Lemma 6. When the H-line segment $AB$ lies in the region

$T_\tau$: $0 \le u_n \le \tau$, $0 \le v_n \le |n|$

of $\Phi$ and $A$ lies on $u_n = 0$, we have

$|AB| < |n| + \tau$.

If $AB$ lies in the region $-\mu \le u_n \le \tau$, $0 \le v_n \le |n|$, with $A$ on $u_n = -\mu$ and
$B$ in $T_\tau$, we have

$|AB| < |n| + \mu + \tau$.

For the first part of the lemma, take $C$ on $u_n = 0$, so that its $v_n$-coördinate
equals that of $B$. Now

$|AB| \le |AC| + |CB| \le |n| + \tau$,

which proves this part of the lemma, inasmuch as the equality signs cannot hold
simultaneously.

To prove the second part of the lemma, we take $C$ as before and again have
$|AB| \le |AC| + |CB|$, and since $|AC| < |n| + \mu$, $|CB| \le \tau$, the second
part of the lemma is proved.

Lemma 7. When $\sinh(|n|/2) < \tanh\delta$, an H-circle of H-radius $\delta$ can be
inscribed in the region $T_\tau$ of Lemma 6 if $\tau \ge \tau_1$, where

(7) $\tau_1 = \delta + \log\dfrac{4\sinh\delta}{|n|}$.

Let $A$ be a point on the positive half of $v_n = |n|/2$, $B$ be the foot of the
H-perpendicular let fall from $A$ to $v_n = |n|$, $C$ be the point $[0, |n|]$, and $D$
be the point $[0, |n|/2]$. An H-circle of H-radius $|AB|$ described about $A$
as an H-center is tangent to both⁸ $v_n = 0$, $v_n = |n|$ and will, in addition, be
tangent to $u_n = 0$ if $|AD| = |AB|$. If, however, $|AD| > |AB|$, the H-circle
lies in the region

(8) $0 < u_n \le |AB| + |AD|$, $0 \le v_n \le |n|$,

of $\Phi$.

From (4) we have

$\sinh|AB| = \sinh(|n|/2) \cosh|AD| < \tanh\delta \cosh|AD|$,

and therefore the condition that $|AD| > |AB|$ is met, when $|AB| = \delta$. By
way of obtaining an upper bound for $|AD|$, we have

$\cosh|AD| = \dfrac{\sinh\delta}{\sinh(|n|/2)}$,

and therefore

$|AD| = \log\left\{\dfrac{\sinh\delta + [\sinh^2\delta - \sinh^2(|n|/2)]^{1/2}}{\sinh(|n|/2)}\right\}$,

so that

$|AD| < \log\dfrac{2\sinh\delta}{\sinh(|n|/2)} < \log\dfrac{4\sinh\delta}{|n|}$.

The upper bound in the first inequality in (8) may then be replaced by $\delta +
\log\dfrac{4\sinh\delta}{|n|}$. This proves the property stated for $\tau_1$ in the lemma.

8 To show that it is tangent to $v_n = 0$, let $B'$ be the foot of the H-perpendicular let fall
from $A$ to $v_n = 0$, $C'$ be the point $[0, 0]$. From (4) we have, since $|C'D| = |CD|$, the
result $\sinh|AB'| = \sinh|C'D| \cosh|AD| = \sinh|CD| \cosh|AD| = \sinh|AB|$.
Hence $|AB'| = |AB|$, which completes the proof.

Lemma 8. The H-length of the mutual H-perpendicular $n$ between two non-intersecting
H-lines $l_1$, $l_2$ drawn tangent to an H-circle $\Gamma$ of H-radius $\gamma$ is given by

(9) $|n| = 2\log\{\sin\theta \cosh\gamma + (\sin^2\theta \cosh^2\gamma - 1)^{1/2}\}$,

where $2\theta$ [arc sin (sech $\gamma$) $< \theta \le \pi/2$] is the angle between the H-radii of $\Gamma$ drawn
to the points of tangency.

Let $A$ be the H-center of $\Gamma$, $B$ be the point of tangency of $\Gamma$ with $l_1$, $C$ the
end-point of $n$ on $l_1$, and $D$ the H-mid-point of $n$. From (5) we have

$\cosh(|n|/2) = \sin\theta \cosh\gamma$,

since $|CD| = |n|/2$, $\angle BAD = \theta$, $|AB| = \gamma$, and (9) follows from this equation.
When $0 \le \theta \le$ arc sin (sech $\gamma$), the two H-lines $l_1$, $l_2$ intersect.
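Formula (9) is the relation $\cosh(|n|/2) = \sin\theta\cosh\gamma$ solved for $|n|$ through the logarithmic form of the inverse hyperbolic cosine. The following minimal sketch evaluates (9) and checks it against that relation; the value $\gamma = 1.2$ and the function name are illustrative assumptions.

```python
import math


def mutual_perpendicular_length(theta, gamma):
    """|n| from (9); requires sin(theta) * cosh(gamma) >= 1."""
    x = math.sin(theta) * math.cosh(gamma)
    return 2.0 * math.log(x + math.sqrt(x * x - 1.0))


gamma = 1.2
theta_min = math.asin(1.0 / math.cosh(gamma))   # arc sin (sech gamma)
theta = 0.5 * (theta_min + math.pi / 2)          # an admissible half-angle
n = mutual_perpendicular_length(theta, gamma)
print(math.cosh(n / 2), math.sin(theta) * math.cosh(gamma))
```

At $\theta = \pi/2$ the two tangents are drawn at diametrically opposite points of the H-circle and (9) reduces to $|n| = 2\gamma$, the H-diameter.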
Lemma 9. Let $a_1$, $a_2$, $b_1$, $b_2$ be four H-lines each making an angle $\theta$ $(-\pi/2 <
\theta < \pi/2)$ with an H-line segment $AB$. Let those H-lines labeled with $a$ intersect
at $A$, and those labeled with $b$ intersect at $B$. Choose $\theta$ so that no line labeled with
$a$ intersects a line labeled with $b$. The four H-distances

$|a_1b_1|$, $|a_1b_2|$, $|a_2b_1|$, $|a_2b_2|$

resolve into two pairs such that the H-distances in each pair are equal, the greater
H-distances occurring for the pair in which the H-lines are placed so that their
mutual H-perpendicular intersects $AB$.

The H-distances from the H-mid-point $O$ of $AB$ to $a_1$, $a_2$, $b_1$, $b_2$, are all equal
to one another, so that these four H-lines are all tangent to the same H-circle
with H-center at $O$. The proof of this lemma is then readily seen to follow from
(9) in Lemma 8.


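4. The domain $D_0$. Let $p$ denote a positive integer greater than unity and
let $\rho$, $\delta$ $(0 < \rho < \delta)$ be defined by⁹

(10) $\cosh\dfrac{\rho}{2} = \cot\dfrac{\pi}{4p}$,

(11) $\cosh\dfrac{\delta}{2} = \cot^2\dfrac{\pi}{4p}$.

In $\Phi$ construct the H-circles

(12) $x^2 + y^2 = \tanh^2\dfrac{\rho}{4}$,

(13) $x^2 + y^2 = \tanh^2\dfrac{\delta}{4}$,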

9 I am indebted to the referee for the elegant form of these equations.










having H-radii p/2 and 6/2 respectively, and beginning with the point on (12) 
where it intersects the positive z-axis, divide the circumference of (12) into 4p 
equal ares. At each of the 4p division points draw an H-line tangent to (12), 
thereby forming a curvilinear polygon” having 4p sides and 4p vertices. We 
denote the interior of this curvilinear polygon by Do and label its sides 


(14) $e_1, e_2, \ldots, e_{4p}$,


$e_1$ being taken as that side of $D_0$ which is tangent to (12) where it cuts the positive
x-axis, and the subscripts increasing when the boundary of $D_0$ is traversed
counter-clockwise. The vertices of $D_0$ are labeled


(15) $V_1, V_2, \ldots, V_{4p}$,


$V_1$ being the vertex at which $e_1$ and $e_2$ concur, and the subscripts increasing
when the boundary of $D_0$ is traversed counter-clockwise. The vertices of $D_0$
all lie on (13). The H-circle (12) is inscribed in $D_0$, which in turn is inscribed
in (13). The H-length of each side of $D_0$ equals $\rho$ and the interior angle at
each vertex equals $\pi/2p$.
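
The statements that the polygon is inscribed in (13), circumscribed about (12), and has sides of H-length rho can be cross-checked against (10) and (11) with standard hyperbolic right-triangle relations. The sketch below is an illustration only; it assumes (10) reads cosh(rho/2) = cot(pi/4p) and (11) reads cosh(delta/2) = cot^2(pi/4p), and the function names are hypothetical.

```python
import math

def polygon_constants(p: int):
    """Return (rho, delta) from the assumed readings of (10) and (11), p >= 2."""
    cot = 1.0 / math.tan(math.pi / (4 * p))
    rho = 2.0 * math.acosh(cot)          # (10): cosh(rho/2) = cot(pi/4p)
    delta = 2.0 * math.acosh(cot * cot)  # (11): cosh(delta/2) = cot^2(pi/4p)
    return rho, delta

def residuals(p: int):
    """Quantities that vanish when the regular 4p-gon data are consistent."""
    rho, delta = polygon_constants(p)
    # Right triangle: H-center, mid-point of a side, vertex.
    # Legs rho/2 (inradius) and rho/2 (half side), hypotenuse delta/2.
    pythagoras = math.cosh(delta / 2) - math.cosh(rho / 2) ** 2
    # Angle pi/4p at the H-center, opposite the half side.
    central = math.tan(math.pi / (4 * p)) - math.tanh(rho / 2) / math.sinh(rho / 2)
    # Squared Euclidean radius of (13) reduces to cos(pi/2p).
    radius = math.tanh(delta / 4) ** 2 - math.cos(math.pi / (2 * p))
    return pythagoras, central, radius

table = {p: residuals(p) for p in range(2, 8)}
```

All three residuals are zero up to rounding for each p tried, which is consistent with the side length rho and the circumradius delta/2 stated in the text.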

$V_i$ and $V_{i-1}$ ($V_0 = V_{4p}$) each divide the side $e_i$ of $D_0$ produced into two H-rays.
The H-ray ending on $V_i$ ($V_{i-1}$) which does not contain the side $e_i$ of $D_0$ is denoted
by $e_i'$ ($e_i''$). It is assumed that $V_i$ ($V_{i-1}$) is a point of $e_i'$ ($e_i''$). The region
of $\Phi$ lying outside of (13) is divided into two subregions by $e_{i+1}''$, $e_{i-1}'$ ($e_{4p+1} = e_1$,
$e_0 = e_{4p}$). The subregion bounded on $\Psi$ by the shorter (Euclidean) arc when
taken with its boundary is denoted by $E_i$. The region of $\Phi$ lying outside of
(13) and between $E_i$ and $E_{i+1}$ ($E_{4p+1} = E_1$) at $V_i$, when taken with its boundary
is denoted by $\bar{E}_i$. With respect to the region $E_i$, we prove the following lemma.

Lemma 10. The boundaries $e_{i+1}''$, $e_{i-1}'$ of $E_i$ cannot be joined by an H-line segment
lying in $E_i$.

The proof of this lemma is elementary, but because of the importance of the
lemma in subsequent arguments the proof is given in some detail. From symmetry
considerations it is clear that we need consider only the case i = 1.

In the x, y-coördinate system the equation of the circle containing the side
$e_2$ of $D_0$ as a segment is


$x^2 + y^2 - 2\sqrt{\cos\frac{\pi}{2p}}\,\cos\frac{\pi}{4p}\,x - 2\cos\frac{\pi}{4p}\sqrt{\sec\frac{\pi}{2p}}\,\sin\frac{\pi}{2p}\,y + 1 = 0$.

The minimum value of y on this circle is calculated to be $\sin\frac{\pi}{4p}\sqrt{\cos\frac{\pi}{2p}}$,
occurring, as a matter of fact, at $V_1$. Consider the H-line l drawn tangent to
(13) at the point where it cuts the positive x-axis. l is a segment of the circle
whose equation is


$x^2 + y^2 - \sqrt{\sec\frac{\pi}{2p}}\left(1 + \cos\frac{\pi}{2p}\right)x + 1 = 0$,


10 That the H-lines so constructed intersect in pairs is well known. This fact may
also be drawn from (10) in connection with the remark at the end of Lemma 8 [Referee].






















ERGODIC FUNCTION OF BIRKHOFF 257 


and lies exterior to (13). The maximum value of y on l is $\tan^2\frac{\pi}{4p}$ and it is
readily seen that


$\tan^2\frac{\pi}{4p} < \sin\frac{\pi}{4p}\sqrt{\cos\frac{\pi}{2p}}$  (p = 2, 3, ...),

so that l cannot intersect the side $e_2$ of $D_0$ produced, the situation being analogous
in regard to the side $e_{4p}$ of $D_0$. l therefore lies in $E_1$.
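
The inequality described as "readily seen" can be spot-checked numerically. This is a sketch under an assumed reading of the two extreme values: tan^2(pi/4p) for the largest y on l, and sin(pi/4p) sqrt(cos(pi/2p)) for the smallest y on the circle carrying the side e_2. The function names are hypothetical.

```python
import math

def max_y_on_l(p: int) -> float:
    """Largest Euclidean ordinate on the tangent H-line l (assumed reading)."""
    return math.tan(math.pi / (4 * p)) ** 2

def min_y_on_e2_circle(p: int) -> float:
    """Smallest ordinate on the circle carrying the side e_2, attained at V_1."""
    return math.sin(math.pi / (4 * p)) * math.sqrt(math.cos(math.pi / (2 * p)))

# Positive margin means l stays strictly below the circle carrying e_2.
margins = {p: min_y_on_e2_circle(p) - max_y_on_l(p) for p in range(2, 51)}
```

A finite scan is of course not a proof; for large p both sides behave like pi/4p and (pi/4p)^2 respectively, which is why the margin stays positive.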

Suppose an H-line segment lies in $E_1$ and connects $e_2$, $e_{4p}$. Since l connects
(13) to $\Psi$, this H-line segment meets l. It cannot, however, be a segment of l
and therefore when produced intersects $\Psi$ in two points, one in each of the arcs
into which $\Psi$ is divided by the end-points of l, so that it cannot intersect both
$e_2$, $e_{4p}$, which is a contradiction. The lemma is therefore true.


5. The group G and the net N. When certain of the points on the boundary
of $D_0$ are adjoined to $D_0$, a fundamental domain $\bar{D}_0$ is obtained for a well-known¹¹
Fuchsian group G. Transformations of G which take the sides of $D_0$ into one
another in the following manner:


$e_1 \to e_3$, $e_4 \to e_2$; $e_5 \to e_7$, $e_8 \to e_6$; ...; $e_{4p-3} \to e_{4p-1}$, $e_{4p} \to e_{4p-2}$,


form a set of generators for G. The transformations of G are all H-transformations.
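
The pairing pattern of the generators can be written out for any p. The following sketch only enumerates the index pairs as read from the display above (sides grouped in blocks of four, with the first sent to the third and the fourth to the second); the function name is hypothetical and nothing here constructs the transformations themselves.

```python
def side_pairings(p: int):
    """List the (source, target) side indices of the 2p generators of G."""
    pairs = []
    for block in range(p):
        base = 4 * block
        pairs.append((base + 1, base + 3))  # e_{4j+1} -> e_{4j+3}
        pairs.append((base + 4, base + 2))  # e_{4j+4} -> e_{4j+2}
    return pairs

pairs_p2 = side_pairings(2)
# Every one of the 4p side labels should occur in exactly one pair.
labels_p3 = sorted(i for pair in side_pairings(3) for i in pair)
```

For p = 2 this reproduces the four pairs displayed in the text, and for any p each side of the polygon is used exactly once, as a side pairing requires.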

Two point sets in $\Phi$, or two sets of directed H-lines, or two sets of directed
H-line segments which can be transformed into one another by transformations
of G are congruent, or copies of one another; two symbols forming a pair above
are congruent symbols. The copies of $D_0$ are meshes and their totality covers
$\Phi$ without lacunae. The copies of the sides (vertices) of $D_0$ are the sides (vertices)
of the meshes. Two meshes with a side in common border upon the common
side. Those meshes which border $D_0$ upon $e_1, e_2, \ldots, e_{4p}$ are denoted by
$D_1, D_2, \ldots, D_{4p}$ respectively. Two vertices which are end-points of the same
side of a mesh are adjacent vertices. Two sides of a mesh concurring at a vertex
of the mesh are adjacent sides. Two sides of a mesh which are not adjacent
sides are non-adjacent sides. Two non-adjacent sides of a mesh adjacent to
the same side of the mesh are alternating sides. Alternating sides, as well as
adjacent sides, always belong to the same mesh. When two meshes border
upon a common side, this common side is adjacent to two pairs of alternating
sides, one pair being found in each mesh. If a side be selected from each of these
pairs of alternating sides so that the two sides selected do not concur, the two
sides thus selected are opposite sides. Opposite sides do not belong to the same
mesh but belong to meshes which have a side in common. Neither non-adjacent
sides nor opposite sides intersect, even when produced.

The H-length of each side of a mesh equals $\rho$. Any given mesh is inscribed


11 See, for example, M. Morse, Trans. Amer. Math. Soc., vol. 26 (1924), pp. 25-32. This
paper will be referred to as Morse II. See also J. Nielsen, Acta Math., vol. 50 (1927),
pp. 191-224.













in an H-circle of H-radius $\delta/2$ and its 4p sides are all tangent to an H-circle of
H-radius $\rho/2$. The interior angle at each vertex of the mesh equals $\pi/2p$.

The sides of the meshes form a network N of H-line segments. The end- 
points of the sides in N are the vertices of N. The sides in N align themselves 
into H-lines, 2p of these H-lines concurring at each vertex of N. A vertex of N 
therefore serves as a vertex for 4p meshes. We label each side of a mesh with
the label assigned in (14) to that side of $D_0$ into which it is carried by the
transformation of G carrying the mesh into $D_0$. To each side of a mesh there are then
assigned two of the symbols in (14), one coming from each of the meshes bordering
upon the side in question. N, together with the labeling of its sides, is
transformed into itself by a transformation of G.

The two points in which the side $e_i$ of $D_0$ produced intersects $\Psi$ are the base-points
of $e_i$. The base-points of $e_i$ divide $\Psi$ into two arcs of unequal Euclidean
length and $e_i$ subtends the shorter arc (end-points included). Let $[e_i]$ denote
the arc on $\Psi$ which $e_i$ subtends. Adjacent sides of $D_0$ subtend overlapping
arcs on $\Psi$ and the part of $[e_i]$ remaining when the overlapping parts are removed
is denoted by $[e_i]_r$. Obviously $[e_i]_r$ and $[e_j]_r$ have no common point,
unless i = j. The transforms of the arcs $[e_i]$, $[e_i]_r$ (i = 1, 2, ..., 4p) by the
transformation which carries $D_0$ into one of $D_1, D_2, \ldots, D_{4p}$, say $D_k$, are
denoted by $[e_i]^{(k)}$, $[e_i]_r^{(k)}$ respectively. If we follow the conventions used above,
the side $e_i$ of $D_k$ subtends the arc $[e_i]^{(k)}$ on $\Psi$ and the end-points of $[e_i]^{(k)}$ are the
base-points of the side $e_i$ of $D_k$.

We now prove four lemmas. 

Lemma 11. The H-length of an H-line segment s whose end-points lie on two
non-adjacent sides, or on two opposite sides, cannot fall below the constant $\kappa$, where


(16) $\kappa = 2\log\left\{2\cos^2\frac{\pi}{4p} + \left(4\cos^4\frac{\pi}{4p} - 1\right)^{1/2}\right\}$,


and is actually greater than $\kappa$ in the latter case.

First, suppose the end-points of s rest upon two non-adjacent sides. $|s|$
cannot fall below the H-distance between the two non-adjacent sides produced.

If we identify these two non-adjacent sides produced as $l_1$, $l_2$ in Lemma 8 and
the H-circle of H-radius $\rho/2$ to which they are drawn tangent as $\Gamma$, the H-distance
between two non-adjacent sides produced is readily seen to be least when the
two non-adjacent sides are alternating sides, inasmuch as $|n|$ in (9) is a monotone
increasing function of $\theta$ in the given interval. Let $\kappa$ denote the H-distance
between two alternating sides produced. We have $|s| \ge \kappa$, and in order to
obtain the value of $\kappa$ given in (16), we place $\theta = \pi/2p$, $\gamma = \rho/2$ in (9), $\rho$ being
taken as given in (10).
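
The substitution that produces (16) from (9) can be verified numerically. The sketch below assumes the readings theta = pi/2p and H-radius rho/2 with cosh(rho/2) = cot(pi/4p); the function names are hypothetical and the check is illustrative rather than a derivation.

```python
import math

def kappa_closed_form(p: int) -> float:
    """The constant of (16): 2 log(2cos^2(pi/4p) + sqrt(4cos^4(pi/4p) - 1))."""
    c = 2.0 * math.cos(math.pi / (4 * p)) ** 2
    return 2.0 * math.log(c + math.sqrt(c * c - 1.0))

def kappa_from_lemma_8(p: int) -> float:
    """Formula (9) evaluated at theta = pi/2p and H-radius rho/2."""
    theta = math.pi / (2 * p)
    half_rho = math.acosh(1.0 / math.tan(math.pi / (4 * p)))  # (10)
    c = math.sin(theta) * math.cosh(half_rho)
    return 2.0 * math.log(c + math.sqrt(c * c - 1.0))

differences = [kappa_closed_form(p) - kappa_from_lemma_8(p) for p in range(2, 12)]
```

The agreement rests on the identity sin(pi/2p) cot(pi/4p) = 2 cos^2(pi/4p), which is the only step hidden in "we place ... in (9)".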

Second, suppose the end-points of s rest upon two opposite sides. According 
to the definition of opposite sides given above, the end-points of s lie upon two 
sides of N which do not belong to the same mesh but to two meshes bordering 
on a side, say AB of N. One of the opposite sides emanates from A and the 
other from B, the angles measured from AB to the sides being simultaneously 



















$\pi/2p$ or $-\pi/2p$. The two opposite sides produced may now be interpreted
as a pair selected from the four H-lines $a_1$, $a_2$, $b_1$, $b_2$ in Lemma 9; one of the
opposite sides produced will be labeled with a, the other with b. From Lemma 9
it follows that the distance between two opposite sides produced exceeds the
distance between two alternating sides produced. Hence when the end-points
of s rest on two opposite sides, $|s|$ exceeds $\kappa$.

Lemma 12. If k denotes any fixed positive integer taken from 1, 2, ..., 4p,
exactly 4p − 3 sides of the mesh $D_k$ have both their base-points in the interior of
$[e_k]_r$. If these 4p − 3 sides are taken in their circular order around $D_k$, consecutive
sides are adjacent, except two, between which the three remaining sides of $D_k$
intervene.

For simplicity we shall prove this lemma for p = 2, the proof permitting an 
obvious extension to any value of p. In addition we may restrict ourselves 
with no loss in generality to the case k = 1. The meshes $D_0$ and $D_1$ border
along a side of N which when reckoned to $D_0$ ($D_1$) is labeled $e_1$ ($e_3$). If we proceed
from this side in a counter-clockwise fashion around the periphery of $D_1$, the
labels of the sides of $D_1$ when reckoned to $D_1$ are met in order $e_3$, $e_4$, $e_5$, $e_6$, $e_7$, $e_8$,
$e_1$, $e_2$, $e_3$.

We now prove that the sides $e_5$, $e_6$, $e_7$, $e_8$, $e_1$ of $D_1$ have both base-points in
the interior of $[e_1]_r$.

Let us begin with $e_5$. Suppose both base-points of $e_5$ do not lie in $[e_1]_r$.
Since the sides $e_3$, $e_5$ of $D_1$ when produced cannot intersect, the only way this
can occur is for the side $e_5$ of $D_1$ produced to meet the side $e_8$ of $D_0$ produced.
On interpreting the H-line obtained by producing the side $e_4$ of $D_1$ as a transversal
which cuts across the two H-lines obtained by producing the side $e_8$ of
$D_0$ and the side $e_5$ of $D_1$, we easily see from elementary non-Euclidean geometry
that the latter two H-lines cannot intersect. This follows from the fact that
the sum of the interior angles lying exterior to $D_1$ is for any p equal to
$\pi(2 - 3/2p) \ge \pi(2 - 3/4) > \pi$.

In like manner it may be shown that the base-points of the side $e_1$ of $D_1$ lie
in $[e_1]_r$.

The truth of the statement for the remaining sides $e_6$, $e_7$, $e_8$ of $D_1$ is now
obvious.

Let $\Omega$ denote an arc on $\Psi$ which is the sum of 4p − 4 consecutive $[e_i]$'s and let
$\Lambda = \Psi - \Omega$. When $D_0$ is transformed into a given $D_i$ (i = 1, 2, ..., 4p) by a
transformation of G, the transform of $\Lambda$ under this transformation will be
denoted by $\Lambda^{(i)}$.

Lemma 13. There are at least two different values of i such that $[e_i]_r \supset \Lambda^{(i)}$.

In view of the symmetry of $D_0$, it is sufficient to prove the lemma when


$\Omega = [e_1] + [e_2] + \cdots + [e_{4p-4}] = \sum_{\nu=1}^{4p-4} [e_\nu]$.


Here $\Lambda \subset [e_{4p-3}] + [e_{4p-2}] + [e_{4p-1}] + [e_{4p}]$, so that $[e_i]_r \supset \Lambda^{(i)}$ whenever each of
the sides $e_{4p-3}$, $e_{4p-2}$, $e_{4p-1}$, $e_{4p}$ of $D_i$ has both base-points in $[e_i]_r$.
















According to Lemma 12, 4p − 3 consecutive sides of $D_i$ have both base-points
in $[e_i]_r$. From these 4p − 3 sides 4p − 6 combinations of four consecutive
sides each can be formed. In all the meshes $D_1, D_2, \ldots, D_{4p}$ there are
4p(4p − 6) such combinations, not all different to be sure, but any one combination
occurring as often as any other. Since 4p different combinations of
four consecutive sides are possible, any one combination such as $e_{4p-3}$, $e_{4p-2}$,
$e_{4p-1}$, $e_{4p}$ occurs 4p(4p − 6)/4p = 4p − 6 times among the 4p(4p − 6) combinations.
Now a given combination of four consecutive sides occurs only
once in a given $D_i$, so that the combination $e_{4p-3}$, $e_{4p-2}$, $e_{4p-1}$, $e_{4p}$ occurs in the
desired manner in 4p − 6 different $D_i$'s. The number of values of i such that
$[e_i]_r \supset \Lambda^{(i)}$ is at least $4p - 6 \ge 2$, since $p \ge 2$.
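
The count of 4p − 6 combinations of four consecutive sides inside a run of 4p − 3 consecutive sides can be confirmed by direct enumeration. This is an illustrative sketch; positions around the mesh are numbered 0 to 4p − 1, the favourable run is taken to be positions 0 to 4p − 4, and the function name is hypothetical.

```python
def windows_inside_run(p: int) -> int:
    """Count blocks of 4 cyclically consecutive sides lying in a run of 4p-3 sides."""
    n_sides = 4 * p
    run = set(range(4 * p - 3))  # the 4p-3 consecutive favourable positions
    count = 0
    for start in range(n_sides):
        window = [(start + j) % n_sides for j in range(4)]
        if all(pos in run for pos in window):
            count += 1
    return count

counts = {p: windows_inside_run(p) for p in range(2, 8)}
```

For p = 2 the run has five sides and admits exactly two such blocks, which is the minimal case used at the end of the proof.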

Lemma 14. When $\sinh(|n|/2) < \tanh\delta$ and if $r = r_n$, where $r_n$ is given in
(7), the region $T_n$ in Lemma 6 contains at least one entire mesh, together with the
H-circle in which it is inscribed.

A region of $\Phi$ comprising the interior and boundary of an H-circle of H-radius
$\delta$ drawn about an arbitrary point of $\Phi$ as H-center contains at least one entire
mesh together with the H-circle in which it is inscribed.¹² In view of this fact,
and Lemma 7, this lemma is obvious.


6. Admissible sequences. Let A, B be two points of $\Phi$, not lying on N, such
that the H-line segment AB connecting them intersects k sides of N without
passing through a vertex of N. The directed H-line segment AB generates
a sequence of the e's in the following way: beginning at A, we proceed along
AB until we meet a side in N labeled $a_1$, say, when the side is reckoned to the
mesh we are just leaving; after leaving this mesh, we continue along AB until
we meet a second side of N labeled, say, $a_2$, when the side is reckoned to the
mesh we are just leaving; etc. The process ends when B is reached and yields
a certain sequence


(17) $a_1 a_2 \cdots a_{k-1} a_k$


of the e's which is the admissible sequence of AB. The admissible sequence of
the oppositely directed H-line segment BA is then


$a_k' a_{k-1}' \cdots a_2' a_1'$,


where $a_k', a_{k-1}', \ldots, a_2', a_1'$ designate symbols congruent to $a_k, a_{k-1}, \ldots, a_2, a_1$
respectively.

If AB is a segment of $A_1B_1$, and if they meet N at the same points, their
admissible sequences are identical. Here neither AB nor $A_1B_1$ is supposed to
meet a vertex of N or to end upon N. The admissible sequence of AB where
A and B are points of N but not vertices of N will be defined to be the admissible
sequence of $A_1B_1$, where $A_1$, $B_1$ are two points not lying on N, such that $A_1B_1$




12 Let P denote an arbitrary point of $\Phi$. P necessarily lies in at least one mesh, which
in turn is inscribed in an H-circle $\Gamma$ of H-radius $\delta/2$. An H-circle of H-radius $\delta$ taken about
P as an H-center will contain $\Gamma$.



















contains AB as a segment, and meets N only at its intersection points 
with AB. 

For brevity we denote the side in N which gives rise to $a_i$ (i = 1, 2, ..., k)
in (17) by $\bar{a}_i$. The segment of AB comprehended between two sides $\bar{a}_{i_1}$, $\bar{a}_{i_2}$
($i_1 < i_2$) of N is conveniently denoted by $[\bar{a}_{i_1} \bar{a}_{i_2}]$.

We now prove three lemmas related to admissible sequences. 

Lemma 15. If A, B are two points on N, not vertices of N, such that AB does
not pass through a vertex of N, and if the admissible sequence of AB contains
2pr + 1 symbols, where r is a positive integer, $|AB|$ is bounded from below by


(18) $r\kappa \le |AB|$,

where $\kappa$ is defined in (16).

Let

(19) $a_1 a_2 \cdots a_{2p} a_{2p+1} \cdots a_{2p\nu+1} a_{2p\nu+2} \cdots a_{2p(\nu+1)} a_{2p(\nu+1)+1} \cdots a_{2pr+1}$


be the admissible sequence of AB and pick out from it the subsequences

$a_{2p\nu+1} a_{2p\nu+2} \cdots a_{2p(\nu+1)} a_{2p(\nu+1)+1}$  ($\nu = 0, 1, \ldots, r-1$)


each containing 2p + 1 symbols. The proof of the lemma is achieved by showing
that none of the H-lengths


$|[\bar{a}_{2p\nu+1} \bar{a}_{2p(\nu+1)+1}]|$  ($\nu = 0, 1, \ldots, r-1$),


can fall below $\kappa$. Let us prove this for $\nu = 0$, the method of proof being the
same for the remaining values of $\nu$. Consider the subsequence $a_1 a_2 \cdots a_{2p} a_{2p+1}$.
The end-points of $[\bar{a}_1 \bar{a}_{2p+1}]$ lie on N but are not vertices of N.

If at least one of the 2p segments into which $[\bar{a}_1 \bar{a}_{2p+1}]$ is divided by the 2p
meshes through which it passes has its end-points on non-adjacent sides, we
have $|[\bar{a}_1 \bar{a}_{2p+1}]| \ge \kappa$ by virtue of Lemma 11.

If, however, each of these 2p segments has its end-points on adjacent sides,
we proceed as follows: We first note that two sides in N from which two consecutive
symbols in $a_1 a_2 \cdots a_{2p} a_{2p+1}$ arise meet at a vertex of N. Let $V^{(1)}, V^{(2)}, \ldots, V^{(2p)}$
denote the vertices (some of which may coincide) where the sides $\bar{a}_1$, $\bar{a}_2$;
$\bar{a}_2$, $\bar{a}_3$; ...; $\bar{a}_{2p}$, $\bar{a}_{2p+1}$ of N respectively concur. At least two of these vertices
do not coincide. For suppose they all coincide. The 2p meshes through which
$[\bar{a}_1 \bar{a}_{2p+1}]$ passes then possess $V^{(1)}$ as a vertex and their interior angles at $V^{(1)}$
would exhaust a straight angle at $V^{(1)}$. $[\bar{a}_1 \bar{a}_{2p+1}]$ intersects both sides of this
straight angle, and therefore intersects an H-line passing through $V^{(1)}$ in two
distinct points, thus coinciding with a segment of this H-line. This requires
that $[\bar{a}_1 \bar{a}_{2p+1}]$ pass through a vertex of N, contrary to our assumption for AB.
Hence all the vertices cannot coincide. Let $V^{(j)}$ ($1 < j \le 2p$) be the first
vertex which does not coincide with $V^{(1)}$. $V^{(j)}$ and $V^{(1)}$ are adjacent vertices
and are the end-points of the side $\bar{a}_j$ of N. Now consider the H-line segment
$[\bar{a}_{j-1} \bar{a}_{j+1}]$. The end-points of this H-line segment lie on opposite sides¹³ and




13 For the definition of opposite sides see §5, or the proof of the second part of Lemma 11.










therefore from Lemma 11, we have $|[\bar{a}_{j-1} \bar{a}_{j+1}]| > \kappa$, from which it follows
a fortiori that $|[\bar{a}_1 \bar{a}_{2p+1}]| > \kappa$.

Lemma 16. If $A_0B_0$, $A_1B_1$ are two directed H-line segments such that $A_0$, $A_1$
lie on the same side in N, and $B_0$, $B_1$ lie on the same side in N, and if neither of
$A_0B_0$, $A_1B_1$ meets a vertex of N, their admissible sequences contain the same number
of symbols.

Let a (b) denote the side in N on which rest $A_0$, $A_1$ ($B_0$, $B_1$) and let $\alpha$ ($\beta$) denote
the H-distance along a (b) measured from one of the end-points [arbitrarily
fixed] of a (b). A directed H-line segment with $\alpha$-end-point on a and $\omega$-end-point
on b determines a pair $\alpha$, $\beta$ uniquely, and vice versa. Let $\alpha_0$, $\beta_0$ be the pair
determined by $A_0B_0$ and $\alpha_1$, $\beta_1$ the pair determined by $A_1B_1$. The directed
H-line segment $A_tB_t$ determined by the pair $\alpha_t$, $\beta_t$, where


$\alpha_t = \alpha_0 + t(\alpha_1 - \alpha_0)$, $\beta_t = \beta_0 + t(\beta_1 - \beta_0)$  ($0 \le t \le 1$),


is deformed from $A_0B_0$ into $A_1B_1$ when t ranges from 0 to 1, and its end-points
$A_t$, $B_t$ are displaced along a, b respectively, without meeting a vertex of N.
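
The deformation used in this proof is a straight-line interpolation of the two end-point parameters. As a minimal sketch (the function name and the sample values are hypothetical, and the parameters stand for H-distances measured along the sides a and b):

```python
def deform(alpha0: float, beta0: float, alpha1: float, beta1: float, t: float):
    """End-point parameters (alpha_t, beta_t) of the deformed segment, 0 <= t <= 1."""
    if not 0.0 <= t <= 1.0:
        raise ValueError("t must lie in [0, 1]")
    alpha_t = alpha0 + t * (alpha1 - alpha0)
    beta_t = beta0 + t * (beta1 - beta0)
    return alpha_t, beta_t

start = deform(0.2, 0.9, 0.7, 0.1, 0.0)
middle = deform(0.2, 0.9, 0.7, 0.1, 0.5)
end = deform(0.2, 0.9, 0.7, 0.1, 1.0)
```

Because each parameter moves monotonically between two values lying strictly inside its side, the end-points never reach an end of a or b, which is the property the proof relies on.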

Suppose $A_0B_0$ intersects N in k points. None of these points is a vertex of
N and therefore $A_0B_0$ intersects k H-lines in N. Let P denote an intersection
point of $A_0B_0$ with one of these k H-lines, say with the H-line l. During the
deformation of $A_tB_t$ from $A_0B_0$ to $A_1B_1$ the point P moves along both $A_tB_t$ and l.
At no time can P pass off the end-points of l, since they rest on $\Psi$. Likewise
it cannot pass off the end-points of $A_tB_t$, since this would require that either
$A_t$ or $B_t$ coincide with a vertex of N, which is contrary to hypothesis. Conceivably
the point P might be lost by $A_tB_t$ and l becoming tangent to each other.
This, however, cannot occur for it would imply that $A_tB_t$ is contained in l
and therefore that $A_t$, $B_t$ are vertices of N. At the conclusion of the deformation,
$A_1B_1$ therefore intersects N in at least k points, none of which can coincide,
since $A_1B_1$ does not meet a vertex of N. On interchanging the rôles of $A_0B_0$ and
$A_1B_1$ in the deformation, we see that $A_0B_0$ and $A_1B_1$ intersect N in the same
number of points. The number of symbols in each admissible sequence is
therefore equal to k.

Lemma 17. The H-length of an H-line segment $A_1B_1$ which terminates on the
same sides of N as does the H-line segment AB in Lemma 15 cannot fall below $r\kappa$.

If $A_1B_1$ does not pass through a vertex of N, according to Lemma 16 the
number of symbols in its admissible sequence is the same as the number of
symbols in the admissible sequence of AB. Hence, from Lemma 15, $|A_1B_1| \ge r\kappa$.

This result holds, even if $A_1B_1$ passes through a vertex of N. Note that there
is a one-to-one correspondence between the points of the square $0 \le \alpha \le \rho$,
$0 \le \beta \le \rho$ in the $\alpha$, $\beta$-plane (where $\alpha$, $\beta$ are taken as introduced in the proof of
Lemma 16) and the directed H-line segments $A_1B_1$ which have an end-point
upon each of the above sides. The points in this square which correspond to
H-line segments passing through a vertex of N are limit points of the set of
points in the square corresponding to H-line segments which do not pass through
















a vertex of N. Now the H-length of $A_1B_1$ is a continuous function of the arguments
$\alpha_1$, $\beta_1$, so that the H-length of $A_1B_1$ in any position in which it passes
through a vertex of N cannot fall below $r\kappa$.


7. Orthogonal families of directed H-lines. Let n' be an arbitrarily given
directed H-line segment. Following §2 we adopt the coördinate system
$[u_{n'}, v_{n'}]$ on $\Phi$ associated with n'. The set of H-lines $v_{n'} = c$ ($0 \le c \le |n'|$) directed
from $u_{n'} = -\infty$ to $u_{n'} = +\infty$ is an orthogonal family of directed H-lines and is
denoted by $F_{n'}$, the orthogonal family of oppositely directed H-lines being
denoted by $\bar{F}_{n'}$. $F_{n'}$, $\bar{F}_{n'}$ are based on n'. The arc on $\Psi$ occupied by the
$\alpha$ ($\omega$)-end-points of the directed H-lines in $F_{n'}$ is the $\alpha$-arc ($\omega$-arc) of $F_{n'}$
and is denoted by $\alpha_{n'}$ ($\omega_{n'}$). Let $a_{n'}$, $b_{n'}$ denote the two directed H-lines of $F_{n'}$
drawn through the end-points of n', choosing the notation so that when $\omega_{n'}$ is
traversed counter-clockwise it leads from $a_{n'}$ to $b_{n'}$. According to Lemma 14,
the region $T_{n'}$: $0 < u_{n'} \le r_{n'}$, $0 \le v_{n'} \le |n'|$, of $\Phi$, where $\sinh(|n'|/2) < \tanh\delta$,
$r_{n'} = \delta + \log(4\sinh\delta/|n'|)$, contains at least one entire mesh together
with the H-circle in which it is inscribed, and may therefore be transformed by
a transformation of G into a region of $\Phi$ which contains (13). The transforms
of n', $F_{n'}$, $\alpha_{n'}$, $\omega_{n'}$, $a_{n'}$, $b_{n'}$, $T_{n'}$ by this transformation are indicated by
dropping the primes. Note that although either, or both, of $a_n$, $b_n$ can be
tangent to (13), neither intersects the interior of (13), and that since n' lies
without $T_{n'}$, n lies without (13).

The transforms of n, $F_n$, $\alpha_n$, $\omega_n$, $a_n$, $b_n$, $T_n$ by the transformation carrying
$D_0$ into a stated $D_i$ (i = 1, 2, ..., 4p) are indicated by adding the superscript
(i) to n; thus $T_{n^{(i)}}$ is the transform of $T_n$. Note that we have


(20) $T_{n^{(i)}} \supset D_i$.


We now prove three lemmas. 

Lemma 18. $\omega_n$ contains at least 4p − 4 consecutive $[e_i]$'s in its interior.

For reasons of symmetry, it is sufficient to prove the lemma in the case
when the $\omega$-end-point of $a_n$ lies in $[e_{4p}]$ but not in $[e_1]$; and we restrict our attention
to this case.

From Lemma 10 it is seen that $a_n$ lies in the region $E_{4p-1} + \bar{E}_{4p-1} + E_{4p}$.
Suppose the lemma were false. The $\omega$-end-point of $b_n$ would have to lie in
one of the arcs $[e_{4p-4}]$, $[e_{4p-5}]$, ..., $[e_1]$. Let us assume that the $\omega$-end-point of
$b_n$ lies in $[e_{4p-4}]$. From Lemma 10 it follows that $b_n$ lies in the region
$\bar{E}_{4p-5} + E_{4p-4} + \bar{E}_{4p-4} + E_{4p-3}$. The H-line segment n which joins $a_n$ and $b_n$ lies outside of
(13) and is therefore required to intersect the side $e_{4p-2}$ of $D_0$ produced in two
distinct points. This would necessitate that n contain the side $e_{4p-2}$ of $D_0$ as a
segment, which is impossible, since n does not intersect (13). In like manner,
a contradiction may be reached in assuming the $\omega$-end-point of $b_n$ to lie in any
one of $[e_{4p-5}]$, $[e_{4p-6}]$, ..., $[e_1]$. The lemma is therefore true.

Lemma 19. There are at least two different values of i such that $[e_i]_r$ contains
$\alpha_{n^{(i)}}$ together with the end-points of $\omega_{n^{(i)}}$.
















According to Lemma 18, $\omega_n$ contains an arc $\Omega$ in its interior, $\alpha_n$ and the end-points
of $\omega_n$ being contained in $\Psi - \Omega = \Lambda$. Hence $\alpha_{n^{(i)}}$ and the end-points of
$\omega_{n^{(i)}}$ lie in $\Lambda^{(i)}$, which in turn, from Lemma 13, is contained in $[e_i]_r$, for at least
two different values of i.

Let $F_{n_1'}$ denote an orthogonal family of directed H-lines which may, or may
not, be identical with $F_{n'}$. We suppose $\sinh(|n_1'|/2) < \tanh\delta$, so that the
region


$T'_{n_1'}$: $-r_{n_1'} \le u_{n_1'} < 0$, $0 \le v_{n_1'} \le |n_1'|$,


where $r_{n_1'}$ is defined by (7), contains at least one entire mesh together with the
H-circle in which it is inscribed. We introduce


$n_1^{(i)}$, $F_{n_1^{(i)}}$, $\alpha_{n_1^{(i)}}$, $\omega_{n_1^{(i)}}$, $a_{n_1^{(i)}}$, $b_{n_1^{(i)}}$, $T'_{n_1^{(i)}}$  (i = 1, 2, ..., 4p),


which are defined like $n^{(i)}$, $F_{n^{(i)}}$, $\alpha_{n^{(i)}}$, $\omega_{n^{(i)}}$, $a_{n^{(i)}}$, $b_{n^{(i)}}$, $T_{n^{(i)}}$, the rôle of the
region $T_{n'}$ above being taken now by $T'_{n_1'}$, so that in place of (20) we have


(21) $T'_{n_1^{(i)}} \supset D_i$  (i = 1, 2, ..., 4p).


The following companion lemma to Lemma 19 is then evident if we apply
Lemma 19 to $\bar{F}_{n_1'}$.

Lemma 20. There are at least two different values of i such that $[e_i]_r$ contains
$\omega_{n_1^{(i)}}$, together with the end-points of $\alpha_{n_1^{(i)}}$.


8. $\delta$-sets of directed H-line segments. Let n' be an arbitrarily given directed
H-line segment and let $[u_{n'}, v_{n'}]$ be the coördinate system on $\Phi$ used in §7 to
define $F_{n'}$. A $\delta$-set of directed H-line segments $\Delta_{n'}$, or briefly, a $\delta$-set $\Delta_{n'}$, is
defined to be the totality of directed H-line segments having their $\alpha$ ($\omega$)-end-points
on the segment $0 \le v_{n'} \le |n'|$ of $u_{n'} = -\delta$ ($u_{n'} = \delta$). $\Delta_{n'}$ is based
on n', and $\Delta_{n'}$, $F_{n'}$ are said to be associated.

We now prove the following lemma. 

Lemma 21. Let $\Delta_{n'}$ [$\Delta_{n_1'}$] be a $\delta$-set based on the directed H-line segment n' [$n_1'$],
where $\sinh(|n'|/2) < \tanh\delta$ [$\sinh(|n_1'|/2) < \tanh\delta$]. Let $F_{n'}$ [$F_{n_1'}$] be the
orthogonal family of directed H-lines based on n' [$n_1'$] and associated with $\Delta_{n'}$ [$\Delta_{n_1'}$].
Let $\Delta_{n^{(i)}}$ [$\Delta_{n_1^{(i)}}$] (i = 1, 2, ..., 4p) denote the $\delta$-set associated with the orthogonal
family $F_{n^{(i)}}$ [$F_{n_1^{(i)}}$] of directed H-lines obtained from $F_{n'}$ [$F_{n_1'}$] in §7, noting that
$\Delta_{n^{(i)}}$ [$\Delta_{n_1^{(i)}}$] is a copy of $\Delta_{n'}$ [$\Delta_{n_1'}$].

Among the $F_{n^{(i)}}$ (i = 1, 2, ..., 4p) there exists one, say $F_{n^{(k)}}$, which contains
a set $S_{n^{(k)}}$ of directed H-line segments having the following properties:

I. Each member of $S_{n^{(k)}}$ intersects¹⁴ $\Delta_{n^{(k)}}$ and there exists a $\Delta_{n_1^{(i)}}$, say $\Delta_{n_1^{(k_1)}}$,
which is intersected by all the members of $S_{n^{(k)}}$.

II. The $\alpha$-end-points of the members of $S_{n^{(k)}}$ lie on the H-line segment


(22) $u_{n^{(k)}} = -3\delta$, $0 \le v_{n^{(k)}} \le |n'|$,


14 Cf. §1.













and the H-length of each member is less than $\lambda_1$, where

(23) $\lambda_1 = 11\delta + \log\frac{(4\sinh\delta)^2}{|n'|\,|n_1'|}$.

III. The members of $S_{n^{(k)}}$ intersect $n^{(k)}$ in the points of an arc $\mu_1$ which contains
an arc $\nu_1$ whose H-length is given by

(24) $|\nu_1| = |n_1'|\,e^{-\lambda_1}$.
Property I. Following Lemma 19, choose k from 1, 2, ..., 4p so that $[e_k]_r$
contains $\alpha_{n^{(k)}}$ and the end-points of $\omega_{n^{(k)}}$. This may be done in at least two
ways. Having chosen k, take $k_1$ so that $[e_{k_1}]_r$ contains $\omega_{n_1^{(k_1)}}$ and the end-points
of $\alpha_{n_1^{(k_1)}}$. According to Lemma 20, this may be done in at least two
ways, so that we can realize $k_1 \ne k$. Since $[e_k]_r$, $[e_{k_1}]_r$ ($k \ne k_1$) have no common
points, we see that


(25) $\omega_{n_1^{(k_1)}}$ and the end-points of $\alpha_{n_1^{(k_1)}}$ $\subset$ interior of $\omega_{n^{(k)}}$,
(26) $\alpha_{n^{(k)}}$ and the end-points of $\omega_{n^{(k)}}$ $\subset$ interior of $\alpha_{n_1^{(k_1)}}$.


Associated with $n^{(k)}$, $n_1^{(k_1)}$ there are the coördinate systems $[u_{n^{(k)}}, v_{n^{(k)}}]$,
$[u_{n_1^{(k_1)}}, v_{n_1^{(k_1)}}]$ respectively. $n^{(k)}$ ($n_1^{(k_1)}$) is the segment $0 \le v_{n^{(k)}} \le |n'|$
($0 \le v_{n_1^{(k_1)}} \le |n_1'|$) of $u_{n^{(k)}} = 0$ ($u_{n_1^{(k_1)}} = 0$) directed in the sense of increasing $v_{n^{(k)}}$
($v_{n_1^{(k_1)}}$) and the directed H-lines in $F_{n^{(k)}}$ ($F_{n_1^{(k_1)}}$) are the coördinate lines $v_{n^{(k)}} = d$
($v_{n_1^{(k_1)}} = d_1$), where $0 \le d \le |n'|$ ($0 \le d_1 \le |n_1'|$), directed from $u_{n^{(k)}} = -\infty$
($u_{n_1^{(k_1)}} = -\infty$) to $u_{n^{(k)}} = +\infty$ ($u_{n_1^{(k_1)}} = +\infty$).


Using (25), we may show that $u_{n_1^{(k_1)}} = 0$ lies in the region

(27) $0 < u_{n^{(k)}} < +\infty$, $0 < v_{n^{(k)}} < |n'|$,


of $\Phi$, and therefore that $u_{n_1^{(k_1)}} = c$ (c > 0) lies in (27). Conversely, using (26),
we find that $u_{n^{(k)}} = 0$ lies in the region


(28) $-\infty < u_{n_1^{(k_1)}} < 0$, $0 < v_{n_1^{(k_1)}} < |n_1'|$


of $\Phi$, and therefore that $u_{n^{(k)}} = -c$ (c > 0) lies in (28).

Consider the $\delta$-sets $\Delta_{n^{(k)}}$, $\Delta_{n_1^{(k_1)}}$. Here $\Delta_{n^{(k)}}$ ($\Delta_{n_1^{(k_1)}}$) is the set of directed
H-line segments having their $\alpha$- and $\omega$-end-points respectively on the segments
of $u_{n^{(k)}} = -\delta$, $u_{n^{(k)}} = \delta$ ($u_{n_1^{(k_1)}} = -\delta$, $u_{n_1^{(k_1)}} = \delta$) for which $0 \le v_{n^{(k)}} \le |n'|$
($0 \le v_{n_1^{(k_1)}} \le |n_1'|$). In order that a directed H-line l intersect $\Delta_{n_1^{(k_1)}}$, it is
sufficient that the $\alpha$-end-point of l lies on $\alpha_{n_1^{(k_1)}}$ and that l intersects the H-line
segment


(29) $u_{n_1^{(k_1)}} = \delta$, $0 \le v_{n_1^{(k_1)}} \le |n_1'|$.


Any directed H-line in $F_{n^{(k)}}$ intersects $\Delta_{n^{(k)}}$, and since $\alpha_{n^{(k)}} \subset \alpha_{n_1^{(k_1)}}$, those
directed H-lines in $F_{n^{(k)}}$ which intersect (29) will intersect $\Delta_{n_1^{(k_1)}}$. That directed
H-lines in $F_{n^{(k)}}$ exist which intersect (29) is trivial, since $u_{n_1^{(k_1)}} = \delta$ lies
in (27), and through any given point of (27) there passes exactly one directed
H-line of $F_{n^{(k)}}$.













A set $S_{n^{(k)}}$ of directed H-line segments which has property I of the lemma
can then be extracted from directed H-lines of $F_{n^{(k)}}$.

Property II. Let l denote an arbitrary member of $F_{n^{(k)}}$ which intersects
$\Delta_{n_1^{(k_1)}}$ in addition to intersecting $\Delta_{n^{(k)}}$. Let A, B, C ($A_1$, $B_1$, $C_1$) denote the
points where l intersects $u_{n^{(k)}} = -\delta$, $u_{n^{(k)}} = 0$, $u_{n^{(k)}} = \delta$ ($u_{n_1^{(k_1)}} = -\delta$,
$u_{n_1^{(k_1)}} = 0$, $u_{n_1^{(k_1)}} = \delta$) respectively. Obviously, B ($B_1$) lies between A ($A_1$)
and C ($C_1$). A point moving on l from the $\alpha$- to the $\omega$-end-point of l conceivably
meets A, C, $A_1$, $C_1$ in five possible orders, namely,


(30) $ACA_1C_1$, $AA_1CC_1$, $AA_1C_1C$, $A_1ACC_1$, $A_1AC_1C$.

If we take these cases in the order given, the directed H-line segments

(31) $AC_1$, $AC_1$, $AC$, $A_1C_1$, $A_1C$

intersect $\Delta_{n^{(k)}}$ and $\Delta_{n_1^{(k_1)}}$. Consider the region

(32) $-3\delta \le u_{n^{(k)}} \le -\delta$, $0 \le v_{n^{(k)}} \le |n'|$


of $\Phi$. The $\alpha$-end-points of the directed H-line segments in (31) all lie in (32).
This statement is trivial for the first three cases. For the remaining two cases
we note that $|A_1B| < |A_1B_1|$. According to Lemma 6, $|A_1B_1| < |n_1'| + \delta$
and therefore, since


(33) $|n_1'| < 2\delta$,


inasmuch as $\sinh(|n_1'|/2) < \tanh\delta < \sinh\delta$, we have $|A_1B| < 3\delta$, which proves
the statement for the remaining two cases.

A set $S_{n^{(k)}}$ of directed H-line segments having property I therefore exists
such that the $\alpha$-end-point of each member lies on (22).

Let s be an arbitrary member of such a set $S_{n^{(k)}}$. We shall show that $|s|$
can be taken less than $\lambda_1$, where $\lambda_1$ is defined in (23). Denote the $\alpha$-end-point
of s by $\alpha$.

For the first case in (30), construct the broken H-line segment $\alpha A'C_1'C_1$,
where A' is a point on the side $e_k$ of $D_0$ along which $D_0$, $D_k$ border and $C_1'$ is a
point on the side $e_{k_1}$ of $D_0$ along which $D_0$, $D_{k_1}$ border. Since $D_k$ is contained
in the region [see (20)]


$T_{n^{(k)}}$: $0 < u_{n^{(k)}} \le r_{n'}$, $0 \le v_{n^{(k)}} \le |n'|$,

and $D_{k_1}$ is contained in the region [see (21)]

$T'_{n_1^{(k_1)}}$: $-r_{n_1'} \le u_{n_1^{(k_1)}} < 0$, $0 \le v_{n_1^{(k_1)}} \le |n_1'|$,

we have, on using Lemma 6,

$|\alpha A'| < |n'| + 3\delta + r_{n'}$, $|C_1'C_1| < |n_1'| + \delta + r_{n_1'}$.

Now $|A'C_1'| < \delta$, so that


(34) $|\alpha C_1| < 5\delta + |n'| + |n_1'| + r_{n'} + r_{n_1'}$,
















inasmuch as $|\alpha C_1| < |\alpha A'| + |A'C_1'| + |C_1'C_1|$. Using (33) and the inequality


(35) $|n'| < 2\delta$,

which is proved in the same way as (33), and employing (7), we find

(36) $|\alpha C_1| < 11\delta + \log\frac{(4\sinh\delta)^2}{|n'|\,|n_1'|}$.


Before disposing of the remaining four cases, we note that


$|AC| = 2\delta$, $|A_1C_1| < |n_1'| + 2\delta < 4\delta$.

For the second case, we have

(37) $|\alpha C_1| \le |\alpha C| + |A_1C_1| < 4\delta + 4\delta = 8\delta$,

inasmuch as $|\alpha C| = |\alpha B| + |BC| = 4\delta$.

In the third case, we have, since $|\alpha A| = 2\delta$,

(38) $|\alpha C| = |\alpha A| + |AC| = 2\delta + 2\delta = 4\delta$.


In the fourth case, we have, since $|\alpha A_1| \le 2\delta$,

(39) $|\alpha C_1| \le |\alpha A_1| + |A_1C_1| < 2\delta + 4\delta = 6\delta$.


For the fifth case, we proceed as in the third case, and obtain the same inequality
(38).

On comparing (36), (37), (38), (39) with (23), we see that $|s|$ can always be
taken less than $\lambda_1$.
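
The comparison of the case bounds with (23) is a matter of arithmetic that can be spot-checked. The sketch below assumes (7) reads r_n = delta + log(4 sinh(delta)/|n|), as quoted in §7, and (23) reads lambda_1 = 11 delta + log((4 sinh delta)^2/(|n'||n_1'|)); the function names and the sample values are hypothetical.

```python
import math

def r_of(delta: float, n_len: float) -> float:
    """Assumed form of (7): r_n = delta + log(4 sinh(delta) / |n|)."""
    return delta + math.log(4.0 * math.sinh(delta) / n_len)

def lambda_1(delta: float, n_len: float, n1_len: float) -> float:
    """Assumed form of (23)."""
    return 11.0 * delta + math.log((4.0 * math.sinh(delta)) ** 2 / (n_len * n1_len))

def bound_34(delta: float, n_len: float, n1_len: float) -> float:
    """Right-hand side of (34) for the first case."""
    return 5.0 * delta + n_len + n1_len + r_of(delta, n_len) + r_of(delta, n1_len)

delta, n_len, n1_len = 1.5, 0.4, 0.9  # sample values with sinh(|n|/2) < tanh(delta)
lam = lambda_1(delta, n_len, n1_len)
first_case = bound_34(delta, n_len, n1_len)
other_cases = [8.0 * delta, 4.0 * delta, 6.0 * delta]
```

With |n'| and |n_1'| each below 2 delta, the bound (34) falls short of lambda_1 by 4 delta − |n'| − |n_1'|, and the logarithm in (23) is positive, so the constant multiples of delta from the other cases are dominated as well.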

Property III. Consider the region


$R_1$: $-\infty < u_{n_1^{(k_1)}} < +\infty$, $0 \le v_{n_1^{(k_1)}} \le |n_1'|$


of $\Phi$ and denote the two remaining regions of $\Phi$ by $R_2$, $R_3$. From (28), we see
that $n^{(k)}$ lies in $R_1$. Those members of $F_{n^{(k)}}$ which intersect (29) (and therefore
intersect $\Delta_{n_1^{(k_1)}}$) intersect $n^{(k)}$ in the points of an arc $\mu_1$, the two members
of $F_{n^{(k)}}$ drawn through the end-points of $\mu_1$ passing through the end-points of
(29) into the regions $R_2$, $R_3$. Denote these two members of $F_{n^{(k)}}$ by $r_2$, $r_3$,
choosing the notation so that $r_2$ enters the region $R_2$. A point displaced along
$r_2$ ($r_3$) from $n^{(k)}$ to the $\omega$-end-point of $r_2$ ($r_3$) enters, and never leaves $R_2$ ($R_3$).

One readily sees from the analysis accompanying property II that when
$v_{n^{(k)}}$ = const. intersects (29), the H-length of the segment of it comprehended
between $n^{(k)}$ and (29) is less than $\lambda_1$. Hence $u_{n^{(k)}} = \lambda_1$ intersects $r_2$, $r_3$ at
points interior to $R_2$, $R_3$ respectively. Therefore the H-length of the segment
of $u_{n^{(k)}} = \lambda_1$ comprehended between $r_2$ and $r_3$ exceeds $|n_1'|$, since $|n_1'|$ is the
greatest lower bound of the H-distances between points of $R_2$ and $R_3$.

On placing $n = \mu_1$, $k = \lambda_1$, $x > |n_1'|$ in (2), we find


$|\mu_1| \cosh\lambda_1 > |n_1'|$,
















and therefore that $|\mu_1| > |n_1'|\,e^{-\lambda_1}$.


A set $S_{n^{(k)}}$ of directed H-line segments having properties I and II of the lemma, and which also possesses property III, therefore always exists.

Lemma 21 forms the basis for the following lemma. 

LEMMA 22. Let $q$ denote a positive integer and let $\Delta_{n'}, \Delta_{n_1}, \Delta_{n_2}, \cdots, \Delta_{n_q}$ be $1 + q$ $\delta$-sets based on the directed H-line segments $n', n_1, n_2, \cdots, n_q$ respectively, where

(40) $\sinh(|n'|/2) < \tanh\delta, \qquad \sinh(|n_j|/2) < \tanh\delta \quad (j = 1, 2, \cdots, q).$

Let $F_{n'}$ be the orthogonal family of directed H-lines based on $n'$ and associated with $\Delta_{n'}$.

Among the copies of $F_{n'}$ there is one, say $F_{n^{[q]}}$, which contains a set $S_{n^{[q]}}$ of directed H-line segments having the following properties:

I. The members of $S_{n^{[q]}}$ intersect the copy $\Delta_{n^{[q]}}$ of $\Delta_{n'}$ and intersect copies of each of the $\delta$-sets $\Delta_{n_1}, \Delta_{n_2}, \cdots, \Delta_{n_q}$.

II. The $\alpha$-end-points of the members of $S_{n^{[q]}}$ lie on the H-line segment

(41) $u_{n^{[q]}} = -3\delta, \qquad 0 \le v_{n^{[q]}} \le |n'|$,

and the H-length of each member is less than $\lambda_q$, where

(42) $\lambda_q = 11\delta q + \log \dfrac{(4\sinh\delta)^{2q}}{|n'|\,|n_1|^2\,|n_2|^2 \cdots |n_{q-1}|^2\,|n_q|}$.

III. The members of $S_{n^{[q]}}$ intersect $n^{[q]}$ in the points of an arc $\mu_q$ which contains an arc $\nu_q$ whose H-length is given by

(43) $|\nu_q| = |n_q|\, e^{-\lambda_q}$.

For $q = 1$, this lemma is equivalent to Lemma 21. The proof for an arbitrary $q$ is now obtained by induction.

Suppose the lemma is true for $q = r - 1$. On this assumption, there is a copy $F_{n^{[r-1]}}$ of $F_{n'}$ which contains a set $S_{n^{[r-1]}}$ of directed H-line segments having the properties I, II, III of the lemma, $q$ being replaced by $r - 1$.

Let $F_{\nu_{r-1}}$ denote the orthogonal family of directed H-lines composed of those members of $F_{n^{[r-1]}}$ which intersect $\nu_{r-1}$ and let $S_{\nu_{r-1}}$ denote the subset of $S_{n^{[r-1]}}$ composed of those members of $S_{n^{[r-1]}}$ which intersect $\nu_{r-1}$. Let $\Delta_{\nu_{r-1}}$ denote the $\delta$-set based on $\nu_{r-1}$ and associated with $F_{\nu_{r-1}}$. We have

(44) $\Delta_{n^{[r-1]}} \supset \Delta_{\nu_{r-1}}$,

and note that $S_{\nu_{r-1}}$ possesses the properties I, II, III ($q = r - 1$) possessed by $S_{n^{[r-1]}}$.

Apply Lemma 21 to $\Delta_{\nu_{r-1}}$, $\Delta_{n_r}$. Using the same notation as was employed in §7 and Lemma 21, we see that among the $F_{\nu_{r-1}^{(i)}}$ ($i = 1, 2, \cdots, 2p$) there is one, say $F_{\nu_{r-1}^{(k)}}$, which contains a set $S_{\nu_{r-1}^{(k)}}$ of directed H-line segments having the following properties:

I'. Each member of $S_{\nu_{r-1}^{(k)}}$ intersects $\Delta_{\nu_{r-1}^{(k)}}$ and there exists a $\Delta_{n_r}^{(i)}$, say $\Delta_{n_r}^{(k_1)}$, which is intersected by all the members of $S_{\nu_{r-1}^{(k)}}$.

II'. The $\alpha$-end-points of the members of $S_{\nu_{r-1}^{(k)}}$ lie on the H-line segment

(45) $u_{\nu_{r-1}^{(k)}} = -3\delta, \qquad 0 \le v_{\nu_{r-1}^{(k)}} \le |\nu_{r-1}|$,

and the H-length of each member is less than $\bar\lambda_1$, where

(46) $\bar\lambda_1 = 11\delta + \log \dfrac{(4\sinh\delta)^2}{|\nu_{r-1}|\,|n_r|}$.

III'. The members of $S_{\nu_{r-1}^{(k)}}$ intersect $\nu_{r-1}^{(k)}$ in the points of an arc $\bar\mu_1$ which contains an arc $\bar\nu_1$ whose H-length is given by

(47) $|\bar\nu_1| = |n_r|\, e^{-\bar\lambda_1}$.

Placing $q = r - 1$ in (43) and substituting the resulting expression for $|\nu_{r-1}|$ in (46), we find

$\bar\lambda_1 = 11\delta + \lambda_{r-1} + \log \dfrac{(4\sinh\delta)^2}{|n_{r-1}|\,|n_r|}$,

and therefore, from (42) for $q = r - 1$, we obtain

$\bar\lambda_1 = 11\delta r + \log \dfrac{(4\sinh\delta)^{2r}}{|n'|\,|n_1|^2\,|n_2|^2 \cdots |n_{r-1}|^2\,|n_r|}$,

so that

(48) $\bar\lambda_1 = \lambda_r$.

On comparing (47) and (48) with (43), we find

(49) $|\bar\nu_1| = |\nu_r|$.

Using (40), we easily show that

(50) $\lambda_{r-1} < \lambda_r$.


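Let $\Sigma_{\nu_{r-1}^{(k)}}$ denote the transform of $S_{\nu_{r-1}}$ by the transformation of $G$ which carries $F_{\nu_{r-1}}$ into $F_{\nu_{r-1}^{(k)}}$. $\Sigma_{\nu_{r-1}^{(k)}}$ has the following properties:

I''. The members of $\Sigma_{\nu_{r-1}^{(k)}}$ intersect the copy $\Delta_{\nu_{r-1}^{(k)}}$ of $\Delta_{\nu_{r-1}}$ and intersect copies of the $\delta$-sets $\Delta_{n_1}, \Delta_{n_2}, \cdots, \Delta_{n_{r-1}}$.

II''. The $\alpha$-end-points of the members of $\Sigma_{\nu_{r-1}^{(k)}}$ fill up the H-line segment (45) and the H-length of each member is less than $\lambda_{r-1}$.

III''. The intersection points of the members of $\Sigma_{\nu_{r-1}^{(k)}}$ with $\nu_{r-1}^{(k)}$ fill up $\nu_{r-1}^{(k)}$.

According to I', I'', the members of both $S_{\nu_{r-1}^{(k)}}$, $\Sigma_{\nu_{r-1}^{(k)}}$ intersect a copy of $\Delta_{\nu_{r-1}}$. From (44), they therefore intersect a copy of $\Delta_{n'}$.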

In order to construct a set $S_{n^{[r]}}$ of directed H-line segments possessing the properties I, II, III for $q = r$, we take $n^{[r]}$ as that copy of $n'$ which contains $\nu_{r-1}^{(k)}$ as a segment. $F_{n^{[r]}}$ is the copy of $F_{n'}$ based on $n^{[r]}$. On comparing I', II', III' with I'', II'', III'' in the light of (48), (49), (50), we see that a set $S_{n^{[r]}}$ exists; the arc $\nu_r$ in III ($q = r$) being taken as the arc $\bar\nu_1$ in III', and the members of $S_{n^{[r]}}$ constructed as directed H-line segments drawn perpendicular to $\bar\nu_1$ with their $\alpha$-end-points on (45) and their H-lengths less than $\lambda_r$.

The following corollary is important in the subsequent development of the 
paper. 

COROLLARY. If the H-lengths $|n'|, |n_1|, |n_2|, \cdots, |n_q|$ all exceed a positive constant $\gamma$, the H-length of each member of $S_{n^{[q]}}$ is less than $\lambda_q$, where

$\lambda_q < q\left\{11\delta + 2\log\dfrac{4\sinh\delta}{\gamma}\right\}$.

The proof of this corollary is immediate in view of the value given for $\lambda_q$ in property II of the above lemma.
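The bound of the corollary can be checked numerically. The following is a minimal sketch, assuming the value of $\lambda_q$ in (42) and the hypothesis (40); the function names and the sample H-lengths are illustrative only and are not taken from the paper.

```python
import math

def lambda_q(delta, n_prime, ns):
    """Right-hand side of (42); ns holds the H-lengths |n_1|, ..., |n_q|."""
    q = len(ns)
    # The denominator |n'| |n_1|^2 ... |n_{q-1}|^2 |n_q| has 2q factors in all.
    log_den = math.log(n_prime) + math.log(ns[-1])
    log_den += 2.0 * sum(math.log(x) for x in ns[:-1])
    return 11.0 * delta * q + 2.0 * q * math.log(4.0 * math.sinh(delta)) - log_den

def corollary_bound(delta, gamma, q):
    """Upper bound of the corollary when every H-length exceeds gamma."""
    return q * (11.0 * delta + 2.0 * math.log(4.0 * math.sinh(delta) / gamma))

# Hypothetical data satisfying (40) for delta = 1.
delta, n_prime, ns, gamma = 1.0, 0.3, [0.25, 0.4, 0.35], 0.2
print(lambda_q(delta, n_prime, ns), corollary_bound(delta, gamma, len(ns)))
```

The same function reproduces the induction step of the proof: passing from $q = r - 1$ to $q = r$ adds $11\delta + \log\{(4\sinh\delta)^2 / (|n_{r-1}|\,|n_r|)\}$.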





9. The surface $\varphi$ and the phase space $M$. When congruent points on $\Phi$ are taken as identical, a topologically closed, orientable surface $\varphi$ of genus $p$ and of constant curvature $-1$ is obtained.¹⁵

Let $g_\Phi$ denote an arbitrary directed H-line segment on $\Phi$. From Lemmas 15 and 17 it follows that $g_\Phi$ is divided into a finite number of directed segments by the meshes of $N$, each segment being directed in the same sense as $g_\Phi$. If a mesh containing a segment of $g_\Phi$ is transformed into $D_0$ by a properly chosen transformation of $G$, the transform of the segment of $g_\Phi$ which it contains is a directed H-line segment lying in $D_0$. The totality of directed H-line segments lying in $D_0$ obtained in this manner from $g_\Phi$ constitutes a directed geodesic segment $g_\varphi$ on $\varphi$. $g_\Phi$ represents $g_\varphi$ on $\Phi$. This representation is not unique, any copy of $g_\Phi$ also representing $g_\varphi$ on $\Phi$. The directed H-line segments in $D_0$ comprising $g_\varphi$ are pieces of $g_\varphi$. The sum of the H-lengths of the pieces of $g_\varphi$ is the H-length of $g_\varphi$ and equals the H-length of $g_\Phi$.

Let $P$ be an arbitrary point on $\Phi$ and to $P$ attach a direction $\theta$ measured in the usual way in the $x, y$-plane. Following Morse, the pair $(P, \theta)$ is an element. Two elements whose angles differ by an integral multiple of $2\pi$ will be taken as identical.¹⁶ An element $(P, \theta)$ is on a directed H-line segment $g_\Phi$ if $P$ is a point of $g_\Phi$ and $\theta$ is the direction of $g_\Phi$ at $P$. Two elements $(P', \theta')$, $(P'', \theta'')$ are copies of one another if $P'$ can be transformed into $P''$ by a transformation of $G$ which takes $\theta'$ into $\theta''$. The distance between two elements $(P', \theta')$, $(P'', \theta'')$ will be defined to be

$|P'P''| + \min |\theta' - \theta'' + 2n\pi|$;


15 See, for example, Morse II or Nielsen, loc. cit.

16 Morse I, p. 52. Our convention differs from that of Morse, since he regards two directions as identical when they differ by an integral multiple of $\pi$, whereas we take two directions identical if they differ by an integral multiple of $2\pi$.






















where $\min |\theta' - \theta'' + 2n\pi|$ represents the minimum of $|\theta' - \theta'' + 2n\pi|$ for all integers $n$, positive, negative or zero.¹⁷ A pair of elements $(P_0', \theta')$, $(P_0'', \theta'')$ in which $P_0'$, $P_0''$ are points of $D_0$ possess a $\varphi$-distance equal to the greatest lower bound of the distances from the copies of one to the copies of the other.
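The distance between two elements defined above can be sketched as follows. This is only an illustration of the formula $|P'P''| + \min |\theta' - \theta'' + 2n\pi|$; the H-distance $|P'P''|$ is assumed to be supplied by the caller, and the function names are hypothetical.

```python
import math

def angle_gap(theta1, theta2):
    """Minimum of |theta1 - theta2 + 2*n*pi| over all integers n."""
    d = abs(math.fmod(theta1 - theta2, 2.0 * math.pi))  # lies in [0, 2*pi)
    return min(d, 2.0 * math.pi - d)

def element_distance(h_dist, theta1, theta2):
    """Distance between elements (P', theta1), (P'', theta2) with |P'P''| = h_dist."""
    return h_dist + angle_gap(theta1, theta2)

print(element_distance(1.5, 0.1, 2.0 * math.pi - 0.1))
```

The $\varphi$-distance would then be the greatest lower bound of this quantity over the copies of the two elements, which requires the group $G$ and is not attempted here.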

The set of elements $(P_0, \theta)$ for which $P_0$ lies in $D_0$ constitutes the phase space $M$. The distance between two elements in $M$ is taken to be their $\varphi$-distance as defined above. The set of elements in $M$ afforded by the points and directions on a directed geodesic segment $g_\varphi$ is a phase curve $g_M$, and we shall say $g_\varphi$ generates $g_M$. If $g_\Phi$ represents $g_\varphi$ on $\Phi$, we shall also say that $g_\Phi$ generates $g_M$.


10. The subsets $S_M^{(m)}$, $S_M$, $S_M'$ of $M$. Let $g_\Phi$ be a directed H-line segment on $\Phi$ which terminates on $N$, intersects $D_0$, and meets no vertex of $N$. In addition, $g_\Phi$ is supposed to have an admissible sequence

(51) $a_1 \cdots a_m a_{m+1} \cdots a_{2m}$,

in which $a_m$, $a_{m+1}$ arise from the intersections of $g_\Phi$ with the boundary of $D_0$. Consider the set of directed H-line segments such as $g_\Phi$ which have their $\alpha$ ($\omega$)-end-points on the side $a_1$ ($a_{2m}$) of $N$. Denote this set by $S^{(m)}$, and consider the subset of $S^{(m)}$ formed by those directed H-line segments which are segments of members of $S^{(m)}$ and which lie on $D_0$. Denote this subset by $S_{D_0}^{(m)}$ and let $S_M^{(m)}$ denote the set of elements in $M$ which are on the members of $S_{D_0}^{(m)}$. $S^{(m)}$, $S_{D_0}^{(m)}$, $S_M^{(m)}$ are generated by the admissible sequence (51).

For fixed $m$, the number of admissible sequences (51) cannot exceed $(4p)^{2m}$, so that the number of sets $S_M^{(m)}$ does not exceed $(4p)^{2m}$. An arbitrary element $(P_0, \theta)$ of $M$ is, however, contained in at least one $S_M^{(m)}$. For take the point $P_0$ of $D_0$ and construct the directed H-line $AB$ passing through $P_0$ with the direction $\theta$. If $AB$ does not meet a vertex of $N$, a segment $A_1B_1$ of $AB$ can be taken which has an admissible sequence such as (51), and the set $S_M^{(m)}$ generated by this admissible sequence contains $(P_0, \theta)$. When $AB$ meets one or more vertices of $N$ two cases are possible: (a) $AB$ coincides with an H-line in $N$; (b) $AB$ does not coincide with an H-line in $N$.

Let us consider (a). Here $AB$ is divided into segments each of H-length $\rho$ by the vertices of $N$ and from any vertex $V$ of $N$ on $AB$ radiate $2p - 1$ H-rays of $N$ into each of the regions $\Phi_1$, $\Phi_2$ into which $\Phi$ is divided by $AB$. The $2p - 1$ H-rays radiating from $V$ into $\Phi_1$ are now directed positively away from $V$. A directed H-ray $r_1$ precedes a directed H-ray $r_2$ if the angle at $V$ taken positively and less than $\pi$ between $r_1$ and the negative sense on $AB$ is less than the corresponding angle for $r_2$. Giving those directed H-rays emanating from a vertex $V'$ on $AB$ precedence over those emanating from a vertex $V''$ of $AB$ if $V'$ precedes $V''$ on $AB$, we may arrange the directed H-rays radiating from the vertices of $N$ on $AB$ into $\Phi_1$ in an unending sequence

(52) $\cdots r_{-m}\, r_{-m+1} \cdots r_{-2}\, r_{-1}\, r_1\, r_2 \cdots r_{m-1}\, r_m \cdots$.


17 Compare Morse I, p. 53. 
















Let us take $\Phi_1$ to be the region of $\Phi$ containing $D_0$ and choose the notation in the above sequence so that $r_{-1}$, $r_1$ correspond to the sides of $D_0$. Consider the set $S^{(m)}$ comprising those directed H-line segments with their $\alpha$ ($\omega$)-end-points on that side of $N$ which lies on $r_{-m}$ ($r_m$) and meets $AB$. The element $(P_0, \theta)$ is on the segment of $AB$ which belongs to $S^{(m)}$. Consequently $(P_0, \theta)$ is contained in a set $S_M^{(m)}$.

The treatment of (b) requires a slight modification of the procedure in (a). Here $AB$ meets $N$ at points of two types, those which are vertices of $N$ and those which are not vertices of $N$. As before, $\Phi$ is divided by $AB$ into two regions $\Phi_1$, $\Phi_2$, each of which may now contain points of $D_0$. In certain instances one of $\Phi_1$, $\Phi_2$ may contain no points of $D_0$, as is the case when $AB$ meets $D_0$ in a vertex only. In any event, $\Phi_1$ shall denote one of the regions which contains points of $D_0$. When $AB$ meets a vertex $V$ of $N$, exactly $2p$ H-rays in $N$ radiate from $V$ into $\Phi_1$. These $2p$ H-rays are then directed and ordered by a repetition of the process employed in (a) for the $2p - 1$ H-rays there considered. When $AB$ meets a point $P$ of $N$ which is not a vertex of $N$, only one H-ray in $N$ radiates from $P$ into $\Phi_1$. This H-ray is directed positively from $P$. A convention similar to that employed above in the construction of the unending sequence (52) then leads us to a similar sequence, and from this point onward the procedure is analogous to that in (a).

Take $m = 2pr + 1$ in (51), where

(53) $r > \rho/\kappa$

and is a positive integer. From Lemma 15, we see that

$|[a_1 a_m]| \ge r\kappa > \rho, \qquad |[a_{m+1} a_{2m}]| \ge r\kappa > \rho$,

and therefore, since $|g_\Phi| > |[a_1 a_m]| + |[a_{m+1} a_{2m}]|$, we have

(54) $|g_\Phi| > 2\rho$.

This last inequality shows that $a_1$, $a_{2m}$ cannot have a common end-point. For, if they do, $g_\Phi$ lies in the H-circle of H-radius $\rho$ described about the common end-point as an H-center, so that $|g_\Phi| \le 2\rho$, which contradicts (54). In addition, $a_1$, $a_{2m}$ cannot be segments of the same H-line of $N$, as this would require $g_\Phi$ to meet a vertex of $N$.

Four distinct members of $S^{(m)}$ therefore exist which connect end-points of $a_1$ to end-points of $a_{2m}$. Two of these, say $l$, $l'$, do not intersect. At least one, say $l$, intersects the boundary of $D_0$. For suppose this is not the case. Since $g_\Phi$ intersects the boundary of $D_0$, the interior of the region of $\Phi$ enclosed by $l$, $l'$, $a_1$, $a_{2m}$ contains $D_0$ together with its boundary, and hence contains the H-circle (12) of H-radius $\rho/2$. This last is impossible, according to Lemma 4; thus $l$ may be taken as intersecting the boundary of $D_0$.

Let $A$, $B$ ($A_0$, $B_0$) denote the $\alpha$- and $\omega$-end-points respectively of $l$ ($g_\Phi$). Let $A'$, $B'$ denote the points (possibly coincident) where $l$ intersects¹⁸ the boundary




18 In case $l$ intersects the boundary of $D_0$ in a side of $D_0$, $A'$, $B'$ denote the end-points of this side.



















of Do, choosing the notation so that | AA’ | S | AB’ |. A review of the proof 
of Lemma 17 shows that a member A, B, of S not meeting a vertex of N can 
be selected so that it intersects the boundary of D, in points As, B; which lie 
on the same sides of Do as do the points A’, B’ respectively, and in addition is 
such that A; (B;) lies on a; (@2m) between A(B) and Apo (Bo). _Using the method 
employed in the proof of Lemma 16 to deform A,B, into A, B,, we easily see 
that the number of symbols in the admissible sequences of A, Aj and BB, is 
in each case equal tom = 2pr+1. On interpreting A,B, in Lemma 17 as AA’ 
(B’B) and AB in Lemma 15 as A, A! (Bi B;), we have 


(55) $|AA'| \ge r\kappa > \rho, \qquad |B'B| \ge r\kappa > \rho$,

so that $|AB| > 2\rho$, and therefore

$|AB| = 2\lambda_1 + 2\rho \qquad (\lambda_1 > 0)$.





We introduce two Gaussian geodesic coördinate systems $[u_n, v_n]$, $[u_{n'}, v_{n'}]$. In the first, $A$, $B$ are the points $[-(\lambda_1 + \rho), 0]$ and $[\lambda_1 + \rho, 0]$ respectively, and the region $v_n \ge 0$ of $\Phi$ contains the region of $\Phi$ bounded by $l$, $l'$, $a_1$, $a_{2m}$. In the second, $A'$, $B'$ are the points $[-k, 0]$, $[k, 0]$ respectively ($k$ is a constant, positive or zero) and the region $v_{n'} \ge 0$ of $\Phi$ contains the region of $\Phi$ bounded by $l$, $l'$, $a_1$, $a_{2m}$.

The points $R'[-r\kappa, 0]$, $R''[r\kappa, 0]$ in the $[u_{n'}, v_{n'}]$ coördinate system lie on $AB$. This follows from (55). Let $C_1$, $C'$, $C''$, $C_1'$ denote the H-semicircles drawn in the region $v_{n'} \ge 0$ of $\Phi$ about the points $A$, $R'$, $R''$, $B$ as H-centers respectively, each H-semicircle having an H-radius $\rho$. Let $S$ denote the set of directed H-lines which intersect both $C'$, $C''$ and are directed from $C'$ to $C''$. The members of $S$ intersect $u_{n'} = 0$ in the points of an H-line segment which we denote by $n'$. $n'$ is therefore the H-line segment

(56) $u_{n'} = 0, \qquad 0 \le v_{n'} \le |n'|$,

and $v_{n'} = |n'|$ is tangent to both $C'$, $C''$. Since $a_1$ ($a_{2m}$) is an H-radius of $C_1$ ($C_1'$), one perceives, on identifying the $[u_n, v_n]$ coördinate system introduced above with the $[u_n, v_n]$ coördinate system in Lemma 3, that

(57) $S \supset S^{(m)}$.


If $S_{D_0}$ denotes the subset of $S$ which is formed by directed H-line segments which are segments of members of $S$ and which lie in $D_0$, we have, from (57), the fact that $S_{D_0} \supset S_{D_0}^{(m)}$. Hence if $S_M$ denotes the set of elements in $M$ which are on the members of $S_{D_0}$, we see that $S_M \supset S_M^{(m)}$.

For a fixed $m$, the number of sets $S_M$ does not exceed $(4p)^{2m}$, each element of $M$ being contained in at least one set $S_M$.

Next we note that $S_{D_0}$ is a subset of a $\delta$-set $\Delta_{n'}$ based on $n'$, the $\alpha$ ($\omega$)-end-points of the members of $\Delta_{n'}$ composing the points of the arc $0 \le v_{n'} \le |n'|$ of $u_{n'} = -\delta$ ($u_{n'} = \delta$). We observe that if $\xi$ denotes the H-distance from the center of $\Psi$ to $u_{n'} = 0$, we have $\xi \le \delta/2 < \delta$. Those elements which are on










members of $\Delta_{n'}$ and belong to $M$ constitute a set $S_M'$ of elements in $M$ which contains $S_M$.

For a given $m$, the number of sets $S_M'$ does not exceed $(4p)^{2m}$, each element of $M$ belonging to at least one $S_M'$.


11. The function $n(\varepsilon)$. Consider a $\delta$-set $\Delta_{n'}$ of directed H-line segments based on a directed H-line segment $n'$ which is a segment of a directed H-line whose H-distance $\xi$ from the center of $\Psi$ is less than $\delta$. Choose a member of $\Delta_{n'}$ at random. Let $\varepsilon$ denote an arbitrary, preassigned positive number. We propose to calculate a function $n(\varepsilon)$ of $\varepsilon$ such that if $|n'| < n(\varepsilon)$, an arbitrary element on an arbitrary member of $\Delta_{n'}$ lies within a distance $\varepsilon$ of some element on the member of $\Delta_{n'}$ chosen at random above.

We begin by introducing the Gaussian geodesic coördinate system $[u_{n'}, v_{n'}]$ on $\Phi$ associated with the directed H-line segment $n'$. The $\alpha$ ($\omega$)-end-points of the members of $\Delta_{n'}$ lie on $u_{n'} = -\delta$ ($u_{n'} = \delta$). Let $(P, \theta)$ be an arbitrary element on an arbitrary member of $\Delta_{n'}$, and let $[a, b]$ denote the coördinates of $P$ in the $[u_{n'}, v_{n'}]$ coördinate system. The member of $\Delta_{n'}$ chosen at random above intersects $u_{n'} = a$ at a point $P'[a, b']$ and has a direction $\theta'$ at this point. We proceed to derive an upper bound for the distance between the elements $(P, \theta)$, $(P', \theta')$.

The H-distance $|PP'|$ cannot exceed the H-length of the segment of $u_{n'} = a$ lying between $P$ and $P'$, and this H-length cannot exceed the H-length of the segment of $u_{n'} = a$ comprehended between $v_{n'} = 0$, $v_{n'} = |n'|$, so that from (2) we find

(58) $|PP'| \le |n'| \cosh a$.

Now $-\delta \le a \le \delta$. Therefore

(59) $\cosh a \le \cosh\delta$,

and hence

(60) $|PP'| < |n'| \cosh\delta$,

since the equality signs in (58) and (59) cannot hold simultaneously.
We now take up $|\theta - \theta'|$. Let $Q$ denote an arbitrary point of the region

(61) $-\delta \le u_{n'} \le \delta, \qquad 0 \le v_{n'} \le |n'|$

of $\Phi$. Denote by $\zeta$ ($\zeta'$) the magnitude of the angle filled up at $Q$ by the directions of those H-rays drawn from $Q$ to intersect the segment $0 \le v_{n'} \le |n'|$ of $u_{n'} = \delta$ ($u_{n'} = -\delta$). If $[u_{n'}, v_{n'}]$ are the coördinates of $Q$, the angles $\zeta$, $\zeta'$ are functions of $u_{n'}$, $v_{n'}$, and we write¹⁹

$\zeta = \zeta(u_{n'}, v_{n'}), \qquad \zeta' = \zeta'(u_{n'}, v_{n'})$.

19 Here $\zeta(\delta, v_{n'})$, $\zeta'(-\delta, v_{n'})$ are not properly defined. We place

$\zeta(\delta, v_{n'}) = \zeta'(-\delta, v_{n'}) = 2\pi$.







If we define $Z = \min\{\zeta, \zeta'\}$, we have $Z = Z(u_{n'}, v_{n'})$ yielding an upper bound for the magnitude of the angle filled up at $Q$ by the directions of those members of $\Delta_{n'}$ which pass through $Q$. If we hold $v_{n'}$ fast, $\zeta$ ($\zeta'$) is seen to be a monotone increasing (decreasing) function of $u_{n'}$ for $-\delta \le u_{n'} \le \delta$. Moreover, from reasons of symmetry, it is apparent that $\zeta(-u_{n'}, v_{n'}) = \zeta'(u_{n'}, v_{n'})$, so that $\zeta(0, v_{n'}) = \zeta'(0, v_{n'})$. Hence

$Z = \zeta$ when $-\delta \le u_{n'} \le 0$, $\qquad Z = \zeta'$ when $0 \le u_{n'} \le \delta$,

and therefore, for a given $v_{n'}$,

$Z(u_{n'}, v_{n'}) \le Z(0, v_{n'}), \qquad -\delta \le u_{n'} \le \delta$.

Finally, an elementary calculation which is rather tedious and need not be given here shows that²⁰

$Z(0, v_{n'}) \le Z(0, |n'|/2), \qquad 0 \le v_{n'} \le |n'|$,

so that

(62) $Z(u_{n'}, v_{n'}) \le Z(0, |n'|/2), \qquad -\delta \le u_{n'} \le \delta, \quad 0 \le v_{n'} \le |n'|$.


In order to obtain an upper bound for $Z(0, |n'|/2)$, let us introduce a system of geodesic polar coördinates $[r, \psi]$ on $\Phi$, taking the pole as the point $[0, |n'|/2]$ in the $[u_{n'}, v_{n'}]$ coördinate system and measuring $\psi$ positively in the counter-clockwise sense from the coördinate line $\psi = 0$, which is taken as the H-ray drawn from the pole to the point with coördinates $[\delta, 0]$ in the $[u_{n'}, v_{n'}]$ coördinate system. According to (2), the H-length of the segment of $u_{n'} = \delta$ which is comprehended between $v_{n'} = 0$, $v_{n'} = |n'|$ is $|n'| \cosh\delta$. On using (P) to recalculate this H-length, one finds

$|n'| \cosh\delta = \displaystyle\int \{dr^2 + \sinh^2 r\, d\psi^2\}^{1/2}$.

Comparing the circle $r = \delta$ with the circle of which $u_{n'} = \delta$ is a segment, we have $r \ge \delta$ in the above integration, so that

$|n'| \cosh\delta > \displaystyle\int \{dr^2 + \sinh^2\delta\, d\psi^2\}^{1/2} > \int_0^{Z(0, |n'|/2)} \sinh\delta\, d\psi$,

from which we establish that

(63) $Z(0, |n'|/2) < |n'| \coth\delta$.

On combining (62), (63), we see that the magnitude of the angle filled up at an arbitrary point $Q$ of the region (61) by the directions of the members of $\Delta_{n'}$ passing through $Q$ is less than $|n'| \coth\delta$.

We are now in a position to obtain an upper bound for $|\theta - \theta'|$. Take the member of $\Delta_{n'}$ passing through $P$ ($P'$) which is a segment of the coördinate line


20 Professor Morley has kindly pointed out to me that this result is obtained very neatly by using the methods of inversive geometry.












$v_{n'} = b$ ($v_{n'} = b'$), directed in the sense of increasing $u_{n'}$, and denote its direction at $P$ ($P'$) by $\alpha$ ($\alpha'$). We have

$|\theta - \theta'| \le |\theta - \alpha| + |\alpha - \alpha'| + |\alpha' - \theta'|$.

Now neither of $|\theta - \alpha|$, $|\alpha' - \theta'|$ can exceed $|n'| \coth\delta$, and from (3) we have $|\alpha - \alpha'| < |n'| \sinh\delta$, inasmuch as $\xi < \delta$. Hence

(64) $|\theta - \theta'| < |n'| \{2\coth\delta + \sinh\delta\}$.

From (60), (64) we have

(65) $|PP'| + |\theta - \theta'| < |n'| \{\cosh\delta + 2\coth\delta + \sinh\delta\}$,

so that, on referring to the definition for the distance between $(P, \theta)$, $(P', \theta')$ given in §9, we see that the function $n(\varepsilon)$ defined by

(66) $n(\varepsilon) = \varepsilon \{\cosh\delta + 2\coth\delta + \sinh\delta\}^{-1}$

possesses the desired property.
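As a small numerical illustration of (65) and (66), the sketch below evaluates $n(\varepsilon)$ and confirms that any $|n'| < n(\varepsilon)$ keeps the right-hand member of (65) below $\varepsilon$. The helper name `big_k` and the sample values of $\delta$ and $\varepsilon$ are assumptions made for the example; in the paper $\delta$ is fixed by (11).

```python
import math

def big_k(delta):
    """The factor cosh(delta) + 2*coth(delta) + sinh(delta) appearing in (65)."""
    return math.cosh(delta) + 2.0 / math.tanh(delta) + math.sinh(delta)

def n_of_eps(eps, delta):
    """The function n(eps) of (66)."""
    return eps / big_k(delta)

delta, eps = 1.0, 0.5            # hypothetical sample values
n_prime = 0.9 * n_of_eps(eps, delta)
print(n_prime * big_k(delta) < eps)
```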
12. The upper bound for the ergodic function. Let $\varepsilon$ denote a positive number satisfying the inequalities²¹

(67) $\varepsilon < (\cosh\delta + 2\coth\delta + \sinh\delta)(1 - e^{-2\rho})$,

(68) $\varepsilon < 2(\cosh\delta + 2\coth\delta + \sinh\delta) \log\{\tanh\delta + (1 + \tanh^2\delta)^{1/2}\}$,

where $\rho$, $\delta$ are defined in (10), (11) respectively, and let $r$ denote the positive integer bounded by the inequalities

(69) $\dfrac{1}{\kappa} \log\dfrac{2\sinh\rho}{n(\varepsilon)} \le r < 1 + \dfrac{1}{\kappa} \log\dfrac{2\sinh\rho}{n(\varepsilon)}$.

Inequality (53) follows from (67), (66) and the lower bound in (69). From (68) we find

(70) $\sinh\dfrac{n(\varepsilon)}{2} < \tanh\delta$.

Going back to the construction of $n'$ in (56) and using Lemma 5, we see that

(71) $\rho e^{-r\kappa} < |n'| < 2e^{-r\kappa} \sinh\rho$,

when we place $\lambda + \rho = r\kappa$ in (6). On using (69), we find that (71) is replaced by

(72) $\dfrac{\rho}{2e^{\kappa}\sinh\rho}\, n(\varepsilon) < |n'| < n(\varepsilon)$.
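The choice of $r$ in (69) and its consequences (53), (71), (72) can be traced numerically. The sketch below is illustrative only: the values of $\rho$, $\delta$, $\kappa$ are hypothetical stand-ins for the constants defined in (10), (11), (16), which lie outside this section, and the function name is not from the paper.

```python
import math

def choose_r(eps, delta, rho, kappa):
    """Integer r of (69), together with n(eps) from (66)."""
    k = math.cosh(delta) + 2.0 / math.tanh(delta) + math.sinh(delta)
    n_eps = eps / k
    t = math.log(2.0 * math.sinh(rho) / n_eps) / kappa
    return math.ceil(t), n_eps      # smallest integer r with t <= r < t + 1

eps, delta, rho, kappa = 0.5, 1.0, 1.2, 0.8   # hypothetical sample values
r, n_eps = choose_r(eps, delta, rho, kappa)
upper = 2.0 * math.exp(-r * kappa) * math.sinh(rho)   # upper bound in (71)
lower = rho * math.exp(-r * kappa)                    # lower bound in (71)
print(r, lower, upper, n_eps)
```

With these sample values $\varepsilon$ satisfies (67), so $r\kappa > \rho$ as required by (53), and the interval of (71) lies inside that of (72).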


Now consider a subset $S_M'$ of $M$ and suppose the $\delta$-set from which it is obtained to be based on $n'$. The upper bound in (72) coupled with the results in

21 One of these inequalities probably insures the holding of the other. We need not, however, determine which inequality possesses this property.

























§11 shows that the phase curve generated in $M$ by an arbitrary member of $\Delta_{n'}$ comes within a distance $\varepsilon$ of every point of $S_M'$, the distance between elements of $M$ being defined as in §9.

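$r$ being chosen subject to (69), the number $2m$ ($m = 2pr + 1$) of symbols in (51) is consequently bounded by the inequalities

(73) $2 + \dfrac{4p}{\kappa} \log\dfrac{2\sinh\rho}{n(\varepsilon)} \le 2m < 4p + 2 + \dfrac{4p}{\kappa} \log\dfrac{2\sinh\rho}{n(\varepsilon)}$.

Let $1 + q$ denote the number of sets $S_M'$ occurring when $m$ is chosen as above. Since $1 + q \le (4p)^{2m}$, we have

$1 + q < (4p)^{4p+2} (4p)^{\frac{4p}{\kappa} \log\frac{2\sinh\rho}{n(\varepsilon)}}$,

which reduces to

(74) $1 + q < (4p)^{4p+2} \left(\dfrac{2\sinh\rho}{n(\varepsilon)}\right)^{\frac{4p \log 4p}{\kappa}}$.

From the upper bound in (72) and the inequality (70), it is seen that Lemma 22 may be applied to the $1 + q$ $\delta$-sets introduced above. If we apply Lemma 22, it follows that a directed H-line segment $g_\Phi$ of H-length less than $\lambda_q$ exists which intersects copies of these $1 + q$ $\delta$-sets. From the lower bound in (72) we see that we can set $\gamma = \tfrac{1}{2}\rho e^{-\kappa} n(\varepsilon) \operatorname{csch}\rho$ in the upper bound for $\lambda_q$ as given in the corollary to Lemma 22. On doing this and making use of (74), we find

(75) $\lambda_q < \left[(4p)^{4p+2} \left(\dfrac{2\sinh\rho}{n(\varepsilon)}\right)^{\frac{4p \log 4p}{\kappa}} - 1\right] \left[2\log\dfrac{8\sinh\delta \sinh\rho\, e^{\kappa}}{\rho\, n(\varepsilon)} + 11\delta\right]$.

If the $-1$ in (75) is repressed²² and if $n(\varepsilon)$ is replaced by its value given in (66), the inequality (75) may be replaced by one of the form

(76) $\lambda_q < A \varepsilon^{-\omega} \left[\log\dfrac{B}{\varepsilon} + C\right]$,

where $A$, $B$, $C$ and $\omega$ are given in terms of $p$ as follows:

(77) $A = 2(4p)^{4p+2} \{2\sinh\rho\,(\cosh\delta + 2\coth\delta + \sinh\delta)\}^{\frac{4p \log 4p}{\kappa}}$,

(78) $B = 8\rho^{-1} \sinh\delta \sinh\rho\, e^{\kappa} (\cosh\delta + 2\coth\delta + \sinh\delta)$,

(79) $C = \dfrac{11\delta}{2}$,

(80) $\omega = \dfrac{4p \log 4p}{\kappa}$,

$\kappa$ being defined in (16).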


22 That the first factor in the upper bound given in (75) is positive may be shown on the basis of the assumption (67) for $\varepsilon$.










Since each element of $M$ is contained in at least one of the sets $S_M'$, these results enable us to state the following theorem concerning the phase curve $g_M$ generated in $M$ by the above directed H-line segment $g_\Phi$.

THEOREM. If $\varepsilon$ denotes a positive number selected in accordance with (67) and (68), the phase curve $g_M$ comes within a distance $\varepsilon$ of every element of $M$. The H-length of $g_\Phi$ and therefore that of $g_\varphi$ can be taken less than $\lambda_q$, where $\lambda_q$ is subject to the inequality (76). The right-hand member in (76) is then an upper bound for the ergodic function $T(\varepsilon)$.
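The bound of the theorem can be evaluated directly from (76) to (80). The following sketch assumes the constants as they are read from (77) to (80) above, and uses hypothetical values for $\rho$, $\delta$, $\kappa$ (in the paper these are determined by the genus $p$ through (10), (11), (16)); it is meant only to show that the form (76) agrees with (75) once the $-1$ is dropped.

```python
import math

def ergodic_bound(eps, p, delta, rho, kappa):
    """Right-hand member of (76), with A, B, C, omega taken from (77)-(80)."""
    k = math.cosh(delta) + 2.0 / math.tanh(delta) + math.sinh(delta)
    omega = 4.0 * p * math.log(4.0 * p) / kappa                       # (80)
    a = 2.0 * (4.0 * p) ** (4 * p + 2) * (2.0 * math.sinh(rho) * k) ** omega   # (77)
    b = 8.0 * math.sinh(delta) * math.sinh(rho) * math.exp(kappa) * k / rho    # (78)
    c = 11.0 * delta / 2.0                                            # (79)
    return a * eps ** (-omega) * (math.log(b / eps) + c)

# Hypothetical sample values; p = 2 is the smallest genus considered.
print(ergodic_bound(0.5, 2, 1.0, 1.2, 0.8))
```

The growth in $1/\varepsilon$ is governed by the exponent $\omega$, so the bound is a power of $1/\varepsilon$ multiplied by a logarithmic factor.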

REMARK. The constants $A$, $B$, $C$, $\omega$ in (76) all tend to $+\infty$ for $p \to +\infty$, the constant $\omega$, in particular, being bounded by the inequalities


4p log 4p — 4p log 4p ie 
log (7 + 4/3) ~ log((1 + 9/2) (2 + V2./2 + 1D)’ 


as may readily be verified from the definitions of w and x. 








(81) 


UNIVERSITY OF MARYLAND. 













NOTE ON THE SIMULTANEOUS ORTHOGONALITY OF HARMONIC 
POLYNOMIALS ON SEVERAL CURVES 


By J. L. WALSH AND G. M. MERRIMAN


1. In the plane of the complex variable $z = x + iy$, the polynomials $1, z, z^2, \cdots$ are mutually orthogonal, not merely on the circumference $|z| = 1$, but also on every circumference $|z| = R$, in the sense that

$\displaystyle\int_{|z|=R} z^k\, \bar z^{\,l}\, |dz| = 0, \qquad k \ne l$.
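This orthogonality relation is easy to confirm numerically. The sketch below approximates the integral by an equally spaced Riemann sum on the circle $|z| = R$; the function name and the number of sample points are choices made for the illustration only.

```python
import cmath
import math

def circle_inner(k, l, radius, samples=256):
    """Riemann sum for the integral of z^k * conj(z)^l |dz| over |z| = radius."""
    total = 0j
    for j in range(samples):
        z = radius * cmath.exp(2j * math.pi * j / samples)
        total += z ** k * z.conjugate() ** l
    return total * (2.0 * math.pi * radius / samples)

print(abs(circle_inner(2, 5, 1.7)))   # close to zero, since k differs from l
print(circle_inner(3, 3, 2.0).real)   # equals 2*pi*R^(2k+1) when k = l
```

The vanishing holds for every radius $R$, which is the simultaneous orthogonality that the note sets out to generalize.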


The general problem of the existence of sets of polynomials in $z$ which are simultaneously orthogonal, with respect to suitable norm functions, on each of several curves in the $z$-plane has been studied only recently. Let us say that the set $p_k(z)$ of polynomials in $z$ is canonical on a rectifiable Jordan curve $C$ with respect to the norm function $n(z)$ provided the set $p_k(z)$ is found by orthogonalization on $C$ of the set $1, z, z^2, \cdots$ with respect to the positive continuous norm function $n(z)$, and provided the coefficient of $z^k$ in $p_k(z)$ is chosen positive. Walsh established¹ the orthogonality with respect to a suitable norm function of certain Tchebycheff polynomials on all ellipses of a given confocal family. Szegö² and Walsh³ showed independently and by widely different methods the fact that if the same set of polynomials $p_k(z)$ is canonical on two distinct curves $C$ and $C'$, then either $C'$ is a curve $C_R$ or $C$ is a curve $C'_R$;⁴ Szegö requires analyticity of $C$ and $C'$. [Let $C$ be an arbitrary Jordan curve in the $z$-plane, and let the function $z = \psi(w)$ map the exterior of $C$ onto the exterior of the unit circle $|w| = 1$ in the $w$-plane so that the points at infinity in the two planes correspond to each other. We denote generically by $C_R$ the image (Kreisbild) in the $z$-plane of the circle $|w| = R > 1$ under this transformation.] Moreover, Szegö⁵ exhibited all sets of polynomials in $z$, each set canonical simultaneously on all $C_R$ of a given family, $1 < R < \infty$.⁶ The general problem of the existence of sets of polynomials canonical simultaneously on only two curves


Received October 20, 1936; presented to the American Mathematical Society, Decem- 
ber 1936. 

1 Bull. Am. Math. Soc., vol. 40 (1934), pp. 84-88. Also Interpolation and Approximation, New York, 1935, p. 134, Theorem 12.

2 Trans. Am. Math. Soc., vol. 37 (1935), pp. 196-206.

3 Interpolation and Approximation, p. 134, Theorem 11.

4 The analogous result for harmonic polynomials follows directly by the methods of Walsh (loc. cit. and Trans. Am. Math. Soc., vol. 33 (1931), pp. 370-388, especially p. 385).

5 Loc. cit.

6 These sets are enumerated in §2, below.




has not been solved; the second-named writer has, however, some as yet unpublished results on this problem.⁷

The entire theory of expansions in harmonic polynomials⁸ in $x$ and $y$ is analogous to, but by no means identical with, the theory of expansions in polynomials in $z$.⁹ It is thus in order to study the general problem of sets of harmonic polynomials orthogonal, with respect to suitable norm functions, simultaneously on several rectifiable Jordan curves; and it is the object of the present note to establish the more immediate results concerning this problem.

We investigate the sets of harmonic polynomials which are obtained by 
separating into real and pure imaginary parts the sets of polynomials in z 
which are canonical, with respect to suitable norm functions, on all curves Cz 
of a given family F. Some of these sets of harmonic polynomials are orthogonal, 
with respect to the same norm functions that were used in connection with the 
corresponding polynomials in z, simultaneously on every Cz of F; the remaining 
sets are orthogonal on no Cp. 

Besides this surprising and varied deviation of the facts concerning the orthogonality of these harmonic polynomials from the facts concerning the orthogonality of their generating polynomials in $z$, there is a further interesting consequence of our discussion: a new property of the orthogonal polynomials in $z$ is derived, which in some cases yields a new formula for the polynomial expansion of an analytic function.


2. We list for reference the sets of polynomials in $z$ which are known to be simultaneously canonical, with respect to suitable norm functions, on all curves $C_R$ of the given families, $1 < R < \infty$; from these sets we derive the sets of harmonic polynomials to be studied. Where the basic curve $C$ is not a circle, we enumerate also the transforms of the polynomials and norm functions under the mapping function $z = \psi(w)$ defined in §1; we denote by $\Gamma_R$ the circle $|w| = R$, the image of $C_R$. The real and positive norm function can be expressed in the form $|D(z)|^2 \equiv |\Delta(w)|^2$, $z = \psi(w)$, where $D(z)$ $[\equiv \Delta(w)]$ is known to be analytic and non-vanishing in the extended $z$- [or $w$-] plane outside $C$ [the unit circle $|w| = 1$].¹⁰ In all cases the norm function on a curve $C_R$ may be altered by multiplication by a positive constant, and the entire configuration may be subjected to an arbitrary linear integral transformation. The orthogonality condition is chosen in the form

$\displaystyle\int_{C_R} p_k(z)\, \overline{p_l(z)}\, |D(z)|^2\, |dz| = \int_{\Gamma_R} p_k(z)\, \overline{p_l(z)}\, n(w)\, |dw| = 0, \qquad k \ne l, \quad z = \psi(w)$,

7 Presented to the American Mathematical Society, September, 1936; see Bull. Am. Math. Soc., vol. 42 (1936), Abstract 279.

8 For a report on this theory, see for instance Walsh, Bull. Am. Math. Soc., vol. 35 (1929), pp. 499-544.

9 For example, an arbitrary function continuous on a circumference $C$ can be uniformly expanded on $C$ in harmonic polynomials; the analogous proposition for polynomials in $z$ is false.

10 Szegö, loc. cit., Theorem 1.

























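where $n(w) = |\Delta(w)|^2\, |\psi'(w)|$ serves as the norm function in the $w$-plane.

The list (Szegö, loc. cit.) follows:

I. The set $C_R$ is the set of concentric circles $|z| = R > 0$; $D(z) = 1$; $p_k(z) = z^k$.

II.¹¹ The set $C_R$ is the set of concentric circles $|z| = R > 1$; $D(z) = (1 - z^{-\mu})^{-1}$, $\mu$ a positive integer; $|D(z)|^2 = \dfrac{R^{2\mu} z^{\mu}}{(z^{\mu} - 1)(R^{2\mu} - z^{\mu})}$; $p_k(z) = z^k$ for $0 \le k < \mu$; $p_k(z) = z^{k-\mu}(z^{\mu} - 1)$ for $k \ge \mu$.

III.¹² The set $C_R$ is the set of confocal ellipses, foci $\pm 1$; $D(z) = \{z + (z^2 - 1)^{1/2}\}^{1/2} (z^2 - 1)^{-1/4}$; $z = \psi(w) = \tfrac{1}{2}(w + w^{-1})$; $\Delta(w) = \{\tfrac{1}{2}(1 - w^{-2})\}^{-1/2}$; $n(w) = 1$; $p_k(z) = w^k + w^{-k}$.

IV. The set $C_R$ and $z = \psi(w)$ as in III; $D(z) = \{z + (z^2 - 1)^{1/2}\}^{-1/2} (z^2 - 1)^{1/4}$; $\Delta(w) = \{\tfrac{1}{2}(1 - w^{-2})\}^{1/2}$; $n(w) = \dfrac{(w^2 - 1)(R^4 - w^2)}{4R^4 w^2}$; $p_k(z) = \dfrac{w^{k+1} - w^{-k-1}}{w - w^{-1}}$.

V. The set $C_R$ and $z = \psi(w)$ as in III; $D(z) = (z - 1)^{1/4} (z + 1)^{-1/4}$; $\Delta(w) = (1 - w^{-1})^{1/2} (1 + w^{-1})^{-1/2}$; $n(w) = \dfrac{(w - 1)(R^2 - w)}{2R^2 w}$; $p_k(z) = \dfrac{w^{k+1/2} - w^{-k-1/2}}{w^{1/2} - w^{-1/2}}$.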
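The simultaneous orthogonality asserted for an entry of this list can be tested numerically. The sketch below treats set II only, assuming $D(z) = (1 - z^{-\mu})^{-1}$ and the polynomials $p_k(z)$ as listed there; the function names, the choice $\mu = 2$ and the two radii are illustrative assumptions.

```python
import cmath
import math

def p(k, z, mu):
    """Polynomial p_k of set II: z^k for k < mu, else z^(k-mu) * (z^mu - 1)."""
    return z ** k if k < mu else z ** (k - mu) * (z ** mu - 1)

def weight(z, mu):
    """Norm function |D(z)|^2 with D(z) = (1 - z^(-mu))^(-1)."""
    return 1.0 / abs(1 - z ** (-mu)) ** 2

def inner(k, l, radius, mu, samples=512):
    """Riemann sum for the integral of p_k * conj(p_l) * |D|^2 |dz| on |z| = radius."""
    total = 0j
    for j in range(samples):
        z = radius * cmath.exp(2j * math.pi * j / samples)
        total += p(k, z, mu) * p(l, z, mu).conjugate() * weight(z, mu)
    return total * (2.0 * math.pi * radius / samples)

for radius in (1.5, 3.0):
    worst = max(abs(inner(k, l, radius, 2))
                for k in range(5) for l in range(5) if k != l)
    print(radius, worst)
```

The off-diagonal sums are negligible on both circles, in agreement with the statement that the same set is canonical on every $C_R$ of the family.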

We shall suffix the letter H (e.g., I$_H$) to the above numerals I-V to designate the harmonic polynomials derived from the sets I-V by separation into real and pure imaginary parts. We reiterate that in the cases III, IV, V the notation $n(w)$ does not represent merely the transform of the norm function $n(z) = |D(z)|^2$ in the $z$-plane, but represents $n(z)\, |\psi'(w)| = |\Delta(w)|^2\, |\psi'(w)|$, the norm function in the $w$-plane.


3. It would perhaps seem most natural to offer real-variable proofs of our results concerning the harmonic polynomials I$_H$-V$_H$, using trigonometric forms of both polynomials and norm functions. Although such proofs exist, it seems
simpler and more fruitful to carry out the discussion by methods of complex 
variables. For this purpose we first proceed to obtain some general theorems 
concerning orthogonality of polynomials in z and of the harmonic polynomials 
which result from separating them into real and pure imaginary parts. 

Let the set of polynomials $\Sigma$: $\{p_k(z)\}$, canonical or not, be orthogonal with respect to the positive continuous norm function $n(z)$, on a given rectifiable Jordan curve $C$:

(1)  $\int_C p_k(z)\,\overline{p_l(z)}\,n(z)\,|dz| = 0, \qquad k \neq l.$

If each polynomial $p_k(z)$ is separated into real and pure imaginary parts, $p_k = p_k' + i p_k''$, we obtain a set of polynomials in $x$ and $y$ each of which is harmonic in the entire plane:


¹¹ This set was exhibited by Szegő (Math. Ann., vol. 79 (1919), pp. 323-339), but without mention at that time of orthogonality on more than one curve. Compare Walsh, Mémorial des sciences math., Fasc. 73, p. 43.
¹² This is the Tchebycheff set proved by Walsh to be orthogonal simultaneously on all $C_R$.











282 J. L. WALSH AND G. M. MERRIMAN 


$S$:  $\{p_0', p_1', p_2', \cdots;\ p_0'', p_1'', p_2'', \cdots\}$.


If $p_0(z)$ is real and of degree zero, which necessarily occurs if the set $\Sigma$ is canonical, the polynomial $p_0''$ vanishes identically and may be omitted here. An arbitrary set of form $S$ is orthogonal on $C$ with respect to the norm function $n(z)$, provided that the three sets of conditions

(2)  $\int_C p_k' p_l'\,n(z)\,|dz| = 0 \ (k \neq l), \qquad \int_C p_k'' p_l''\,n(z)\,|dz| = 0 \ (k \neq l), \qquad \int_C p_k' p_l''\,n(z)\,|dz| = 0$

are satisfied. Under the present assumption of the orthogonality of the set $\Sigma$, the second conditions in (2) are satisfied when and only when the first conditions are satisfied; thus, for harmonic polynomials obtained by separation of orthogonal polynomials in $z$ into real and pure imaginary parts, the orthogonality conditions (2) are equivalent to

(3)  $\int_C p_k' p_l'\,n(z)\,|dz| = 0$ for $k \neq l$, $\qquad \int_C p_k' p_l''\,n(z)\,|dz| = 0.$


We observe that if conditions (1) and (3) are satisfied, there are also satisfied the conditions

(4)  $\int_C p_k(z)\,p_l(z)\,n(z)\,|dz| = 0, \qquad k \neq l.$

Conversely, if both (1) and (4) are satisfied, and if in addition

(5)  $\int_C p_k' p_l''\,n(z)\,|dz| = 0, \qquad k = l,$

we may infer the satisfaction of conditions (3). Conditions (5) may be replaced by a condition on the pure imaginary component of an integral:

(6)  $\Im\left\{\int_C [p_k(z)]^2\,n(z)\,|dz|\right\} = 0, \qquad k = 0, 1, 2, \cdots.$


We may thus state 

THEOREM 1. If the set $\{p_k\}$ of polynomials in $z$ is orthogonal on a rectifiable curve $C$ with respect to the positive continuous norm function $n(z)$, then it is necessary and sufficient for the orthogonality on $C$ with respect to $n(z)$ of the set of harmonic polynomials $(p_k', p_k'')$ obtained by separating each $p_k(z)$ into real and pure imaginary parts that conditions (4) and (6) [or (5)] be satisfied; that is to say, it is necessary and sufficient that the integral

$$J = \int_C p_k(z)\,p_l(z)\,n(z)\,|dz|$$

vanish for $k \neq l$ and be real for $k = l$.

It is easy to prove, by combining formally conditions (2) so as to yield 
conditions (1), 

THEOREM 2. If the set $\{p_k', p_k''\}$ of harmonic polynomials, where $p_k'$ and $p_k''$ are conjugate harmonic functions, is orthogonal on a rectifiable curve $C$ with respect to a positive continuous norm function $n(z)$, then the set $\{p_k = p_k' + i p_k''\}$ of polynomials in $z$ is also orthogonal on $C$ with respect to $n(z)$.

It is Theorem 1 to which we shall appeal in our study of the orthogonality properties of the harmonic polynomials $\mathrm{I}_H$–$\mathrm{V}_H$. The integral $J$, however, is
also of interest and importance in connection with an expansion problem. Let 
the function f(z) be analytic on and within C; the formal expansion of f(z) 
on C in terms of the polynomials {p,(z)}, now assumed canonical on C, is 


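(7)  $f(z) \sim \sum_{j=0}^{\infty} a_j\,p_j(z),$

(8)  $a_j = \int_C f(z)\,\overline{p_j(z)}\,n(z)\,|dz| \Big/ \int_C p_j(z)\,\overline{p_j(z)}\,n(z)\,|dz|, \qquad j = 0, 1, 2, \cdots;$

of course we have the relation

(9)  $\int_C p_j(z)\,\overline{p_j(z)}\,n(z)\,|dz| \neq 0, \qquad j = 0, 1, 2, \cdots.$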


Under the present hypothesis, the relation (7) is an actual equation, valid uniformly on and within $C$.¹³

If now conditions (4) are also satisfied by the set $\{p_k(z)\}$, we can obtain an alternative expression for the coefficients $a_j$ valid under certain circumstances. Multiplication of equation (7) by $p_j(z)\,n(z)$ and integration term by term over $C$ yields

(10)  $a_j = \int_C f(z)\,p_j(z)\,n(z)\,|dz| \Big/ \int_C [p_j(z)]^2\,n(z)\,|dz|, \qquad j = 0, 1, 2, \cdots,$

provided of course that

(11)  $\int_C [p_j(z)]^2\,n(z)\,|dz| \neq 0, \qquad j = 0, 1, 2, \cdots.$

We collect these results as

¹³ This theorem is due to Szegő in the case $n(z) = 1$ with $C$ analytic, to Smirnoff in the case $n(z) = 1$ with $C$ rectifiable and satisfying an auxiliary condition, and to Walsh in the general case that both $n(z)$ and $C$ are arbitrary. See Walsh, Interpolation and Approximation, §5.2.











THEOREM 3. If the set of polynomials $\{p_k(z)\}$ is canonical on $C$ with respect to a norm function $n(z)$, and also satisfies conditions (4) and (11) (that is to say, if the integral $J$ vanishes for $k \neq l$ but not for $k = l$), then an arbitrary function $f(z)$ analytic on and within $C$ has the formal expansion (7), where $a_j$ is given both by (8) and by (10). The expansion (7) is valid uniformly on and within $C$.


4. The general results established in §3 will now be applied to the study of the orthogonality with respect to the given norm functions, of the sets $\mathrm{I}_H$–$\mathrm{V}_H$ of harmonic polynomials, on all $C_R$ of the given families of curves. The computation will be carried out in the plane in which the family of curves is a set of concentric circles, that is, after the usual mapping $z = \psi(w)$ in cases $\mathrm{III}_H$–$\mathrm{V}_H$. Let $\Gamma_R$ designate the circle $|w| = R > 1$; $\Gamma_R$ coincides with $C_R$ in cases $\mathrm{I}_H$ and $\mathrm{II}_H$ and is its image in cases $\mathrm{III}_H$–$\mathrm{V}_H$.

In case $\mathrm{I}_H$,

$$J = \int_{C_R} z^{k+l}\,|dz| = \frac{R}{i}\int_{\Gamma_R} z^{k+l-1}\,dz.$$

Hence $J = 0$ for all choices of $k$ and $l$ except $k = l = 0$; in the latter case, however, equation (5) is satisfied, since $p_0$ is real. Thus conditions (4) and (6) of Theorem 1 are completely satisfied, and we have established the well-known fact that the harmonic polynomials are orthogonal simultaneously on all $C_R$. But condition (11) is not satisfied, so formulas (10) for the coefficients $a_j$ are not valid.

In case $\mathrm{III}_H$ (the study of $\mathrm{II}_H$ is more complex and we postpone it temporarily),

$$J = \int_{C_R} (w^k + w^{-k})(w^l + w^{-l})\,|dw| = \frac{R}{i}\int_{\Gamma_R} \left[w^{k+l-1} + w^{k-l-1} + w^{-k+l-1} + w^{-k-l-1}\right] dw.$$

Hence $J = 0$ immediately if $k \neq l$, so conditions (4) are satisfied. If $k = l$, we have

$$J = \frac{cR}{i}\int_{\Gamma_R} \frac{dw}{w} = 2\pi c R,$$

where $c = 4$ if $k = l = 0$; otherwise $c = 2$. Since this value of $J$ is real and non-vanishing, conditions (6) and (11) are satisfied. We conclude from Theorem 1 that the set $\mathrm{III}_H$ is orthogonal simultaneously on every $C_R$, and from Theorem 3 that formulas (10) for the coefficients $a_j$ in (7) are valid.¹⁴
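The evaluation of $J$ in case III can be checked numerically. The following is a minimal sketch, not part of the original argument: the function name `J_case_III` and the sample count are illustrative choices, and the integral over $|w| = R$ is approximated by an equally spaced sum in the angle, which is exact for trigonometric polynomials of degree below the number of samples.

```python
import cmath
import math

def J_case_III(k, l, R, samples=256):
    """Approximate J = integral over |w| = R of (w^k + w^-k)(w^l + w^-l) |dw|."""
    total = 0j
    for s in range(samples):
        w = R * cmath.exp(2j * math.pi * s / samples)
        total += (w**k + w**(-k)) * (w**l + w**(-l))
    # On the circle |dw| = R d(theta); each sample carries weight 2*pi*R/samples.
    return total * R * 2 * math.pi / samples

R = 1.7
print(abs(J_case_III(2, 5, R)))                   # vanishes for k != l
print(J_case_III(3, 3, R).real, 4 * math.pi * R)  # c = 2 when k = l > 0
print(J_case_III(0, 0, R).real, 8 * math.pi * R)  # c = 4 when k = l = 0
```

The same routine can be reused for other radii $R > 1$; the value for $k = l$ depends on $R$ only through the factor $2\pi c R$.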


¹⁴ A form of orthogonality other than (1) and (4) is also of interest here, namely

$$\int_{L} Q(z)\,p_k(z)\,p_l(z)\,dz = 0, \qquad k \neq l,$$

where $Q(z)$ is suitably chosen. See Geronimus, Trans. Amer. Math. Soc., vol. 33 (1931), pp. 322-328.


























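5. Among the sets ($\alpha = 1, 2, \cdots$) of polynomials $\mathrm{II}_H$, the sets $\mathrm{II}_H$, $\alpha > 2$, are orthogonal on no $C_R$; the sets $\mathrm{II}_H$, $\alpha = 1$ or $2$, are orthogonal on every $C_R$.

We have the following three expressions for $J$ for the three possible choices of $k$ and $l$ relative to $\alpha$:

$$J_1 = \frac{R^{2\alpha+1}}{i}\int_{\Gamma_R} \frac{z^{k+l-\alpha-1}(z^{\alpha} - 1)}{R^{2\alpha} - z^{\alpha}}\,dz, \qquad k \ge \alpha,\ l \ge \alpha;$$

$$J_2 = \frac{R^{2\alpha+1}}{i}\int_{\Gamma_R} \frac{z^{k+l-1}}{R^{2\alpha} - z^{\alpha}}\,dz, \qquad k \ge \alpha,\ l < \alpha;$$

$$J_3 = \frac{R^{2\alpha+1}}{i}\int_{\Gamma_R} \frac{z^{k+l+\alpha-1}}{(z^{\alpha} - 1)(R^{2\alpha} - z^{\alpha})}\,dz, \qquad k < \alpha,\ l < \alpha.$$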


The integrals $J_1$ and $J_2$ both vanish, for all pertinent choices of $k$ and $l$, by Cauchy's integral theorem, since there are no singularities of the integrands interior to $\Gamma_R$. The integral $J_3$, however, possesses singularities at points inside $\Gamma_R$, namely at the $\alpha$-th roots of unity; we evaluate $J_3$ in the following manner.


LEMMA 1. Let $\omega$ be a primitive $\alpha$-th root of unity. The sum $\sum_{m=1}^{\alpha} \omega^{mq}$ has the value $\alpha$ or $0$ according as $q$ is or is not a multiple of $\alpha$.

The proof is left to the reader.
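Lemma 1 is easily confirmed numerically for small cases. The sketch below is only an illustration under the assumption that the primitive root is taken as $\omega = e^{2\pi i/\alpha}$; the function name `root_power_sum` is hypothetical.

```python
import cmath
import math

def root_power_sum(alpha, q):
    """Sum of omega**(m*q) for m = 1..alpha, with omega = exp(2*pi*i/alpha)."""
    omega = cmath.exp(2j * math.pi / alpha)
    return sum(omega ** (m * q) for m in range(1, alpha + 1))

for alpha, q in [(5, 10), (5, 3), (6, 4), (6, -12)]:
    expected = alpha if q % alpha == 0 else 0
    print(alpha, q, round(abs(root_power_sum(alpha, q)), 9), expected)
```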

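The value of the integral $J_3$, aside from the constant factor, is the sum of the residues of the integrand at the points $\omega, \omega^2, \cdots, \omega^{\alpha} = 1$. At the point $\omega^m$ the residue is

$$\left[\frac{z^{k+l+\alpha-1}(z - \omega^m)}{(R^{2\alpha} - z^{\alpha})(z^{\alpha} - 1)}\right]_{z=\omega^m} = \left[\frac{z^{k+l+\alpha-1} + (k + l + \alpha - 1)(z - \omega^m)\,z^{k+l+\alpha-2}}{\alpha(R^{2\alpha} - z^{\alpha})\,z^{\alpha-1} - \alpha(z^{\alpha} - 1)\,z^{\alpha-1}}\right]_{z=\omega^m} = \frac{\omega^{m(k+l)}}{\alpha(R^{2\alpha} - 1)}.$$

Hence the integral $J_3$ has the value

(12)  $J_3 = \dfrac{2\pi R^{2\alpha+1}}{\alpha(R^{2\alpha} - 1)} \sum_{m=1}^{\alpha} \omega^{m(k+l)} = \begin{cases} 0, & \text{if } k + l \text{ is not a multiple of } \alpha, \\ 2\pi R^{2\alpha+1}/(R^{2\alpha} - 1), & \text{if } k + l \text{ is a multiple of } \alpha. \end{cases}$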


Concerning the polynomials $\mathrm{II}_H$, $\alpha = 1$, we conclude immediately that the conditions of Theorem 1 are completely satisfied: $J_1 = 0$, $J_2 = 0$ in all circumstances; the integral $J_3$ is of significance only when $k = l = 0$, in which case the condition on $J_3$ is (5), fulfilled by virtue of the reality of $p_0$. These polynomials $\mathrm{II}_H$, $\alpha = 1$, are therefore orthogonal simultaneously on every $C_R$. On the other hand, conditions (11) fail for $j \ge \alpha$, so Theorem 3 is of no significance here.

We turn to the set $\mathrm{II}_H$, $\alpha = 2$. By (12) we have $J_3 = 0$ for $k = 1$, $l = 0$; and $J_3$ is real for $k = l = 0$ or $1$. These are the only cases not covered by $J_1 = J_2 = 0$; thus the conditions of Theorem 1 are satisfied, and the polynomials $\mathrm{II}_H$, $\alpha = 2$, are orthogonal simultaneously on every $C_R$. Again Theorem 3 fails to be applicable.

But if we examine any set $\mathrm{II}_H$, $\alpha > 2$, there always exist pairs of polynomials of the set which fail to satisfy conditions (4). For example if we choose $k = \alpha - 1$, $l = 1$, we find from (12) that $J_3 \neq 0$, $k \neq l$; hence conditions (4) of Theorem 1 fail to be satisfied, so the polynomials $\mathrm{II}_H$, $\alpha > 2$, are mutually orthogonal on no $C_R$.
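The failure of conditions (4) for $\alpha > 2$ can be observed numerically. The sketch below rests on two assumptions read off from the list of sets and the trigonometric table: the polynomials of case II are $z^k$ for $k < \alpha$ and $z^{k-\alpha}(z^{\alpha} - 1)$ for $k \ge \alpha$, and the norm function on $|z| = R$ is $R^{2\alpha}/|z^{\alpha} - 1|^2$. The function name `J_case_II` is hypothetical.

```python
import cmath
import math

def J_case_II(k, l, alpha, R, samples=4096):
    """Approximate J = integral over |z| = R of p_k(z) p_l(z) n(z) |dz| for case II."""
    def p(j, z):
        return z**j if j < alpha else z**(j - alpha) * (z**alpha - 1)

    total = 0j
    for s in range(samples):
        z = R * cmath.exp(2j * math.pi * s / samples)
        n = R**(2 * alpha) / abs(z**alpha - 1) ** 2  # assumed norm function
        total += p(k, z) * p(l, z) * n
    return total * R * 2 * math.pi / samples

alpha, R = 3, 2.0
predicted = 2 * math.pi * R**(2 * alpha + 1) / (R**(2 * alpha) - 1)
print(J_case_II(alpha - 1, 1, alpha, R).real, predicted)  # non-zero although k != l
print(abs(J_case_II(1, 1, alpha, R)))                     # k + l not a multiple of alpha
print(abs(J_case_II(4, 3, alpha, R)))                     # k, l >= alpha
```

With $\alpha = 3$, $k = 2$, $l = 1$ the computed integral agrees with the second line of (12) and does not vanish, which is the obstruction described above.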


6. We turn now to the proof of the orthogonality of the set $\mathrm{IV}_H$ on all $C_R$. We have

$$J = \frac{1}{4R^3 i}\int_{C_R} \frac{w^{k+1} - w^{-k-1}}{w - w^{-1}} \cdot \frac{w^{l+1} - w^{-l-1}}{w - w^{-1}} \cdot \frac{(w^2 - 1)(R^4 - w^2)}{w^3}\,dw = \frac{1}{4R^3 i}\int_{C_R} \frac{(w^{2k} + w^{2k-2} + \cdots + 1)(w^{2l+2} - 1)(R^4 - w^2)}{w^{k+l+3}}\,dw.$$

We expand the numerator, and evaluate the corresponding integrals separately. Suppose first $k \neq l$; it is no loss of generality to assume, as we do, $l > k$. The terms of $J$ resulting from

$$(w^{2k} + w^{2k-2} + \cdots + 1)\,w^{2l+2}\,(R^4 - w^2)$$

vanish because in this product each exponent of $w$ is greater than $k + l + 2$; we have $2l + 2 > k + l + 2$. The terms of $J$ resulting from

$$(w^{2k} + w^{2k-2} + \cdots + 1)(R^4 - w^2)$$

vanish because in this product each exponent of $w$ is less than $k + l + 2$; we have $k + l + 2 > 2k + 2$. Hence conditions (4) are satisfied. Suppose next $k = l$; the only terms in $J$ which do not obviously vanish are those contributed by the terms $R^4 w^{2k+2}$ and $w^{2k+2}$ in the numerator of the integrand. We have

$$J = \frac{R^4 + 1}{4R^3 i}\int_{C_R} \frac{dw}{w} = \frac{\pi(R^4 + 1)}{2R^3}.$$

Hence conditions (6) of Theorem 1 are satisfied, so the set $\mathrm{IV}_H$ is orthogonal simultaneously on every $C_R$. Conditions (11) of Theorem 3 are also satisfied, so that theorem is applicable.
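A numerical check of case IV is sketched below. It assumes the norm function in the $w$-plane is $n(w) = |w^2 - 1|^2/4R^4$ on $|w| = R$ (the form consistent with the list of sets in §2) and that $p_k = (w^{k+1} - w^{-k-1})/(w - w^{-1})$; the function names are illustrative only.

```python
import cmath
import math

def p_case_IV(k, w):
    """Polynomial of case IV expressed in the variable w."""
    return (w**(k + 1) - w**(-(k + 1))) / (w - 1 / w)

def J_case_IV(k, l, R, samples=2048):
    """Approximate J = integral over |w| = R of p_k p_l n(w) |dw|."""
    total = 0j
    for s in range(samples):
        w = R * cmath.exp(2j * math.pi * (s + 0.5) / samples)
        n = abs(w * w - 1) ** 2 / (4 * R**4)  # assumed norm function n(w)
        total += p_case_IV(k, w) * p_case_IV(l, w) * n
    return total * R * 2 * math.pi / samples

R = 1.5
print(abs(J_case_IV(1, 3, R)))                                    # k != l
print(J_case_IV(2, 2, R).real, math.pi * (R**4 + 1) / (2 * R**3))  # k = l
```

Under these assumptions the integral vanishes for $k \neq l$ and is real and non-vanishing for $k = l$, which is what Theorems 1 and 3 require.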

Methods similar to these serve to prove that the set $\mathrm{V}_H$ is orthogonal simultaneously on every $C_R$, and that Theorem 3 is applicable; we omit further details.


7. One further pertinent deduction is to be made, from Theorem 2 of §3. In view of our study of the orthogonality, on all $C_R$, of the sets of harmonic polynomials $\mathrm{I}_H$–$\mathrm{V}_H$, and in view of the fact that the polynomials I–V are the only sets of polynomials in $z$ which are canonical simultaneously on all $C_R$ of a given family, Theorem 2 yields the conclusion that the sets $\mathrm{I}_H$, $\mathrm{II}_H$ for $\alpha = 1$ or $2$, $\mathrm{III}_H$–$\mathrm{V}_H$ are the only sets of harmonic polynomials orthogonal simultaneously on every $C_R$ of a given family which can be obtained by separation into real and pure imaginary parts of canonical sets of polynomials in $z$.

The case of the polynomials $\mathrm{II}_H$, $\alpha > 2$, illustrates the falsity of the converse of Theorem 2.


8. We remark that the results just proved on the orthogonality of the sets $\mathrm{I}_H$–$\mathrm{V}_H$ can also be established by the use of real variables. Various integrals that present themselves are analogous to Poisson's integral, and can be evaluated at once by identifying the given integrals with Poisson's integral for suitably chosen integrands and suitably chosen values of the parameters; in each case the corresponding Dirichlet problem has for its solution a fairly obvious harmonic polynomial. It might be supposed that the real-variable proofs are preferable and more natural in the study of harmonic functions, but as a matter of fact the methods of the complex variable as we have used them are both more fruitful (see for example our Theorem 3) and more illuminating as to the general structure of the harmonic polynomials studied (see conditions (4) and (6) of Theorem 1).

For the sake of reference we list, in trigonometric form, the harmonic polynomials $\mathrm{I}_H$–$\mathrm{V}_H$, together with their respective norm functions:

$\mathrm{I}_H$. $p_k'(z) = R^k \cos k\theta$, $p_k''(z) = R^k \sin k\theta$; $n(z) = 1$;
$z = R(\cos\theta + i\sin\theta)$.

$\mathrm{II}_H$. $p_k'(z) = R^k \cos k\theta$, $p_k''(z) = R^k \sin k\theta$, for $0 \le k < \alpha$, $\alpha$ a positive integer;
$p_k'(z) = R^k \cos k\theta - R^{k-\alpha} \cos (k - \alpha)\theta$,
$p_k''(z) = R^k \sin k\theta - R^{k-\alpha} \sin (k - \alpha)\theta$, for $k \ge \alpha$;
$n(z) = R^{2\alpha}/(1 - 2R^{\alpha} \cos \alpha\theta + R^{2\alpha})$; $z = R(\cos\theta + i\sin\theta)$.

$\mathrm{III}_H$. $p_k'(z) = (R^k + R^{-k}) \cos k\theta$, $p_k''(z) = (R^k - R^{-k}) \sin k\theta$;
$n(w) = 1$; $w = R(\cos\theta + i\sin\theta)$; $z = \psi(w) = \tfrac12(w + w^{-1})$.

$\mathrm{IV}_H$. $p_k'(z) = R^2\{(R^{k+1} - R^{-k-1})(R - R^{-1}) \cos (k + 1)\theta \cos\theta + (R^{k+1} + R^{-k-1})(R + R^{-1}) \sin (k + 1)\theta \sin\theta\}/(1 - 2R^2 \cos 2\theta + R^4)$;
$p_k''(z) = R^2\{(R^{k+1} + R^{-k-1})(R - R^{-1}) \sin (k + 1)\theta \cos\theta - (R^{k+1} - R^{-k-1})(R + R^{-1}) \cos (k + 1)\theta \sin\theta\}/(1 - 2R^2 \cos 2\theta + R^4)$;
$n(w) = (1 - 2R^2 \cos 2\theta + R^4)/4R^4$; $w = R(\cos\theta + i\sin\theta)$.

$\mathrm{V}_H$. $p_k'(z) = R\{(R^{k+1/2} - R^{-k-1/2})(R^{1/2} - R^{-1/2}) \cos (k + \tfrac12)\theta \cos \tfrac12\theta + (R^{k+1/2} + R^{-k-1/2})(R^{1/2} + R^{-1/2}) \sin (k + \tfrac12)\theta \sin \tfrac12\theta\}/(1 - 2R \cos\theta + R^2)$;
$p_k''(z) = R\{(R^{k+1/2} + R^{-k-1/2})(R^{1/2} - R^{-1/2}) \sin (k + \tfrac12)\theta \cos \tfrac12\theta - (R^{k+1/2} - R^{-k-1/2})(R^{1/2} + R^{-1/2}) \cos (k + \tfrac12)\theta \sin \tfrac12\theta\}/(1 - 2R \cos\theta + R^2)$;
$n(w) = (1 - 2R \cos\theta + R^2)/2R^2$; $w = R(\cos\theta + i\sin\theta)$.


HARVARD UNIVERSITY AND THE UNIVERSITY OF CINCINNATI.

















CERTAIN TERNARY CUBIC ARITHMETICAL FORMS 
By E. T. BELL


1. By an obvious oversight, G. B. Mathews stated (without restriction on the integer $n$) that if the integer $m$ is representable by integers $x, y, z$ in the form

$$x^3 + ny^3 + n^2z^3 - 3nxyz,$$

it can be represented in an infinity of ways.¹ If $n$ is the cube of a rational integer, and $m \neq 0$, the number of representations, if any, is finite. We shall show how these representations may be found. In certain simple cases (§5) it is possible to find the exact number of representations; in all cases we give an upper bound (§4) for this number.


2. We consider the representations by integers $x, y, z$ of the integer $m$ in the form

(1)  $x^3 + t^3y^3 + t^6z^3 - 3t^3xyz,$

where $t$ is a constant integer $\neq 0$. If $m = 0$, we have the infinity of representations $(x, y, z) = (ht^2, ht, h)$, where $h$ is an arbitrary integer (and possibly further representations).

Henceforth we shall take $m \neq 0$. Let $m = d\delta$, where $d, \delta$ are integers. From (1) we have

$$(x + ty + t^2z)(x^2 + t^2y^2 + t^4z^2 - txy - t^2xz - t^3yz) = d\delta;$$

hence we may take

(2)  $x + ty + t^2z = d,$

and equate the second factor to $\delta$. Replacing $t^2z$ by $d - x - ty$, we get

$$3(x^2 + txy + t^2y^2 - dx - dty) + d^2 - \delta = 0.$$

Thus

(3)  $d^2 - \delta \equiv 0 \bmod 3; \qquad d^2 - \delta = 3h,$

where $h$ is an integer, and

(4)  $x^2 + (ty - d)x + t^2y^2 - dty + h = 0.$


Received November 19, 1936. 
¹ G. B. Mathews, Proceedings of the London Mathematical Society, vol. 21 (1891), pp. 280-7; see also Dickson's History of the Theory of Numbers, vol. 2, 1920, p. 594. In applying Dirichlet's theory of representations by norms of algebraic integers, Mathews omitted to state, loc. cit., p. 281 (as he evidently intended) that the real cube root of the $n$ he is considering must be irrational.











290 E. T. BELL 

In order that (4) shall have an integer root $x$ it is necessary that the discriminant be an integer square $a^2$; whence

(5)  $3t^2y^2 - 2dty + a^2 + 4h - d^2 = 0;$

and in order that (5) shall have an integer root $y$ it is necessary that the discriminant be an even integer square $4b_1^2$; thus

$$d^2t^2 - 3t^2(a^2 + 4h - d^2) = b_1^2.$$

It follows that $t \mid b_1$, say $b_1 = tb$. Hence, by (3),

$$3a^2 + b^2 = 4\delta;$$

and from (4), (5), (2) we find the values of $x, y, z$ stated in the following

THEOREM 1. All integer solutions $(x, y, z)$, if any, of

$$x^3 + t^3y^3 + t^6z^3 - 3t^3xyz = m$$

in which $t$ is a constant integer $\neq 0$ and $m$ is any integer $\neq 0$, are given by

(6)  $6x = 2d - b + 3a, \qquad 3ty = d + b, \qquad 6t^2z = 2d - b - 3a,$

where $a, b$ are integers such that

(7)  $3a^2 + b^2 = 4\delta,$

and $d, \delta$ are integers such that

(8)  $m = d\delta, \qquad d^2 \equiv \delta \bmod 3.$

Hence $m, d$ are both positive or both negative, and

(9)  $(d, \delta) \equiv (0, 0),\ (1, 1),\ (-1, 1) \bmod 3.$

THEOREM 2. The only $m \neq 0$ representable in the form (1) are of the forms $9n$, $3n \pm 1$.
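The factorization used in §2 and the residue restriction of Theorem 2 can be checked by direct enumeration over a small box of integers. This is only a sanity check on a finite range, not a proof; the function names and the bounds are illustrative.

```python
def form(x, y, z, t=1):
    """Value of x^3 + t^3 y^3 + t^6 z^3 - 3 t^3 x y z."""
    return x**3 + t**3 * y**3 + t**6 * z**3 - 3 * t**3 * x * y * z

def factored(x, y, z, t=1):
    """Product of the two factors appearing in section 2."""
    linear = x + t * y + t * t * z
    quadratic = (x * x + t**2 * y * y + t**4 * z * z
                 - t * x * y - t**2 * x * z - t**3 * y * z)
    return linear * quadratic

def represented_values(t, bound):
    """All values taken by the form for |x|, |y|, |z| <= bound."""
    rng = range(-bound, bound + 1)
    return {form(x, y, z, t) for x in rng for y in rng for z in rng}

values = represented_values(1, 5)
bad = [v for v in values if v != 0 and v % 3 == 0 and v % 9 != 0]
print(len(values), bad)  # the second item should be an empty list
```

Every non-zero value found is either divisible by 9 or prime to 3, in agreement with Theorem 2.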

3. Taking the first of (9), we have $(d, \delta) = (3d_1, 3\delta_1)$. Hence, from the first of (6), $3 \mid b$, $b = 3b_1$;

$$x = d_1 - \tfrac12(b_1 - a), \qquad b_1 \equiv a \bmod 2,$$

the last since $x$ is an integer. From (7) we now have $a^2 + 3b_1^2 = 4\delta_1$. Taking

$$(a, b_1) = (2a_2, 2b_2),\ (2a_3 + 1, 2b_3 + 1),$$

and dropping all suffixes, we get from Theorem 1,

THEOREM 3. All integer solutions $(x, y, z)$, if any, of

$$x^3 + t^3y^3 + t^6z^3 - 3t^3xyz = 9n, \qquad n \neq 0,$$

are given by

$$x = d - b + a, \qquad ty = d + 2b, \qquad t^2z = d - b - a,$$

with $a, b, d, \delta$ integers such that

(10)  $n = d\delta, \qquad a^2 + 3b^2 = \delta,$

and

$$x = d - b + a, \qquad ty = d + 2b + 1, \qquad t^2z = d - b - a - 1,$$

with $a, b, d, \delta$ integers such that

(11)  $n = d\delta, \qquad (2a + 1)^2 + 3(2b + 1)^2 = 4\delta.$


Similarly, the second and third of (9) give the following

THEOREM 4. All integer solutions $(x, y, z)$, if any, of

$$x^3 + t^3y^3 + t^6z^3 - 3t^3xyz = n, \qquad n \equiv 1 \bmod 3,$$

are given by

$$x = (d - 1)/3 - b + a + 1, \qquad ty = (d - 1)/3 + 2b, \qquad t^2z = (d - 1)/3 - b - a,$$

with $a, b, d, \delta$ integers such that

(12)  $n = d\delta, \qquad d \equiv \delta \equiv 1 \bmod 3, \qquad 3(2a + 1)^2 + (6b - 1)^2 = 4\delta.$

THEOREM 5. All integer solutions $(x, y, z)$, if any, of

$$x^3 + t^3y^3 + t^6z^3 - 3t^3xyz = n, \qquad n \equiv -1 \bmod 3,$$

are given by

$$x = (d + 1)/3 - b + a, \qquad ty = (d + 1)/3 + 2b, \qquad t^2z = (d + 1)/3 - b - a - 1,$$

with $a, b, d, \delta$ integers such that

(13)  $n = d\delta, \qquad d \equiv -\delta \equiv -1 \bmod 3, \qquad 3(2a + 1)^2 + (6b + 1)^2 = 4\delta.$


4. An upper bound to the number $F(n)$ of representations $(x, y, z)$ of $n$ in the form (1) is easily obtained from the number $N(n)$ of representations $(a, b)$ of $n$ in the form $a^2 + 3b^2$. Let $n = 2^{\alpha}m$, where $m$ is odd. Then² if $E(m)$ denotes the excess of the number of divisors $3h + 1$ of $m$ over the number of divisors $3h - 1$,

$N(n) = 0$, if $\alpha$ is odd;
$N(n) = 2E(m)$, if $\alpha = 0$;
$N(n) = 6E(m)$, if $\alpha$ is even, $> 0$.
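The three-case formula for $N(n)$ can be compared with a direct count of the pairs $(a, b)$. The sketch below is an illustration only; the function names are hypothetical and the comparison covers a finite range of $n$.

```python
import math

def N_bruteforce(n):
    """Number of integer pairs (a, b) with a^2 + 3 b^2 = n, for n >= 1."""
    count = 0
    b_max = math.isqrt(n // 3)
    for b in range(-b_max, b_max + 1):
        rest = n - 3 * b * b
        a = math.isqrt(rest)
        if a * a == rest:
            count += 1 if a == 0 else 2  # a and -a are distinct unless a = 0
    return count

def E(m):
    """Excess of divisors of m of the form 3h + 1 over those of the form 3h - 1."""
    excess = 0
    for d in range(1, m + 1):
        if m % d == 0:
            if d % 3 == 1:
                excess += 1
            elif d % 3 == 2:
                excess -= 1
    return excess

def N_formula(n):
    """N(n) from the formula: write n = 2^alpha * m with m odd."""
    alpha, m = 0, n
    while m % 2 == 0:
        m //= 2
        alpha += 1
    if alpha % 2 == 1:
        return 0
    return (2 if alpha == 0 else 6) * E(m)

for n in (1, 4, 7, 12, 28, 32):
    print(n, N_bruteforce(n), N_formula(n))
```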


Referring to (10), (11), we have ($n \neq 0$),

$$F(9n) \le 6\left[\sum N(\delta) + \sum N(4\delta)\right],$$

where the sums refer to all divisors $\delta > 0$ of $2^{\alpha}m$. These divisors are $2^{\beta}\delta'$, where $0 \le \beta \le \alpha$, and $\delta'$ is any divisor $> 0$ of $m$. A short reduction of the resulting inequality for $F(9n)$ gives

THEOREM 6. If $F(k)$ denotes the total number of representations of $k$ in the form (1),

$$F(9n) \le \left[6\alpha + 5 + 3(-1)^{\alpha}\right] \sum E(\delta),$$

where $n = 2^{\alpha}m$, $\alpha \ge 0$, $m$ is odd, and $\sum$ refers to the divisors $\delta > 0$ of $m$.

² L. E. Dickson, Introduction to the Theory of Numbers, 1929, p. 80, Exercise 3.

The factor 6 enters, since a representation $(x, y, z)$ for a particular $(a, b)$ contributes at most 6 representations by permutations of $x, y, z$. In the same way, (12), (13) give

THEOREM 7. If $n \equiv \pm 1 \bmod 3$,

$$F(n) \le 12 \sum E(\delta),$$

where $\sum$ refers to all divisors $\delta > 0$ of $n$.


5. Taking $t = 1$ in (1) we now consider representations in

(14)  $x^3 + y^3 + z^3 - 3xyz.$

THEOREM 8. All integers $\equiv 0 \bmod 9$, or $\equiv \pm 1 \bmod 3$, and only these, are represented in the form (14), and for integers $\neq 0$ the number of representations is finite.

This follows from Theorem 2 and (10), (11), (12), (13), since each of these equations has at least one integer solution when $\delta = 1$. The first part of this theorem was proved otherwise by Carmichael,³ who showed also that for integers $> 0$ in each case there is a representation $(x, y, z)$ with $x, y, z$ non-negative. Here, taking $t = 1$ in Theorems 3, 4, 5, we reach the same conclusion, with the additional information in

THEOREM 9. In the form (14), $9n$ has the representation $(n + 1, n, n - 1)$, and if $n > 1$, a representation $(x, y, z)$ with $x > 0$, $y > 0$, $z > 0$; $n \equiv 1 \bmod 3$ has the representation $\left(\frac{n+2}{3}, \frac{n-1}{3}, \frac{n-1}{3}\right)$, and if $n > 1$, a representation $(x, y, z)$ with $x > 0$, $y > 0$, $z > 0$; $n \equiv -1 \bmod 3$ has the representation $\left(\frac{n+1}{3}, \frac{n+1}{3}, \frac{n-2}{3}\right)$, and if $n > 2$, a representation $(x, y, z)$ with $x > 0$, $y > 0$, $z > 0$.


³ R. D. Carmichael, Bulletin of the American Mathematical Society, vol. 22 (1915), pp. 111-117.

Considerably more may be stated for certain special forms of $n$. With $N$, $E$ as in §4, let $p$ be a prime $> 0$. Then $N(p) = 4$ or $0$ according as $p \equiv 1 \bmod 3$ or $p \equiv -1 \bmod 3$. Hence for $p \equiv 1 \bmod 3$ there is precisely one representation $(a, b) = (\alpha, \beta)$, with $\alpha > 0$, $\beta > 0$, of $p$ in the form $a^2 + 3b^2$. A straightforward application of these remarks to Theorems 3, 4, 5 with $t = 1$ gives

THEOREM 10. If $p$ is a positive prime $\equiv 1 \bmod 3$, there are precisely 18 representations $(x, y, z)$ of $9p$ in the form $x^3 + y^3 + z^3 - 3xyz$, obtained by permutations of $x, y, z$ in

$(x, y, z) = (p - 1, p, p + 1)$,
$(1 + \alpha + \beta,\ 1 - 2\beta,\ 1 - \alpha + \beta)$,
$(1 + \alpha - \beta,\ 1 + 2\beta,\ 1 - \alpha - \beta)$,

where $(\alpha, \beta)$ is the (necessarily) unique representation $(a, b) = (\alpha, \beta)$ of $p$ in the form $a^2 + 3b^2$ with $a > 0$, $b > 0$; if $p$ is a positive prime $\equiv -1 \bmod 3$, there are precisely 6 representations of $9p$, obtained by permutations from $(p - 1, p, p + 1)$.
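The counts in Theorem 10 can be reproduced by enumeration for small primes. The sketch below uses the factorization of §2 with $t = 1$: for $m > 0$ it runs over the divisors $d = x + y + z$ of $m$ and tests the second factor. The function name `representations` and the search bound are illustrative; the bound follows from $(x - y)^2 + (y - z)^2 + (z - x)^2 = 2\delta$.

```python
import math

def representations(m):
    """All integer triples (x, y, z) with x^3 + y^3 + z^3 - 3xyz = m, for m > 0."""
    found = set()
    for d in range(1, m + 1):
        if m % d:
            continue
        delta = m // d
        # Each difference of two unknowns is at most sqrt(2*delta) in size,
        # and 3x = d + (x - y) + (x - z), which bounds |x| (likewise |y|).
        spread = math.isqrt(2 * delta)
        bound = (d + 2 * spread) // 3 + 1
        for x in range(-bound, bound + 1):
            for y in range(-bound, bound + 1):
                z = d - x - y
                if x * x + y * y + z * z - x * y - y * z - z * x == delta:
                    found.add((x, y, z))
    return found

print(len(representations(9 * 7)))   # p = 7, p = 1 mod 3
print(len(representations(9 * 5)))   # p = 5, p = -1 mod 3
print(sorted(representations(5)))    # the prime 5 itself
```

For $p = 7$ one has $(\alpha, \beta) = (2, 1)$, and the triples $(4, -1, 0)$ and $(2, 3, -2)$ of the theorem appear in the output set together with the permutations of $(6, 7, 8)$.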

The implied distinctness of these representations is easily seen from the primality of $p$ and a simple contradiction. The corresponding theorems for $p < 0$ follow at once if we note that $\delta$ in preceding theorems is $> 0$. Similarly we find

THEOREM 11. The only representations of the positive prime $p \equiv -1 \bmod 3$ in the form $x^3 + y^3 + z^3 - 3xyz$ are the three obtained by permutations from

$$\left(\frac{p+1}{3},\ \frac{p+1}{3},\ \frac{p-2}{3}\right).$$

The following immediate consequences of Theorem 7 are of some interest.

THEOREM 12. If $p$ is a positive prime, $\alpha$ an integer $\ge 0$, and if $G(n)$ denotes the total number of representations of $n$ in the form (14),

$G(p^{\alpha}) \le 6(\alpha + 1)(\alpha + 2)$, $\quad p \equiv 1 \bmod 3$;
$G(p^{\alpha}) \le 3[2\alpha + 3 + (-1)^{\alpha}]$, $\quad p \equiv -1 \bmod 3$.

The first part of Theorem 8 (Carmichael's result) has been generalized by Hua⁴ to any circulant.


CALIFORNIA INSTITUTE OF TECHNOLOGY.


⁴ L. K. Hua, Tôhoku Mathematical Journal, vol. 39 (1934), pp. 316-321.





ELEMENTARY PROOFS OF SOME KNOWN THEOREMS OF THE 
THEORY OF COMPLEX EUCLIDEAN SPACES 


By M. H. STONE AND J. D. TAMARKIN


In this note we shall indicate strictly elementary proofs of certain well-known fundamental theorems concerning a complex Euclidean space $\mathfrak{E}$.¹ Such proofs are not difficult to construct and must be widely known, but the proofs actually available in the literature repose, in many cases, upon arguments drawn from topology or from the spectral theory. It may therefore be helpful to put on record proofs of a more elementary nature.

We begin with a proposition due, in its general form, to Banach.²

THEOREM 1. If the sequence $\{g_n\}$ in $\mathfrak{E}$ has the property that $\{(f, g_n)\}$ is a bounded sequence for each $f$ in $\mathfrak{E}$, then the sequence $\{|g_n|\}$ is also bounded.

In constructing a proof by contradiction there is no loss of generality in assuming that $(f, g_n) \to 0$ for each $f$ in $\mathfrak{E}$. Indeed, if we make the assumption that $\{|g_n|\}$ is not bounded, we can select a subsequence $\{g_n'\}$ of $\{g_n\}$ such that $|g_n'| \ge n^2$, and we can then consider the sequence $\{g_n''\}$, where $g_n'' = g_n'/n$, for which the properties $|g_n''| \ge n$, $(f, g_n'') \to 0$ are evident.

We therefore proceed to obtain a contradiction from the assumption that $|g_n| \to \infty$, $(f, g_n) \to 0$ for each $f$ in $\mathfrak{E}$. Now by selecting an appropriate subsequence and renumbering its elements, we may further suppose that

$$|g_n| \ge 2^n, \qquad |(g_m, g_n)| \le 1 \text{ when } m \neq n.$$

In fact, if we have chosen $g_1, g_2, \cdots, g_N$, the relations $|g_n| \to \infty$, $(g_m, g_n) \to 0$ for $m = 1, 2, \cdots, N$ allow us to choose $g_{N+1}$ from the remaining members of the original sequence so that

$$|g_{N+1}| \ge 2^{N+1}, \qquad |(g_m, g_{N+1})| \le 1 \text{ for } m = 1, 2, \cdots, N.$$

An obvious inductive construction therefore provides us with the desired subsequence. The inequality $\big|\,g_\nu/|g_\nu|^2\,\big| \le 2^{-\nu}$ shows that the series $\sum_{\nu=1}^{\infty} g_\nu/|g_\nu|^2$ is dominated (in norm) term for term by the convergent series $\sum_{\nu=1}^{\infty} 2^{-\nu}$, and therefore is convergent in $\mathfrak{E}$ to a limit $f$. By assumption $(f, g_n) \to 0$. On the other hand, we have the contradictory inequality which follows:


Received December 30, 1936. 

¹ By a complex Euclidean space we mean a complex linear vector space with definite Hermitian-bilinear inner product $(f, g)$, complete in the metric defined by $|f - g| = (f - g, f - g)^{1/2}$. Hilbert space is a special case.

² Banach, Théorie des Opérations Linéaires, Warsaw, 1932, p. 80.





THEOREMS OF THEORY OF COMPLEX EUCLIDEAN SPACES 295 


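$$|(f, g_n)| = \Big|\sum_{\nu=1}^{\infty} (g_\nu, g_n)/|g_\nu|^2\Big| = \Big|1 + \sum_{\nu \neq n} (g_\nu, g_n)/|g_\nu|^2\Big| \ge 1 - \sum_{\nu \neq n} |(g_\nu, g_n)|/|g_\nu|^2 \ge 1 - \sum_{\nu=1}^{\infty} 2^{-2\nu} = \tfrac{2}{3}.$$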
Consequently, our theorem is established. 

As an immediate corollary of this result, we have 

THEOREM 2. If $\{(f, g_n)\}$ converges for each $f$ in $\mathfrak{E}$, then there exists a uniquely determined element $g$ in $\mathfrak{E}$ such that $(f, g_n) \to (f, g)$. This element $g$ belongs to the closed linear manifold $\mathfrak{M}$ generated by the sequence $\{g_n\}$ and satisfies the inequality $|g| \le G$, where $G = \liminf |g_n| < +\infty$.

Theorem 1 shows at once that $G$ is finite. Now the limit $L(f)$ of $(f, g_n)$ obviously depends linearly on $f$ and, by virtue of the Schwarz inequality $|(f, g_n)| \le |f|\,|g_n|$, satisfies the inequality $|L(f)| \le G\,|f|$. A theorem of Riesz³ therefore asserts the existence of a unique element $g$ in $\mathfrak{E}$ such that $L(f) = (f, g)$ for every $f$ in $\mathfrak{E}$, and such, furthermore, that $|g| \le G$. Since $f$ is in the orthogonal complement $\mathfrak{M}^{\perp}$ of $\mathfrak{M}$ if and only if $(f, g_n) = 0$ for every $n$, we see that every such $f$ satisfies $(f, g) = L(f) = 0$. Thus $g$ belongs to the orthogonal complement of $\mathfrak{M}^{\perp}$, that is, to $\mathfrak{M}^{\perp\perp} = \mathfrak{M}$.

It should be noted that the inequality $|g| < G$ can hold; for example, if $\{g_n\}$ is an infinite orthonormal set, we have $g = 0$, $G = 1$.
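The orthonormal example can be made concrete in a truncated coordinate model. The sketch below is an illustration under simplifying assumptions: the space is replaced by real sequences of finite length `dim`, the element $f$ has coordinates $1/j$, and the names `unit_vector` and `inner` are hypothetical. Each coordinate vector has norm 1, while its inner product with the fixed $f$ tends to 0.

```python
import math

def unit_vector(n, dim):
    """Coordinate vector with a single 1 in position n (0-based)."""
    return [1.0 if j == n else 0.0 for j in range(dim)]

def inner(f, g):
    """Real inner product; the complex conjugation is not needed here."""
    return sum(a * b for a, b in zip(f, g))

dim = 2000
f = [1.0 / (j + 1) for j in range(dim)]  # square-summable coordinates
for n in (0, 9, 99, 999):
    e_n = unit_vector(n, dim)
    print(n + 1, inner(f, e_n), math.sqrt(inner(e_n, e_n)))
```

The second column decreases like $1/n$ while the third column stays equal to 1, so the weak limit $g = 0$ has norm strictly below $G = 1$.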

We now establish a well-known selection principle. 

THEOREM 3. In order that $\{g_n\}$ contain a subsequence $\{g_n'\}$ such that $\{(f, g_n')\}$ converges for each $f$ in $\mathfrak{E}$, it is necessary and sufficient that $G = \liminf |g_n| < +\infty$.

The necessity of the stated condition follows at once from Theorem 1. The sufficiency is proved by a quite familiar argument which we repeat only in outline. By restricting attention to a suitable subsequence and renumbering it, we may suppose that $|g_n| \le K$, where $K$ is a fixed but arbitrary constant exceeding $G$. The inequality $|(g_m, g_n)| \le |g_m|\,|g_n| \le K^2$ then permits us to apply the diagonal process so as to obtain a subsequence $\{g_n'\}$ for which the sequences $\{(g_m, g_n')\}$, $m = 1, 2, 3, \cdots$ are all convergent. This is the desired sequence. Dropping primes, we have $|g_n| \le K$ and $\{(g_m, g_n)\}$ convergent for every $m$. If $f$ is an arbitrary element in $\mathfrak{E}$, we can write $f = f_1 + f_2$, where $f_1$ is in the closed linear manifold $\mathfrak{M}$ generated by $\{g_n\}$ and $f_2$ is orthogonal to $\mathfrak{M}$. Thus, if $g$ is an arbitrary element of $\mathfrak{E}$, we have

$$|(f, g_m) - (f, g_n)| = |(f_1, g_m) - (f_1, g_n)| \le |(f_1 - g, g_m)| + |(f_1 - g, g_n)| + |(g, g_m) - (g, g_n)| \le 2K\,|f_1 - g| + |(g, g_m) - (g, g_n)|.$$

Here we can choose $g$ as a linear combination of $g_1, g_2, \cdots$ so that the first


³ F. Riesz, Zur Theorie des Hilbertschen Raumes, Szeged Acta, vol. 7 (1934), pp. 34-38.














296 M. H. STONE AND J. D. TAMARKIN 


term is rendered small; and the second term then becomes small whenever $m$ and $n$ are both sufficiently great. Hence $\{(f, g_n)\}$ converges for arbitrary $f$ in $\mathfrak{E}$.

We can restate the results of Theorems 1-3 in other terms by introducing the following definitions: a sequence $\{g_n\}$ is said to converge weakly or to be weakly convergent if $\{(f, g_n)\}$ converges for each $f$ in $\mathfrak{E}$; to have a weak limit $g$ if $(f, g_n) \to (f, g)$ for each $f$ in $\mathfrak{E}$; to be weakly bounded if $\{(f, g_n)\}$ is bounded for each $f$ in $\mathfrak{E}$; and to be bounded if $\{|g_n|\}$ is bounded. In fact, it is easily seen that our theorems imply the following propositions:

(1) a sequence is weakly bounded (if and) only if it is bounded;

(2) $\mathfrak{E}$ is weakly complete, that is, every weakly convergent sequence has a weak limit in $\mathfrak{E}$, necessarily unique;

(3) every closed linear manifold $\mathfrak{M}$ is weakly closed and weakly complete, that is, a weakly convergent sequence in $\mathfrak{M}$ has a weak limit in $\mathfrak{M}$;

(4) the "sphere" $K(R)$ specified by the inequality $|g| \le R$ is weakly closed and weakly compact, that is, every sequence in $K(R)$ contains a weakly convergent subsequence with weak limit in $K(R)$.

On the other hand, Theorems 1-3 can be deduced from these four propositions in an obvious way.

We turn now to a problem in operator-theory. We prove⁴

THEOREM 4. If the operator $A$ has $\mathfrak{E}$ as its domain, then the following properties are equivalent:⁵

(1) $A$ is a closed linear operator;

(2) the domain $\mathfrak{D}(A^*)$ of the adjoint $A^*$ of $A$ is everywhere dense in $\mathfrak{E}$;

(3) $A$ is bounded and linear.

When $A$ has any of these three properties, $\mathfrak{D}(A^*)$ coincides with $\mathfrak{E}$ and $A^*$ has the same bound as $A$.

We establish the implications (1) → (2), (2) → (3), (3) → (1), proving incidentally that (2) implies $\mathfrak{D}(A^*) = \mathfrak{E}$.

Assuming (1), we form the graph $\mathfrak{G}(A)$ of $A$ in the direct sum⁶ $\mathfrak{E} \oplus \mathfrak{E}$ and apply reasoning due to von Neumann.⁷ $\mathfrak{G}(A)$ is the set of all elements of the form $\{f, Af\}$, where $f$ belongs to $\mathfrak{E}$, the domain of $A$. Since $A$ is closed and linear, $\mathfrak{G}(A)$ is a closed linear manifold. In fact, the identity

$$|f_n - f|^2 + |Af_n - f^*|^2 = |\{f_n, Af_n\} - \{f, f^*\}|^2$$

shows that $\{f_n, Af_n\} \to \{f, f^*\}$ if and only if $f_n \to f$, $Af_n \to f^*$; and the assumption that $A$ is closed permits us to conclude the relation $f^* = Af$ or the equivalent

⁴ J. v. Neumann, Über adjungierte Funktionaloperatoren, Annals of Mathematics, vol. 33 (1932), pp. 294-310, especially Satz 12, p. 310. See also reference to Tamarkin in footnote on p. iv of the foreword to Stone, Linear Transformations in Hilbert Space, New York, 1932.
⁵ For definitions of terms used, see Stone, loc. cit.
⁶ Stone, loc. cit., p. 30.
⁷ J. v. Neumann, loc. cit.






















relation $\{f, f^*\} \in \mathfrak{G}(A)$. Similarly, the assumption that $A$ is linear leads to the relations

$$\sum_{k=1}^{n} a_k\{f_k, Af_k\} = \Big\{\sum_{k=1}^{n} a_k f_k,\ \sum_{k=1}^{n} a_k Af_k\Big\} = \Big\{\sum_{k=1}^{n} a_k f_k,\ A\Big(\sum_{k=1}^{n} a_k f_k\Big)\Big\} \in \mathfrak{G}(A).$$

The orthogonal complement of $\mathfrak{G}(A)$ is a closed linear manifold $\mathfrak{G}^{\perp}(A)$, and since $\mathfrak{G}(A)$ is a closed linear manifold, it coincides with the orthogonal complement of $\mathfrak{G}^{\perp}(A)$. Now the identity

$$(Af, g) - (f, g^*) = (\{f, Af\}, \{-g^*, g\})$$

shows that $\{-g^*, g\} \in \mathfrak{G}^{\perp}(A)$ if and only if $(Af, g) = (f, g^*)$ for every $f$ in $\mathfrak{E}$; in other words, if and only if $g \in \mathfrak{D}(A^*)$ and $g^* = A^*g$. Since $\mathfrak{D}(A^*)$ is obviously a linear manifold, we can prove that $\mathfrak{D}(A^*)$ is everywhere dense in $\mathfrak{E}$ by showing that the only element $h$ orthogonal to $\mathfrak{D}(A^*)$ is the element $h = 0$. If $h$ is orthogonal to $\mathfrak{D}(A^*)$, then

$$(\{0, h\}, \{-g^*, g\}) = (h, g) = 0$$

for all elements $\{-g^*, g\}$ in $\mathfrak{G}^{\perp}(A)$. Hence $\{0, h\}$ is an element of $\mathfrak{G}(A)$, the orthogonal complement of $\mathfrak{G}^{\perp}(A)$, and the relation $h = A0 = 0$ is valid. We have thus deduced (2) from (1).

Now let us assume (2). If $g$ is an arbitrary element of $\mathfrak{E}$, we choose a sequence $\{g_n\}$ in $\mathfrak{D}(A^*)$ such that $g_n \to g$. Since the relations $(f, A^*g_n) = (Af, g_n) \to (Af, g)$ hold for each $f$ in $\mathfrak{E}$, an application of Theorem 2 establishes the existence of a unique element $g^*$ such that $(Af, g) = (f, g^*)$ for every $f$ in $\mathfrak{E}$. Hence $g \in \mathfrak{D}(A^*)$ and $A^*g = g^*$. Our first consequence of (2) is therefore the identity $\mathfrak{D}(A^*) = \mathfrak{E}$. Since $A$ and $A^*$ are both defined over $\mathfrak{E}$, it is evident that the adjoint $A^{**}$ of $A^*$ coincides with $A$ and that $A^*$ shares property (2) with $A$. The identity $A = A^{**}$ shows that $A$ is linear. If it were not bounded, there would exist a sequence $\{g_n\}$ with the properties

$$\liminf |g_n| < +\infty, \qquad \lim |Ag_n| = +\infty;$$

indeed, the first of these properties could be replaced by the stronger property $|g_n| = 1$. Now the equation $(f, Ag_n) = (A^*f, g_n)$ holds for all $f$ in $\mathfrak{E}$. Hence we could apply the sufficient condition of Theorem 3 to the right-hand member, the necessary condition of the same theorem to the left-hand member, so as to infer the relation $\liminf |Ag_n| < +\infty$. We would thus obtain a contradiction. Hence $A$ must be bounded. Since our argument can be applied also to $A^*$, the latter operator is also bounded. If $\alpha$ is the bound of $A$ (namely, the least non-negative real number such that $|Af| \le \alpha\,|f|$ for every $f$ in $\mathfrak{E}$) and if $\alpha^*$ is the bound of $A^*$, then the inequalities

$$|Af|^2 = (Af, Af) = (A^*Af, f) \le |A^*Af|\,|f| \le \alpha^*\,|Af|\,|f|, \qquad |Af| \le \alpha^*\,|f|$$

show that $\alpha \le \alpha^*$. By symmetry we have also $\alpha^* \le \alpha$ and hence $\alpha = \alpha^*$. Thus














298 M. H. STONE AND J. D. TAMARKIN 


we have proved not only that (3) is a consequence of (2), but also that, if A
has property (2), A* is defined over $\mathfrak{E}$ and is a bounded linear operator with the
same bound as A.

Finally, the implication (3) → (1) is trivial, for if A is a bounded linear operator
with $\mathfrak{E}$ as its domain and $a$ as its bound, then the relation $f_n \to f$ implies

\[ |Af_n - Af| = |A(f_n - f)| \le a|f_n - f| \to 0, \qquad Af_n \to Af, \]

and A is therefore closed.

A very useful application of this theorem is the following result due to Hel- 
linger-Toeplitz.° 

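THEOREM 5. The following properties of an infinite matrix $\{a_{mn}\}$, $m, n = 1, 2, 3, \cdots$,
of complex elements are equivalent:

(1) $\{a_{mn}\}$ is bounded, that is, there exists a non-negative constant $a$ such that

\[ \sum_{\mu=1}^{N} \Big| \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big|^2 \le a^2 \sum_{\nu=1}^{N} |x_\nu|^2 \]

for arbitrary complex numbers $x_1, \cdots, x_N$ and arbitrary integral N;

(2) the convergence of $\sum_{\nu=1}^{\infty} |x_\nu|^2$ implies the convergence of

\[ \sum_{\mu=1}^{\infty} \Big| \sum_{\nu=1}^{\infty} a_{\mu\nu} x_\nu \Big|^2; \]

(3) the convergence of $\sum_{\nu=1}^{\infty} |x_\nu|^2$ and $\sum_{\mu=1}^{\infty} |y_\mu|^2$ implies the convergence of the
double series

\[ \sum_{\mu,\nu=1}^{\infty} a_{\mu\nu} x_\nu \bar{y}_\mu; \]

(4) the convergence of $\sum_{\nu=1}^{\infty} |x_\nu|^2$ and $\sum_{\mu=1}^{\infty} |y_\mu|^2$ implies the convergence of the
series

\[ \sum_{\mu=1}^{\infty} \Big( \sum_{\nu=1}^{\infty} a_{\mu\nu} x_\nu \Big) \bar{y}_\mu. \]

These properties are equivalent to the corresponding properties for the adjoint
matrix $\{a^*_{mn}\}$, where $a^*_{mn} = \bar{a}_{nm}$. When any of these equivalent properties holds,
we have

\[ \sum_{\mu,\nu=1}^{\infty} a_{\mu\nu} x_\nu \bar{y}_\mu = \sum_{\mu=1}^{\infty} \Big( \sum_{\nu=1}^{\infty} a_{\mu\nu} x_\nu \Big) \bar{y}_\mu = \sum_{\nu=1}^{\infty} \Big( \sum_{\mu=1}^{\infty} a_{\mu\nu} \bar{y}_\mu \Big) x_\nu. \]

We shall establish the implications

\[ (1) \to (3) \to (1), \qquad (1) \to (4) \to (2) \to (1) \]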


§ Hellinger-Toeplitz, Mathematische Annalen, vol. 69 (1910), pp. 289-330, esp. pp.321-322. 





















THEOREMS OF THEORY OF COMPLEX EUCLIDEAN SPACES 299 


and shall show further that (1) implies the final statement of the theorem.
Since we have

\[ \sum_{\mu,\nu=1}^{N} a^*_{\mu\nu}\, y_\nu \bar{x}_\mu = \overline{\sum_{\mu,\nu=1}^{N} a_{\mu\nu}\, x_\nu \bar{y}_\mu}, \]

we see that property (3) holds for a matrix if and only if it holds for the adjoint
matrix. Thus the equivalent properties (1)-(4) are equivalent to the corresponding
properties for the adjoint matrix.

We shall obtain proofs by applying Theorems 1, 2 and 4 in the concrete
Hilbert space $\mathfrak{H}_0$ of all sequences $x = \{x_1, x_2, x_3, \cdots\}$ for which $\sum_{\nu=1}^{\infty} |x_\nu|^2$
converges. In $\mathfrak{H}_0$, the elements $\xi_n = \{\delta_{1n}, \delta_{2n}, \delta_{3n}, \cdots\}$, $\delta_{mn} = 0$ for $m \ne n$
and $\delta_{nn} = 1$, where $n = 1, 2, 3, \cdots$, constitute a complete orthonormal set.
Hence the linear manifold $\mathfrak{M}$ which they generate is everywhere dense in $\mathfrak{H}_0$.
We denote by $A_N$ the operator which carries $x = \{x_1, x_2, x_3, \cdots\}$ into

\[ A_N x = \Big\{ \sum_{\nu=1}^{N} a_{1\nu} x_\nu,\ \cdots,\ \sum_{\nu=1}^{N} a_{N\nu} x_\nu,\ 0,\ 0,\ \cdots \Big\}. \]

It is evident that $A_N$ is a linear operator with $\mathfrak{H}_0$ as its domain, and it is easily
shown that $A_N$ is bounded and has an adjoint $A_N^*$.
Assuming (1), we first observe that, by virtue of the inequalities

\[ |A_N x|^2 = \sum_{\mu=1}^{N} \Big| \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big|^2 \le a^2 \sum_{\nu=1}^{N} |x_\nu|^2 \le a^2 \sum_{\nu=1}^{\infty} |x_\nu|^2 = a^2 |x|^2, \]

the operators $A_N$, $N = 1, 2, 3, \cdots$, are uniformly bounded. Since $(A_N \xi_n, \xi_m) = a_{mn}$
for $N \ge m$ and $N \ge n$, the sequence $\{(A_N x, y)\}$ converges for all $x, y$
in $\mathfrak{M}$. Now if $x, x', y, y'$ are arbitrary elements of $\mathfrak{H}_0$, we have

\[ \begin{aligned} |(A_M x, y) - (A_N x, y)| &\le |(A_M(x - x'), y)| + |(A_N(x - x'), y)| \\ &\quad + |(A_M x', y - y')| + |(A_N x', y - y')| + |(A_M x', y') - (A_N x', y')| \\ &\le 2a|x - x'|\,|y| + 2a|x'|\,|y - y'| + |(A_M x', y') - (A_N x', y')|. \end{aligned} \]

We can render the first two terms in the final expression small by choosing
$x'$ and $y'$ in $\mathfrak{M}$ so as to approximate $x$ and $y$ respectively; and we see that the third
then becomes small for all sufficiently great M and N. The sequence $\{(A_N x, y)\}$
therefore converges for all $x, y$ in $\mathfrak{H}_0$. Since $A_N$ has adjoint $A_N^*$ with $\mathfrak{H}_0$ as its
domain, the equation $(A_N x, y) = (x, A_N^* y)$ shows that the sequence $\{(x, A_N^* y)\}$ is
likewise convergent. According to Theorem 2, there exist unique elements
$Ax$, $A^*y$, such that

\[ (A_N x, y) \to (Ax, y) = (x, A^*y) \]

for all $x, y$ in $\mathfrak{H}_0$. The operators A and A* so defined have $\mathfrak{H}_0$ as their common
domain, are both linear, and are adjoints of one another. According to Theorem
4, we see that in addition A and A* are both bounded. We note in particular
the relations

\[ (A\xi_n, \xi_m) = a_{mn}, \qquad (A^*\xi_n, \xi_m) = (\xi_n, A\xi_m) = \bar{a}_{nm} = a^*_{mn}. \]


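Writing $x = \sum_{\nu=1}^{\infty} (x, \xi_\nu)\xi_\nu = \sum_{\nu=1}^{\infty} x_\nu \xi_\nu$ for the arbitrary element $x = \{x_1, x_2, x_3, \cdots\}$
and applying the linearity and continuity properties of A, we now have

\[ |Ax|^2 = \sum_{\mu=1}^{\infty} |(Ax, \xi_\mu)|^2 = \sum_{\mu=1}^{\infty} \Big| \Big( \sum_{\nu=1}^{\infty} x_\nu A\xi_\nu,\ \xi_\mu \Big) \Big|^2 = \sum_{\mu=1}^{\infty} \Big| \sum_{\nu=1}^{\infty} (A\xi_\nu, \xi_\mu) x_\nu \Big|^2 = \sum_{\mu=1}^{\infty} \Big| \sum_{\nu=1}^{\infty} a_{\mu\nu} x_\nu \Big|^2. \]

Thus (1) implies (2). Also, putting $y = \sum_{\mu=1}^{\infty} y_\mu \xi_\mu$, we find similarly that

\[ (A_N x, y) = \Big( \sum_{\mu=1}^{N} \Big( \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big) \xi_\mu,\ y \Big) = \sum_{\mu,\nu=1}^{N} a_{\mu\nu} x_\nu \bar{y}_\mu \to (Ax, y). \]

Thus (1) implies (3). By analogous reasoning we have

\[ (Ax, y) = \sum_{\mu=1}^{\infty} (Ax, \xi_\mu)(\xi_\mu, y) = \sum_{\mu=1}^{\infty} \Big( \sum_{\nu=1}^{\infty} a_{\mu\nu} x_\nu \Big) \bar{y}_\mu, \]

\[ (Ax, y) = (x, A^*y) = \sum_{\nu=1}^{\infty} x_\nu (\xi_\nu, A^*y) = \sum_{\nu=1}^{\infty} x_\nu (A\xi_\nu, y) = \sum_{\nu=1}^{\infty} \Big( \sum_{\mu=1}^{\infty} a_{\mu\nu} \bar{y}_\mu \Big) x_\nu. \]

Thus (1) implies (4); and also implies the equality of the sums of the three series
expressing $(Ax, y)$. If we denote by $E_N$ the projection of $\mathfrak{H}_0$ on the (closed)
linear manifold generated by $\xi_1, \cdots, \xi_N$, we see that $A_N = E_N A E_N$. It is
thus easy to show that $A_N x$ converges in $\mathfrak{H}_0$ to $Ax$, and that the bound of A
is the least constant $a$ for which the inequality of (1) holds.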

Next let us assume (2). We can then define an operator A which carries
$x = \{x_1, x_2, x_3, \cdots\}$ into $Ax = \big\{ \sum_{\nu=1}^{\infty} a_{1\nu} x_\nu,\ \sum_{\nu=1}^{\infty} a_{2\nu} x_\nu,\ \cdots \big\}$. It is evident
that A is a linear operator with $\mathfrak{H}_0$ as its domain. Since

\[ (x, A_N^* \xi_m) = (A_N x, \xi_m) = \sum_{\nu=1}^{N} a_{m\nu} x_\nu \to \sum_{\nu=1}^{\infty} a_{m\nu} x_\nu = (Ax, \xi_m) \]

when N becomes infinite through values exceeding m, we can apply Theorem 2
to write $(Ax, \xi_m) = (x, \xi_m^*)$ for $m = 1, 2, 3, \cdots$. Hence A has adjoint A* defined
throughout the linear manifold $\mathfrak{M}$. By Theorem 4, the operator A is
bounded. If $a$ is the bound of A, we have

\[ \sum_{\mu=1}^{\infty} \Big| \sum_{\nu=1}^{\infty} a_{\mu\nu} x_\nu \Big|^2 = |Ax|^2 \le a^2 |x|^2 = a^2 \sum_{\nu=1}^{\infty} |x_\nu|^2. \]

Applying this inequality to the case where $x = \{x_1, x_2, \cdots, x_N, 0, 0, \cdots\}$,
we have

\[ \sum_{\mu=1}^{N} \Big| \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big|^2 \le \sum_{\mu=1}^{\infty} \Big| \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big|^2 \le a^2 \sum_{\nu=1}^{N} |x_\nu|^2. \]

Hence (2) implies (1). We may remark that the operator A defined in the
present paragraph is now easily identified with the operator A introduced in the
preceding one, by virtue of the relation $(A_N x, \xi_m) \to (Ax, \xi_m)$ which holds for
both operators A.

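Next let us assume (3). If $x = \{x_1, x_2, x_3, \cdots\}$ and $y = \{y_1, y_2, y_3, \cdots\}$
are arbitrary elements of $\mathfrak{H}_0$, we see that

\[ (A_N x, y) = \sum_{\mu,\nu=1}^{N} a_{\mu\nu} x_\nu \bar{y}_\mu = (x, A_N^* y) \]

converges. By Theorem 2 there exist unique elements $Ax$, $A^*y$ such that

\[ (Ax, y) = \sum_{\mu,\nu=1}^{\infty} a_{\mu\nu} x_\nu \bar{y}_\mu = (x, A^*y) \]

for all $x, y$ in $\mathfrak{H}_0$. The operators A and A* thus defined have $\mathfrak{H}_0$ as their common
domain, are both linear, and are adjoints of one another. According to
Theorem 4, they are both bounded, with common bound $a$. Consequently
we have

\[ \Big| \sum_{\mu,\nu=1}^{\infty} a_{\mu\nu} x_\nu \bar{y}_\mu \Big|^2 \le a^2 |x|^2 |y|^2 = a^2 \sum_{\nu=1}^{\infty} |x_\nu|^2 \sum_{\mu=1}^{\infty} |y_\mu|^2. \]

Applying this inequality to the case where $x = \{x_1, \cdots, x_N, 0, 0, \cdots\}$ and
$y = A_N x$, we have

\[ \Big( \sum_{\mu=1}^{N} \Big| \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big|^2 \Big)^2 \le a^2 \sum_{\nu=1}^{N} |x_\nu|^2 \sum_{\mu=1}^{N} \Big| \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big|^2, \]

and hence

\[ \sum_{\mu=1}^{N} \Big| \sum_{\nu=1}^{N} a_{\mu\nu} x_\nu \Big|^2 \le a^2 \sum_{\nu=1}^{N} |x_\nu|^2. \]

Thus (3) implies (1). Again the operator A introduced here is easily identified
with those previously defined.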
Finally, we assume (4). Then for arbitrary $x = \{x_1, x_2, x_3, \cdots\}$ in $\mathfrak{H}_0$ the
series $\sum_{\nu=1}^{\infty} a_{m\nu} x_\nu$ converges to a sum $z_m$; and for arbitrary $y = \{y_1, y_2, y_3, \cdots\}$
in $\mathfrak{H}_0$, the series $\sum_{\mu=1}^{\infty} z_\mu \bar{y}_\mu$ converges. Let $\zeta_N$ be the element $\{z_1, z_2, \cdots, z_N,
0, 0, \cdots\}$ in $\mathfrak{H}_0$. Then $(\zeta_N, y) = \sum_{\mu=1}^{N} z_\mu \bar{y}_\mu$ converges for arbitrary $y$ in $\mathfrak{H}_0$.
Hence Theorem 1 shows that

\[ |\zeta_N|^2 = \sum_{\mu=1}^{N} \Big| \sum_{\nu=1}^{\infty} a_{\mu\nu} x_\nu \Big|^2 \]

remains bounded. We see therefore that (4) implies (2).
With this the proof of our theorem is complete. 
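
As a numerical illustration of property (1), one may take a matrix that is not discussed in the text: the Hilbert matrix with elements 1/(m+n-1), whose bound is known to be π (Hilbert's inequality). The following Python sketch, in which all function names are purely illustrative, estimates the least constant a of (1) for a few finite sections by power iteration. The estimates increase with N and stay below π, as the uniform boundedness of the sections requires.

```python
import math

def hilbert_section(N):
    """N x N leading section of the matrix a_{mn} = 1/(m+n-1), m, n = 1..N."""
    return [[1.0 / (m + n - 1) for n in range(1, N + 1)] for m in range(1, N + 1)]

def apply(A, x):
    """Matrix-vector product: the mu-th entry is sum_nu a_{mu nu} x_nu."""
    return [sum(a * xv for a, xv in zip(row, x)) for row in A]

def norm(x):
    """Euclidean norm |x| of a finite sequence."""
    return math.sqrt(sum(abs(v) ** 2 for v in x))

def section_bound(A, iterations=300):
    """Estimate the least a with |A x| <= a |x| by power iteration.

    This is adequate here because the section is real, symmetric and
    positive definite, so its bound equals its largest eigenvalue.
    The returned value never exceeds the true bound.
    """
    x = [1.0] * len(A)
    for _ in range(iterations):
        y = apply(A, x)
        size = norm(y)
        x = [v / size for v in y]
    return norm(apply(A, x))

# Estimated bounds of the sections for N = 5, 20, 80
bounds = [section_bound(hilbert_section(N)) for N in (5, 20, 80)]
```

The choice of the Hilbert matrix is an assumption made only for the sake of a concrete example; any matrix satisfying (1) would serve equally well.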


HARVARD UNIVERSITY AND BROWN UNIVERSITY. 

















THE STRUCTURE OF CERTAIN RATIONAL INFINITE ALGEBRAS 
By OTTO F. G. SCHILLING


G. Köthe has investigated infinite algebras over abstract fields in his paper
“Ueber Schiefkörper unendlichen Ranges”.¹ In this note we shall apply some
of his fundamental results to infinite algebras over a finite algebraic number
field. Application of the theory of finite algebras over algebraic number fields
enables us to give explicit representations of the algebras considered as generalized
crossed products.² Furthermore, we shall investigate the arithmetic in
such infinite algebras.


1. Algebraic theory. Let k be an algebraic number field of finite degree
over the field of all rational numbers. We consider algebras of infinite rank over
k as centrum. We assume that the algebras A are countable, i.e., that there
exists a countable set $\{a_1, a_2, \cdots, a_i, \cdots\}$ of elements of A, such that each element
$a$ of A can be represented as a finite sum $\sum_{\nu} k_{i_\nu} a_{i_\nu}$ with coefficients $k_{i_\nu}$
in the field k. Furthermore, we restrict ourselves to completely normal algebras
which we define as follows.

DEFINITION. A countable infinite algebra A with centrum k is called totally
normal over k if every finite system $\{b_1, \cdots, b_n\}$ of elements of A lies in a
normal simple finite algebra over k.

With slight modifications of Köthe’s proofs one proves

THEOREM 1. For every totally normal algebra A there exists at least one defining
sequence $k \subset \cdots \subset A_{i-1} \subset A_i \subset \cdots$ of normal simple algebras $A_i$.

Conversely, we have 

THEOREM 2. Every sequence $k \subset \cdots \subset A_{i-1} \subset A_i \subset \cdots$ of normal simple
algebras $A_i$ over k defines an infinite totally normal algebra A with the center k.

THEOREM 3. Every totally normal algebra A over k is representable as the direct 
product of a countable infinity of normal simple algebras A; of finite degrees over k. 
Such a decomposition is not necessarily uniquely determined. 

If we collect all simple normal systems of such a decomposition which belong
to a fixed prime q, we have

THEOREM 4. Every totally normal algebra A over k is the direct product of
a countable infinity of simple algebras $A^{(q)}$ which are primary with respect to k.
The factors are uniquely determined except for isomorphisms.


Received December 23, 1936. 

1 G. Köthe, Math. Annalen, vol. 105 (1931), pp. 15-39. See this paper also for the different
notions used later on.

2 For the theory of crossed products see the report of Max Deuring, Algebren, Berlin,
1935.




Thus the determination of structure is reduced for this type of algebra to
that of the structure of finite or infinite totally normal primary algebras $A^{(q)}$.

Now let us consider a direct product $A = A_1 \times \cdots \times A_i \times \cdots$ of infinitely
many finite primary algebras $A_i$. Then $A^{(n)} = A_1 \times \cdots \times A_n = D_n \times k_{r_n}$,
with a normal division algebra $D_n$ and total matric algebra $k_{r_n}$ of degree $r_n$
over the field k. Köthe has shown

THEOREM 5. If the degrees $r_n$ of the matric algebras successively contained in $A^{(n)}$
tend to infinity with increasing n, the infinite direct product A is isomorphic to the
direct product M of infinitely many matric algebras $k_r$.

With the help of Theorem 3 and the theory of finite normal simple algebras 
over an algebraic number field k, we prove 

THEOREM 6. There do not actually exist infinite totally normal primary division 
algebras D over a finite number field k. 

Proof. It is a well-known fact that the direct product of two normal finite
division algebras $D_1$ and $D_2$ over k is a division algebra if and only if the exponent
of $D_1 \times D_2$ is equal to the degree of $D_1 \times D_2$. The degree of $D_1 \times D_2$
is equal to the product of the degrees of the factors. Now let $D_1$ and $D_2$ be
primary division algebras with the respective degrees $q^{a_1}$ and $q^{a_2}$, $a_1$, $a_2$ both > 0.
Then the algebra $D_1 \times D_2$ has the degree $q^{a_1 + a_2}$.

On the other hand, the index of an algebra is determined as the least common
multiple of the local exponents or indices.³ According to the theory of finite
normal algebras over an algebraic number field, the local invariants of a product
are equal to the sum of the respective local invariants of the factors. The latter
can be represented for every prime ideal p of k as fractions with the maximal
denominators $q^{\max(a_1, a_2)}$. The possible exponent in the large is then at most
$q^{\max(a_1, a_2)}$. But this means that the direct product of two primary finite
division algebras $D_1$ and $D_2$ must necessarily split off a matric algebra, because
$\max(a_1, a_2) < a_1 + a_2$. We apply this to the products $D_1 \times D_2 \times \cdots \times D_n =
D'_n \times k_{r_n}$, where $D'_n$ is the division algebra which belongs to the product. Then
one sees immediately that the degrees $r_n$ of the matric algebras tend to infinity
with increasing n. According to Theorem 5 all these products are isomorphic
to the infinite product of matric algebras $k_r$. Every division algebra D can be
represented as an infinite direct product of finite division algebras as stated in 
Theorem 3. But our investigations of direct products of primary division 
algebras show that they must be isomorphic to M. 
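
The counting argument in this proof can be followed on a small example. The Python sketch below is only an illustration under stated assumptions: the prime labels `p1`, `p2`, `p3` and the invariants attached to them are hypothetical, chosen so that at each step the invariants sum to an integer, and the index is computed, as in the proof, as the least common multiple of the denominators of the local invariants.

```python
from fractions import Fraction
from math import gcd

def index_of(invariants):
    """Least common multiple of the denominators of the local invariants."""
    result = 1
    for value in invariants.values():
        d = (value % 1).denominator
        result = result * d // gcd(result, d)
    return result

def product_invariants(inv1, inv2):
    """Local invariants of a direct product: sums of the factors' invariants, mod 1."""
    primes = set(inv1) | set(inv2)
    return {p: (inv1.get(p, Fraction(0)) + inv2.get(p, Fraction(0))) % 1
            for p in primes}

# Hypothetical 3-primary division algebras, described only by local invariants
D1 = {"p1": Fraction(1, 3), "p2": Fraction(2, 3)}   # degree 3
D2 = {"p1": Fraction(1, 9), "p3": Fraction(8, 9)}   # degree 9

product = product_invariants(D1, D2)
degree = index_of(D1) * index_of(D2)    # degree of D1 x D2
index = index_of(product)               # degree of its division algebra part
matric_degree = degree // index         # degree of the matric algebra split off
```

Here the product has degree 27 but index only 9, so a matric algebra of degree 3 splits off, in agreement with the inequality max(a_1, a_2) < a_1 + a_2 used above.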

There remains then the study of totally normal algebras which are not iso- 
morphic to an M. They must be representable as infinite direct products of 
primary finite normal simple algebras, as Theorems 3 and 6 show. 

We apply the theory of structure as developed by H. Hasse⁵ to this problem.
He shows that every finite normal algebra A over the algebraic number field k 


3 Cf. footnote 2.
4 Cf. footnote 2.
5 H. Hasse, Die Struktur der R. Brauerschen Algebrenklassengruppe über einem algebraischen
Zahlkörper, Math. Annalen, vol. 107 (1933), pp. 731-760.

























of degree n can be represented as a cyclic crossed product (a, Z/k, u), where Z is
cyclic over k of degree n and a is an element (≠ 0) of k, u an operator, describing
the algebra A with respect to the splitting field Z. The complete factor set
of this representation is given by the square matrix of n rows and columns

\[ F^{(n)}(a) = \begin{pmatrix} 1 & 1 & \cdots & 1 \\ 1 & 1 & \cdots & a \\ \vdots & \vdots & & \vdots \\ 1 & a & \cdots & a \end{pmatrix}. \]

Therefore we may write $A = (F^{(n)}(a), Z/k, u)$.
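
A minimal sketch of this factor set, assuming the usual convention that the entry in row i and column j (both counted from 0) is the factor c in the product u^i u^j = c u^((i+j) mod n), so that c = a when i + j ≥ n and c = 1 otherwise. The helper names are illustrative and the element a is replaced by an ordinary integer.

```python
def factor_set(n, a):
    """Matrix F^(n)(a): entry (i, j) is the factor c in u^i u^j = c u^((i+j) mod n)."""
    return [[a if i + j >= n else 1 for j in range(n)] for i in range(n)]

def is_associative(F):
    """Check (u^i u^j) u^k = u^i (u^j u^k) for all i, j, k.

    Because a lies in k, it commutes with u, and associativity reduces to
    the identity F[i][j] * F[(i+j) % n][k] == F[j][k] * F[i][(j+k) % n].
    """
    n = len(F)
    return all(
        F[i][j] * F[(i + j) % n][k] == F[j][k] * F[i][(j + k) % n]
        for i in range(n) for j in range(n) for k in range(n)
    )

F3 = factor_set(3, 7)
```

For n = 3 the first row consists of ones and the last row is 1, a, a, matching the shape of the matrix displayed above.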

Now we consider two prime⁶ cyclic products $A_1 = (F^{(n_1)}(a_1), Z_1/k, u_1)$ and
$A_2 = (F^{(n_2)}(a_2), Z_2/k, u_2)$. Their direct product is isomorphic to a cyclic algebra
with the splitting field $Z_1Z_2$. This field $Z_1Z_2$ is cyclic over k, because we assumed
$A_1$ prime to $A_2$. The Galois group of $Z_1Z_2/k$ is isomorphic to the direct
product of the Galois groups $\{S_1\}$ and $\{S_2\}$ of $Z_1/k$ and $Z_2/k$. For this reason
we shall represent $A_1 \times A_2$ by the symbol

\[ (F^{(n_1)}(a_1) \otimes F^{(n_2)}(a_2),\ Z_1Z_2/k,\ \{u_1, u_2\}), \]

where $F^{(n_1)}(a_1) \otimes F^{(n_2)}(a_2)$ is an abbreviation for the factor set arising from the
$(n_1n_2)^2$ products

\[ u_1^{m_1} u_2^{m_2} = a_1^{g_1} a_2^{g_2} \cdot u_1^{w_1} u_2^{w_2}, \]

if $m_1 = g_1n_1 + w_1$ and $m_2 = g_2n_2 + w_2$. This factor set $F^{(n_1)}(a_1) \otimes F^{(n_2)}(a_2)$ has
the form

\[ \begin{pmatrix} F^{(n_1)}(a_1) & F^{(n_1)}(a_1) & \cdots & F^{(n_1)}(a_1) \\ \vdots & \vdots & & a_2 \cdot F^{(n_1)}(a_1) \\ \vdots & \vdots & & \vdots \\ F^{(n_1)}(a_1) & a_2 \cdot F^{(n_1)}(a_1) & \cdots & a_2 \cdot F^{(n_1)}(a_1) \end{pmatrix} \]


with $n_1n_2$ rows and columns. It is the multiplication table belonging to the $n_1n_2$
elements of the complex $\{u_1, u_2\}$ in the ordering

\[ 1,\ u_1,\ u_1^2,\ \cdots,\ u_1^{n_1-1},\ u_2,\ u_2^2,\ \cdots,\ u_2^{n_2-1},\ u_1u_2,\ u_1^2u_2,\ \cdots,\ u_1^{n_1-1}u_2^{n_2-1}. \]

The matrix $F^{(n_2)}(a_2) \otimes F^{(n_1)}(a_1)$ is generated as the tableau of

\[ 1,\ u_2,\ u_2^2,\ \cdots,\ u_2^{n_2-1},\ u_1,\ u_1^2,\ \cdots,\ u_1^{n_1-1},\ u_2u_1,\ u_2^2u_1,\ \cdots,\ u_2^{n_2-1}u_1^{n_1-1}. \]

Obviously $F^{(n_2)}(a_2) \otimes F^{(n_1)}(a_1)$ can be obtained from $F^{(n_1)}(a_1) \otimes F^{(n_2)}(a_2)$ by
interchange of rows and columns, and vice versa.

Furthermore, it is required that $u_2^{-1}z_1u_2 = z_1$ and $u_1^{-1}z_2u_1 = z_2$ for arbitrary
elements $z_1$, $z_2$ (≠ 0) in $Z_1$ and $Z_2$. We shall not reduce this representation of
$A_1 \times A_2$ to the cyclic form $(F^{(n_1n_2)}(a_{12}), Z_1Z_2/k, u_{12})$.

If we form the union Z of infinitely many finite algebraic number fields Z; 


6 Two cyclic products are said to be prime if their degrees are prime to each other.
























which are prime over k and cyclic, this field Z is an infinite cyclic extension of k.
The direct product z of all the cyclic Galois groups of the fields $Z_i/k$ is a group
of automorphisms of the field Z/k. It is contained in the most general group of
automorphisms of Z/k as introduced by J. Herbrand⁷ and W. Krull⁸ by topological
methods. All elements of z are of finite order by definition of the direct
product.

Now let us consider infinitely many crossed products $A_i = (F^{(n_i)}(a_i), Z_i/k, u_i)$,
which are prime with respect to k. Then the infinite complex $\{u_1, u_2, \cdots\}$
with the relations

\[ u_{i_1}^{m_{i_1}} \cdots u_{i_j}^{m_{i_j}} = u_{i_1}^{w_{i_1}} \cdots u_{i_j}^{w_{i_j}}\, a_{i_1}^{g_{i_1}} \cdots a_{i_j}^{g_{i_j}} \]

with $m_{i_j} = g_{i_j}n_{i_j} + w_{i_j}$ for every finite product of operators forms a crossed
representation of the group z. The factor set arising above may be called
$F^{(n_1)}(a_1) \otimes F^{(n_2)}(a_2) \otimes \cdots$. We define then the infinite algebra

\[ A = (F^{(n_1)}(a_1) \otimes F^{(n_2)}(a_2) \otimes \cdots,\ Z/k,\ \{u_1, u_2, \cdots\}) \]
. e e . 
as the set of all finite sums >> Rees «++ o'0q 2+ 0q Beg **° Ba, We, *** Me, With ole- 
o,f Sy 
ments k... in the field k. , 
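The laws of composition are completely defined by the relations

(i) \[ \prod_{\nu=1}^{j} u_{i_\nu}^{m_{i_\nu}} = \prod_{\nu=1}^{j} a_{i_\nu}^{g_{i_\nu}} \prod_{\nu=1}^{j} u_{i_\nu}^{w_{i_\nu}}, \]

(ii) $u_j^{-1} z_i u_j = z_i$ ($i \ne j$) for every $z_i \ne 0$ of $Z_i$;
     $u_i^{-1} z_i u_i = z_i^{S_i}$ for every $z_i \ne 0$ of $Z_i$.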






We shall call such an infinite crossed product with cyclic Z/k a generalized cyclic
product. Then we have

THEOREM 7. The generalized cyclic products A are totally normal over the
field k.

Proof. It is obvious that the finite direct products $A_1 \times A_2 \times \cdots \times A_i$
form a defining totally normal sequence of A. Theorem 2 asserts that A is then
totally normal.

THEOREM 8. Every proper⁹ infinite totally normal algebra A over k can be
represented as a generalized cyclic product.

Proof. Let A be decomposed according to Theorem 4. The components are
finite simple normal algebras $A_i$ which are prime with respect to k. Theorem 7
leads to possible representations. We remark that these representations are
not at all uniquely determined.
















7 J. Herbrand, Extensions algébriques de degré infini, Math. Annalen, vol. 108 (1933),
pp. 699-717.

8 W. Krull, Galoissche Theorie der unendlichen algebraischen Erweiterungen, Math.
Annalen, vol. 100 (1928), pp. 687-698.

9 That is to say an algebra which is not isomorphic to an infinite matric algebra M.













2. Arithmetic theory. We first introduce some notions already known in
the theory of infinite algebraic number fields.¹⁰ Let $k = A_0 \subset \cdots \subset A_{i-1} \subset
A_i \subset \cdots$ be a defining sequence of normal simple systems of the totally normal
infinite algebra A over k.

DEFINITION. We say that the series $R_0 \subset \cdots \subset R_{i-1} \subset R_i \subset \cdots$ of
rings $R_i$ in $A_i$ is a sequence “belonging to $A = \{A_i\}$” if and only if

(i) the rings $R_i$ are of maximal rank in $A_i$,

(ii) $A_{i-1} \cap R_i = R_{i-1}$.

We consider also ideals $\mathfrak{a}_i$ in the rings $R_i$: left, right, and two-sided ideals.

DEFINITION. A series $\mathfrak{a}_0 \subset \cdots \subset \mathfrak{a}_{i-1} \subset \mathfrak{a}_i \subset \cdots$ of ideals is called a
sequence of ideals belonging to the sequence of rings $R_i$ if and only if

(i) $\mathfrak{a}_i \subset R_i$, $\mathfrak{a}_i R_i \subset \mathfrak{a}_i$,

(ii) $R_{i-1} \cap \mathfrak{a}_i = \mathfrak{a}_{i-1}$.

Condition (i) is to be changed according to the nature of the ideals.

Then we have

THEOREM 8. Between the rings R of A and the sequences of rings $\{R_i\}$ can be
established a one-to-one correspondence, which is explicitly given by

\[ \{R_i\} \to R = \sum R_i, \qquad R \to \{R \cap A_i = R_i\}. \]

THEOREM 9. Between the ideals $\mathfrak{a}$ of the rings R in A and the sequences of
ideals $\{\mathfrak{a}_i\}$ belonging to the sequence of rings $R = \{R_i\}$ there holds a one-to-one
correspondence according to the formulas

\[ \{\mathfrak{a}_i\} \to \mathfrak{a} = \sum \mathfrak{a}_i, \qquad \mathfrak{a} \to \{\mathfrak{a} \cap R_i = \mathfrak{a}_i\}. \]

For the proofs one has only to recall the different definitions.

DEFINITION. A ring J of A is called a maximal order if

(i) J contains the maximal order of k,

(ii) all elements of J satisfy minimal equations with coefficients in the
maximal order of k,

(iii) J is not contained in a larger ring, which fulfills the conditions (i)
and (ii).

We now construct special maximal orders of A. Let $A = A_1 \times A_2 \times \cdots
\times A_i \times \cdots$ be a representation of A according to Theorem 3. In each system
$A_i$ we fix an arbitrary maximal order $M_i$. Then we form successively the
direct products

\[ \begin{aligned} M_1 \times M_2 &\subset M_{12} \subset A_1 \times A_2, \\ M_{12} \times M_3 &\subset M_{123} \subset A_1 \times A_2 \times A_3, \\ &\ \ \vdots \\ M_{1 \cdots i-1} \times M_i &\subset M_{1 \cdots i} \subset A_1 \times A_2 \times \cdots \times A_i. \end{aligned} \]


10 W. Krull, Idealtheorie in unendlichen Zahlkörpern, Math. Zeitschrift, vol. 29 (1929),
pp. 12-54.





We denote by $M_{1 \cdots i}$ the uniquely determined¹¹ maximal order of $A_1 \times A_2 \times \cdots
\times A_i$ which contains the order $M_{1 \cdots i-1} \times M_i$. All these hypercomplex orders
have the highest possible rank.

We proceed to

THEOREM 10. The sequence of maximal orders $\{M_{1 \cdots i}\}$ defines a maximal
order J of A.

Proof. Let us assume that J is not a maximal order. Then J is contained
in at least one maximal order J'. Hence there must exist an element a of J'
which does not lie in J. Since a is an element of a maximal order J', its minimal
equation has integer coefficients in k. It must belong to a finite subalgebra of A.
Let $A_1 \times A_2 \times \cdots \times A_r$ be this algebra. Then $M' = \{J, a\} \cap A_1 \times A_2 \times \cdots
\times A_r \supset M_{1 \cdots r}$. This means that in the finite normal algebra $A_1 \times A_2 \times \cdots
\times A_r$ there must exist an order M' properly containing the maximal order
$M_{1 \cdots r}$, but this is clearly impossible.

THEOREM 11. There exist continuously many maximal orders in the algebra A.

Proof. This follows immediately by considering the construction of the
special maximal order J. At each step we can choose infinitely many maximal
orders $M_i$ in the algebras $A_i$ to form the maximal orders $M_{1 \cdots i}$ of an approximation.
The maximal orders $M_{1 \cdots i}$ are different for different $M_i$. To prove
this let M; and M; be two different maximal orders of A;. Assume that the 
uniquely determined maximal orders My... ; and Mjg...; in Ay X --- X Aj 
which imbed M;_; are equal. Then the intersections My... ; MA, X --- X Ais 
and Miz...; M Ay X «++ K Aj are equal. This intersection is obviously an 
order of Ay X +++ X Ajyy. It contains the two different maximal orders 
M; and M;. It therefore is equal to both, in contradiction to the assumption. 
This construction leads then to $\aleph_0^{\aleph_0} = \aleph$ different maximal orders of A.

Now we consider the decomposition of prime ideals p of the center k. It
suffices to consider the p-adic extensions $A_p$ of the given totally normal algebra
A.¹² We understand that the extension $A_p$ is the product modulus $A \cdot k_p$.
Obviously $A_p$ is a totally normal simple algebra with the center $k_p$.¹³ In the
representation $(A_1)_p \times \cdots \times (A_i)_p \times \cdots$ of $A_p$ as a direct product, we can
combine all finite local division algebras $(D_p)_i$ of $(A_i)_p$ into a local division
algebra $D_p$ which is also totally normal over $k_p$.

This reduces the problem of decomposition to the two partial problems: (1)
arithmetic in a totally normal division algebra $D_p$ over $k_p$, (2) arithmetic in
systems of matrices over such a division algebra. We prove

THEOREM 12. A totally normal division algebra $D_p$ over a p-adic number field
$k_p$ contains exactly one maximal order $O_p$, and there exists one prime ideal P in $D_p$.


11 T. Nakayama, Über das Produkt zweier einfachen Algebren mit zu einander teilerfremden
p-Indizen, Jap. Journal of Math., vol. 12 (1935).

12 E. Noether, Zerfallende verschränkte Produkte und ihre Maximalordnungen, Actualités
Scientifiques et Industrielles, 1934.

13 Consider the approximation by products of the $(A_i)_p$.








All other ideals are two-sided and they are determined by their values with respect
to P and their symbols.

Proof. Let $\cdots \subset (D_p)_{i-1} \subset (D_p)_i \subset \cdots$ be a defining sequence of finite
p-adic division algebras of degrees $n_i$ over $k_p$. These algebras contain exactly
one maximal order $(O_p)_i$ with one prime ideal $P_i$. The value with respect to
$P_i$ may be denoted by $v_{P_i}$.¹⁴ The sum $\sum (O_p)_i$ is obviously a maximal order
$O_p$ of the algebra $D_p$. By definition, $O_p$ contains all elements of $D_p$ whose
minimal equation has integer coefficients in the field $k_p$ and the highest coefficient
is one. Let $O'_p$ be another maximal order of $D_p$. All its elements satisfy
minimal equations with integer coefficients in $k_p$. Therefore $O'_p \subset O_p$. But
$O'_p$ was assumed to be a maximal order. Hence $O'_p = O_p$.

The sequence $\cdots \subset P_{i-1} \subset P_i \subset \cdots$ of prime ideals defines the prime ideal P
of the maximal order $O_p$. Let $\mathfrak{a}$ be an ideal in $O_p$. Its defining sequence
$\{\mathfrak{a}_i = \mathfrak{a} \cap (O_p)_i\}$ of ideals in $(O_p)_i$ consists of two-sided ideals. Hence $\mathfrak{a}$ itself
is two-sided. The value $v_P$ with respect to the prime ideal P is defined by $v_P(\alpha)
= \lim v_{P_i}(\alpha) \cdot n_i^{-1}$, if $\alpha$ lies in the algebra $(D_p)_i$ and not in $(D_p)_{i-1}$. It is $v_P(\alpha) =
v_{P_i}(\alpha) \cdot n_i^{-1}$. The value $v_P(\mathfrak{a})$ of an ideal $\mathfrak{a}$ is defined by $\liminf_{\alpha \in \mathfrak{a}} v_P(\alpha)$. The
symbol $s(\mathfrak{a})$ is equal to f (finite) if and only if there exists an element $\alpha$ in $\mathfrak{a}$
such that $v_P(\alpha) = v_P(\mathfrak{a})$. The symbol $s(\mathfrak{a}) = i$ (infinite) in all other cases.
Then all ideals of $O_p$ are uniquely determined by their values and symbols.¹⁵
This theory runs exactly as in the commutative case.

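If we consider a finite matric algebra $(D_p)_m$ over $D_p$, then $(O_p)_m$ is a maximal
order of $(D_p)_m$.

THEOREM 13. The two-sided prime ideal P of $(O_p)_m$ is given by

\[ P(O_p)_m = \begin{pmatrix} P & \cdots & P \\ \vdots & & \vdots \\ P & \cdots & P \end{pmatrix}^{(m)} \]

and it is the common part of m left ideals $L_j$ which are given in the form

\[ L_j = \begin{pmatrix} O_p & \cdots & P & \cdots & O_p \\ \vdots & & \vdots & & \vdots \\ O_p & \cdots & P & \cdots & O_p \end{pmatrix}^{(m)}, \]

the P's being in column j.

Proof. These facts are quite obvious.¹⁶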

Now we consider the product $A_p$ of infinitely many matric algebras $(k_p)_{m_j}$
with the totally normal division algebra $D_p$.


14 For the notion of value, etc., in division algebras see H. Hasse, Über p-adische Schiefkörper
und ihre Bedeutung für die Arithmetik hyperkomplexer Zahlen, Math. Annalen, vol.
104 (1930), pp. 495-534. See this paper also for the proofs of the following theorems.

15 Cf. footnote 10. The algebra $D_p$ is by no means perfect with respect to the valuation $v_P$.

16 Cf. footnote 14.





THEOREM 14. The two-sided prime ideal P of the maximal order $O_p \times \prod (o_p)_{m_j}
= J$ is divisible by continuously many left ideals in J. ($o_p$ is the maximal order
of $k_p$.)

Proof. The series $o_p \subset \cdots \subset O_p \times \prod_{j=1}^{i-1} (o_p)_{m_j} \subset O_p \times \prod_{j=1}^{i} (o_p)_{m_j} \subset \cdots$
is a defining sequence for a maximal order J of $A_p$. The ideals $P \times O_p \times
\prod_{j=1}^{i} (o_p)_{m_j}$ are two-sided prime ideals of the maximal orders $O_p \times \prod_{j=1}^{i} (o_p)_{m_j}$.
They define the two-sided prime ideal P of J. Let $L_i$ be one of the left divisors
of $P \times O_p \times \prod_{j=1}^{i} (o_p)_{m_j}$ in $O_p \times \prod_{j=1}^{i} (o_p)_{m_j}$. Their number is surely greater
than one. Then all of these ideals $L_i$ are divisible by more than one left ideal
$L_{i+1}$ of $O_p \times \prod_{j=1}^{i+1} (o_p)_{m_j}$. This construction leads again to at least $2^{\aleph_0} = \aleph$ left
divisors of P.

Remark. A combination of these results with the explicit construction of
finite division algebras shows that there exist totally normal division algebras
in which exactly two prime ideals p and q are totally ramified, i.e., $D_p$ and $D_q$ are
both infinite division algebras. There also exist totally normal division algebras,
all p-adic extensions of which are equivalent to the product of a finite
division algebra with an infinite product of matric algebras.


THE INSTITUTE FOR ADVANCED STUDY.








AN EXTENDED ARITHMETIC 
By GARRETT BIRKHOFF


1. Introduction. In this paper there are defined three combinatorial operations
upon partially ordered sets X and Y resulting in partially ordered sets
which will be denoted by $X + Y$, $X \cdot Y$, and $X^Y$ respectively.

In the case where X and Y are finite unordered sets with cardinal numbers
m and n respectively, the operations yield the finite unordered sets of cardinal
numbers $m + n$, $mn$, and $m^n$ in the commonplace sense. In the more general
case where the requirement of finiteness is dropped, they yield the usual foundations
for the arithmetic of general cardinal numbers.¹

It will be proved that the formal properties of general cardinal arithmetic²
persist as laws of composition for our extended arithmetic of general partially 
ordered sets. On the other hand, no algorithm for well-ordering the class of 
partially ordered systems is given, and so the theorem of transfinite arithmetic 
which asserts that the cardinal numbers are well-ordered has no analogue. 

It will also be shown that our extended arithmetic is of considerable use in 
describing algebraic systems. Thus although it differs from Hausdorff’s ordinal 
arithmetic when applied to sequences, it is apparently more consequential than 
the latter. 


2. The extended arithmetic. The extended arithmetic which is proposed will 
be described by defining first the domain of elements to which its operations 
apply, and then defining the resultants of its operations. 

The elements are partially ordered systems in the usual sense of Hausdorff,
that is, systems X, Y, Z, ··· whose members (denoted by small Latin letters)
are related by an inclusion relation $x \le x'$ satisfying

P1: $x \le x$. (Reflexiveness)
P2: $x \le x'$ and $x' \le x$ imply $x = x'$. (Anti-symmetry)
P3: $x \le x'$ and $x' \le x''$ imply $x \le x''$. (Transitivity)

Two partially ordered systems X and Y will be called isomorphic (written


Received May 22, 1936; in revised form, December 31, 1936. 

1 Cf. F. Hausdorff’s Mengenlehre, Berlin, 1927, p. 29, p. 62.

2 The possibility of such relations as $x + 1 = x$, $2x = x$, and $x^2 = x$ distinguishes general
from finite cardinal arithmetic.

3 Although transfinite numbers have numerous uses, the only constructions of sums,
products, or powers of transfinite numbers which have been really interesting hitherto
have been those of (1) the power of the continuum as $2^{\aleph_0}$ ($\aleph_0$ denotes countable infinity),
and (2) transfinite ordinals such as $\omega + 1$, $\omega 3$, or $\omega^2$ by ordinal addition and multiplication,
and only the second of these is lost in our extended arithmetic.


X = Y in this article) if and only if there exists a one-one correspondence
between their elements which preserves inclusion.

By the sum $X + Y$ of two such systems X and Y is meant the system whose
members include both the members x of X and the members y of Y, in which
$x \le x'$ and $y \le y'$ keep their meaning, while $x \le y$ and $y \le x$ are always denied.

By the product $X \cdot Y$ of X and Y is meant the system whose members are
all couples $[x, y]$ with $x \in X$ and $y \in Y$, and in which $[x, y] \le [x', y']$ means
that $x \le x'$ in X and $y \le y'$ in Y.

By the power $X^Y$ of one such system X with respect to a second such system
Y as exponent is meant the system whose members are the monotonic functions
$f(y)$ with domain Y and range in X (that is, all functions such that $y \le y'$ in Y
implies $f(y) \le f(y')$ in X), ordered by having $f \le g$ mean that $f(y) \le g(y)$
for all y.

The reader will have no difficulty in verifying that $X + Y$, $X \cdot Y$ and $X^Y$
are partially ordered systems. Further, if X and Y are lattices,⁴ then so is $X \cdot Y$.
Moreover any laws such as the modular and distributive laws which hold in
X and Y hold in $X \cdot Y$. While if X is a lattice and n is the power of Y, then
$X^Y$ is a sublattice of $X^n$, and so $X^Y$ is a modular resp. distributive lattice if X is.

The proofs of these facts will be omitted. Also, we shall not prove that the 
above definitions yield an extension of the cardinal arithmetic of Hausdorff— 
this is evident if one looks at Hausdorff’s definitions (loc. cit.). 
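
For finite systems the three operations can be carried out mechanically. The following Python sketch is one possible encoding, not taken from the paper: a partially ordered system is stored as a list of members together with a function deciding inclusion, and the class and function names are illustrative only.

```python
from itertools import product

class Poset:
    """A finite partially ordered system: members plus an inclusion test."""
    def __init__(self, elements, leq):
        self.elements = list(elements)
        self.leq = leq

def unordered(n):
    """The unordered aggregate of n elements."""
    return Poset(range(n), lambda a, b: a == b)

def chain(n):
    """The sequence C_n of n elements."""
    return Poset(range(n), lambda a, b: a <= b)

def poset_sum(X, Y):
    """X + Y: members of X and of Y, never comparable across the two."""
    elems = [(0, x) for x in X.elements] + [(1, y) for y in Y.elements]
    def leq(p, q):
        if p[0] != q[0]:
            return False
        return (X if p[0] == 0 else Y).leq(p[1], q[1])
    return Poset(elems, leq)

def poset_product(X, Y):
    """X . Y: couples [x, y], ordered componentwise."""
    elems = list(product(X.elements, Y.elements))
    return Poset(elems, lambda p, q: X.leq(p[0], q[0]) and Y.leq(p[1], q[1]))

def poset_power(X, Y):
    """X^Y: monotonic functions from Y to X, stored as tuples of values."""
    ys = Y.elements
    funcs = []
    for values in product(X.elements, repeat=len(ys)):
        f = dict(zip(ys, values))
        if all(X.leq(f[y], f[y2]) for y in ys for y2 in ys if Y.leq(y, y2)):
            funcs.append(values)
    return Poset(funcs, lambda f, g: all(X.leq(a, b) for a, b in zip(f, g)))

def is_partial_order(P):
    """Check P1 (reflexiveness), P2 (anti-symmetry), P3 (transitivity)."""
    E, leq = P.elements, P.leq
    p1 = all(leq(x, x) for x in E)
    p2 = all(x == y for x in E for y in E if leq(x, y) and leq(y, x))
    p3 = all(leq(x, z) for x in E for y in E for z in E
             if leq(x, y) and leq(y, z))
    return p1 and p2 and p3
```

With unordered sets of 2 and 3 elements the three operations give 5, 6 and 8 members, the ordinary sum, product and power; with C_2 as base and C_3 as exponent the power has only 4 members, since only monotonic functions are admitted.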


3. Applications. It is interesting to consider various arithmetic combina- 
tions of especially simple partially ordered systems, which have an independent 
algebraic importance. 

As regards the simple systems, we shall let n denote the unordered aggregate
of n elements, $C_n$ the sequence of n elements (n finite), symbols $\aleph_\alpha$ the transfinite
cardinals (= unordered aggregates), and adopt the conventional notation
for transfinite ordinals. Finally, we shall let $B = C_2$ denote the Boolean
algebra of two elements, and $P_n$ the one-dimensional projective geometry with n
points on its line. Then


(3.1) The finite Boolean algebras are the $B^n$.

(3.2) The Boolean algebra of all subsets of any aggregate of power $\aleph$ is $B^{\aleph}$.

(3.3) The finite distributive lattices are the $B^X$, where X denotes a variable
finite partially ordered set.

(3.4) The "quotient-lattice" associated by Ore⁵ with each abstract lattice L
is $L^B$.


⁴ By a "lattice" is meant a partially ordered system X in which any two elements x
and x' have a g.l.b. $x \cap x'$ such that $x'' \le x \cap x'$ means that $x'' \le x$ and $x'' \le x'$, and
a l.u.b. $x \cup x'$ such that $x'' \ge x \cup x'$ means that $x'' \ge x$ and $x'' \ge x'$. By the modular
law is meant the law that $x \le x''$ implies $x \cup (x' \cap x'') = (x \cup x') \cap x''$; the distributive
law asserts that $(x \cup x') \cap (x' \cup x'') \cap (x'' \cup x) = (x \cap x') \cup (x' \cap x'') \cup (x'' \cap x)$.

5 On the foundations of abstract algebra. I, Annals of Math., vol. 36 (1935), p. 425. Ore 
calls lattices structures. 








AN EXTENDED ARITHMETIC 313 


(3.5) The integers ordered with respect to divisibility (the relation m | n) are a
sublattice of $\omega^{\aleph_0}$.

(3.6) The "free" Boolean algebra generated by n symbols is $B^{2^n}$.

(3.7) The "free" distributive lattice generated by n symbols and with 0 and I
added, is $B^{B^n}$.

(3.8) The “free’’ modular lattice generated by three symbols is a sublattice 
of B’-P; ° 

(3.9) The most general configuration generated in n-space by an r-plane and 
an s-plane through the origin, after iterated sections, linear sums and taking 
of orthogonal complements, is B*:P,. 


The proofs of (3.1)-(3.9) are various. Assertions (3.1), (3.2) and (3.6) are
known.⁶ Assertions (3.3) and (3.7) are proved in the author's article, Rings
of sets, which will appear in the next issue of this journal. Inspection of Ore's
definition, which is identical with our definition of $L^B$, yields (3.4). Statement
(3.5) is a corollary of the unique representation of any integer as a product
of powers of ascending primes. The result (3.8) has been proved by the author
(On the structure of abstract algebras, Proc. Camb. Phil. Soc., vol. 31 (1935), p.
443, Theorem 14), while (3.9) has been recently proved by J. von Neumann;
the proof will be published elsewhere.
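
Assertion (3.7) lends itself to a small numerical check. The sketch below, with the hypothetical helper `count_monotone_to_B`, counts by brute force the monotonic functions from the Boolean algebra of all subsets of an n-element set (subsets coded as bitmasks, an assumption of this sketch) into the two-element chain; by (3.7) this should be the number of elements of the free distributive lattice on n symbols with 0 and I added.

```python
from itertools import product

def count_monotone_to_B(n):
    """Count monotonic maps from the subsets of an n-set (as bitmasks) to {0 < 1}."""
    points = range(2 ** n)
    # a <= b in the subset lattice exactly when bitmask a is contained in b
    pairs = [(a, b) for a in points for b in points if a & b == a]
    count = 0
    for values in product((0, 1), repeat=2 ** n):
        if all(values[a] <= values[b] for a, b in pairs):
            count += 1
    return count

# n = 0, 1, 2, 3 give 2, 3, 6, 20 elements respectively
sizes = [count_monotone_to_B(n) for n in range(4)]
```

For n = 2 the six elements can be listed by hand: 0, the meet of the two symbols, each symbol, their join, and I. The enumeration grows doubly exponentially, so this brute-force count is practical only for very small n.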


4. Arithmetic identities. It is obvious that addition and multiplication are 
commutative—in symbols, that 


(4.1) $X + Y = Y + X$ and $X \cdot Y = Y \cdot X$.
They are also associative, that is,
(4.2) $X + (Y + Z) = (X + Y) + Z$ and $X \cdot (Y \cdot Z) = (X \cdot Y) \cdot Z$.


For both $(X + Y) + Z$ and $X + (Y + Z)$ consist of all members of either
X, Y, or Z, with the provision that $x \le x'$, $y \le y'$, and $z \le z'$ preserve their
meaning, while $x \le y$, $x \ge y$, $y \le z$, $y \ge z$, $z \le x$, and $z \ge x$ are always denied.
And both $X \cdot (Y \cdot Z)$ and $(X \cdot Y) \cdot Z$ consist of all triples $[x, y, z]$ with $x \in X$,
$y \in Y$ and $z \in Z$, where $[x, y, z] \le [x', y', z']$ means that $x \le x'$, $y \le y'$ and $z \le z'$.
Again


(4.3) $X \cdot (Y + Z) = X \cdot Y + X \cdot Z$ and $(X + Y) \cdot Z = X \cdot Z + Y \cdot Z$.


(In words, multiplication is distributive with respect to addition.) For
$X \cdot (Y + Z)$ and $X \cdot Y + X \cdot Z$ alike consist of all couples $[x, y]$ and $[x, z]$ with
$x \in X$, $y \in Y$, and $z \in Z$, where $[x, y] \le [x', y']$ means that $x \le x'$ and $y \le y'$,
$[x, z] \le [x', z']$ means that $x \le x'$ and $z \le z'$, while $[x, y] \le [x', z]$ and $[x, y] \ge
[x', z]$ are always denied. This proves right-distributivity; left-distributivity
follows by commutativity.

⁶ For instance (3.1) is proved as Theorem 25.1 of the author's On the combination of
subalgebras, Proc. Camb. Phil. Soc., vol. 29 (1933), p. 460; (3.6) follows from this and
E. Schröder's Algebra der Logik; (3.2) is immediately obvious.










Again, although exponentiation is not commutative (in general, $X^Y \ne Y^X$),
it satisfies the usual identities


(4.4) $X^Y \cdot X^Z = X^{Y+Z}$ and $X^Z \cdot Y^Z = (X \cdot Y)^Z$,
(4.5) $X^1 = X$ and $(X^Y)^Z = X^{Y \cdot Z}$.


The first identity of (4.4) is easy to prove. The system $X^Y X^Z$ consists of all
couples $[f, g]$ where f and g have Y resp. Z for domains, and so are equivalent
(Y and Z being non-overlapping in $Y + Z$) to a single function h with domain
$Y + Z$. Moreover $[f, g] \le [f', g']$ means that for all $y \in Y$ and $z \in Z$, $f(y) \le
f'(y)$ and $g(z) \le g'(z)$; that is, that for all $u \in (Y + Z)$, $h(u) \le h'(u)$. Hence
$X^Y \cdot X^Z = X^{Y+Z}$.

Again, $X^Z Y^Z$ is the set of all function-couples $[f, g]$, where f and g are from
Z to X resp. Z to Y, and $[f, g] \le [f', g']$ means that $f(z) \le f'(z)$ and $g(z) \le g'(z)$
for all z. But each such $[f, g]$ can be regarded as a function h carrying each
$z \in Z$ into $[f(z), g(z)] \in XY$; and so, since $h \le h'$ if and only if $[f, g] \le [f', g']$,
$X^Z Y^Z = (XY)^Z$.

That $X^1 = X$ is obvious; it is a corollary (using (4.4)) that $X^2 = XX$,
$X^3 = XXX$, $\cdots$, and that $X^m X^n = X^{m+n}$.

Actually, the second half of (4.5) is not simple to prove, and we shall start
by analyzing $X^{YZ}$. By definition, this consists of all monotonic functions f
with arguments $[y, z]$ and values x [$x \in X$, $y \in Y$, $z \in Z$]. Such functions f associate
with each $z \in Z$ a function $g_z$ with arguments $y \in Y$ and values $g_z(y) =
f([y, z])$ in X. But for any z, $y \le y'$ implies $[y, z] \le [y', z]$ in YZ, hence


$g_z(y) = f([y, z]) \le f([y', z]) = g_z(y')$


and so $g_z$ is in $X^Y$.
Furthermore, if $z \le z'$ and y is fixed, then


$g_z(y) = f([y, z]) \le f([y, z']) = g_{z'}(y)$,


and so the correspondence $f: z \to g_z$ is monotonic, which makes $X^{YZ}$ a subset
S of $(X^Y)^Z$. While $f \le f'$ if and only if $f([y, z]) \le f'([y, z])$ for all $[y, z]$, which
means that for all z, $g_z(y) \le g'_z(y)$ for all y, which means that for all z, $g_z \le g'_z$;
and so $X^{YZ}$ is isomorphic with S. But conversely, given $f \in (X^Y)^Z$, if $y \le y'$
and $z \le z'$, then $g_z(y) \le g_{z'}(y) \le g_{z'}(y')$, and so $f \in X^{YZ}$, which completes the
proof of the fact that $X^{YZ}$ and $(X^Y)^Z$ are isomorphic.
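
The correspondence used in this proof, which sends a function of couples [y, z] to the family of functions of y alone indexed by z, can be checked mechanically on a small case. The following sketch assumes three hypothetical finite chains for X, Y and Z; the names `monotone_maps` and `curry` are illustrative only.

```python
from itertools import product

def monotone_maps(dom, dom_leq, cod, cod_leq):
    """All monotonic functions dom -> cod, each stored as a tuple of values in dom order."""
    maps = []
    for values in product(cod, repeat=len(dom)):
        f = dict(zip(dom, values))
        if all(cod_leq(f[a], f[b]) for a in dom for b in dom if dom_leq(a, b)):
            maps.append(values)
    return maps

# Assumed example systems: X = 0<1<2, Y = 0<1, Z = 0<1 (all chains).
X, Y, Z = [0, 1, 2], [0, 1], [0, 1]
leq = lambda a, b: a <= b

# Monotonic functions of the couples [y, z] with values in X.
YZ = [(y, z) for y in Y for z in Z]
yz_leq = lambda a, b: a[0] <= b[0] and a[1] <= b[1]
X_YZ = monotone_maps(YZ, yz_leq, X, leq)

# Monotonic functions z -> g_z, where each g_z is a monotonic function Y -> X.
X_Y = monotone_maps(Y, leq, X, leq)
pointwise = lambda g, h: all(a <= b for a, b in zip(g, h))
XY_Z = monotone_maps(Z, leq, X_Y, pointwise)

def curry(f_values):
    """Send f to the tuple (g_z for z in Z), where g_z(y) = f([y, z])."""
    f = dict(zip(YZ, f_values))
    return tuple(tuple(f[(y, z)] for y in Y) for z in Z)

curried = {curry(f) for f in X_YZ}
same_elements = curried == set(XY_Z)
```

Here `same_elements` records that every curried function is monotonic in z with monotonic values, and that every such family arises, which is the one-one correspondence of the proof for this particular case.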


5. Decomposition theorems. It is evident that if $X = X_1 + \cdots + X_r =
Y_1 + \cdots + Y_s$ is any partially ordered system, written in two ways as the
sum of additive components, then by set-theory $X = Z_{11} + \cdots + Z_{rs}$, where
$Z_{ij} = X_i \cap Y_j$ denotes the set of elements common to $X_i$ and $Y_j$, ordered as
in X. For if $x \in Z_{ij}$ and $x \le x'$, then $x' \in X_i$ and $x' \in Y_j$, whence $x' \in Z_{ij}$.

It follows that if X is finite, it has one and only one representation $X =
X_1 + \cdots + X_r$ as the sum of (non-void) partially ordered sets not themselves
further reducible. Actually, if we regard the elements of X as vertices of a


















graph, and join two vertices x and x' when $x < x'$ or $x > x'$ and no element z
with $x < z < x'$ or $x > z > x'$ can be interpolated between x and x', then the
irreducible additive components of X are the connected components of its graph.
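
The graph description gives a direct procedure for finding the additive components of a finite system. A minimal sketch follows, assuming the strict order is supplied as a callback `less(a, b)`; the function name `additive_components` and the divisibility example are illustrative, not part of the text.

```python
def additive_components(elements, less):
    """Connected components of the graph joining x and x' when one covers the other."""
    elements = list(elements)

    def covers(a, b):
        # a < b with no element interpolated strictly between them
        return less(a, b) and not any(less(a, z) and less(z, b) for z in elements)

    neighbours = {e: set() for e in elements}
    for a in elements:
        for b in elements:
            if covers(a, b):
                neighbours[a].add(b)
                neighbours[b].add(a)

    seen, components = set(), []
    for start in elements:
        if start in seen:
            continue
        stack, comp = [start], set()
        while stack:  # depth-first search from `start`
            v = stack.pop()
            if v in comp:
                continue
            comp.add(v)
            stack.extend(neighbours[v] - comp)
        seen |= comp
        components.append(comp)
    return components

# Assumed example: a few integers ordered by proper divisibility.
divides_strictly = lambda a, b: a != b and b % a == 0
parts = additive_components([2, 3, 4, 5, 9, 25, 7], divides_strictly)
```

In this example the components are the powers of each prime taken separately, with 7 forming a component by itself.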

The author has proved elsewhere⁷ that any finite lattice L has a similar unique
representation $L = L_1 \cdots L_r$ as the product of constituents not themselves
further reducible. The prime-factor theorem of arithmetic asserts that the
same conclusion is valid if L is a totally unordered aggregate.

Can we unite both these results in the single assertion that each finite partially
ordered system X can be written in one and only one way as a product $X_1 \cdots X_r$
of "prime" (= indecomposable) factors? It is quite possible.


6. Solution of equations. One naturally asks if it is true that in the finite 
case, the rules that 


(6.1) X+Y=X+Z implies Y =Z, 
(6.2) X-Y=X-Z implies Y = Z, 


which are true for pure cardinals and ordinals, are valid for general partially 
ordered sets. 

It is easy to prove (6.1) by appealing to the result that every finite partially 
ordered system X has a unique decomposition into a sum of irreducible com- 
ponents. For, making this decomposition, we have 


$X + Y = (X_1 + \cdots + X_r) + (Y_1 + \cdots + Y_s)$,
$X + Z = (X_1 + \cdots + X_r) + (Z_1 + \cdots + Z_t)$.


One sees that by ordinary subtraction, each kind of component must occur 
in Y and in Z alike, the number of times it occurs in X + Y = X + Z minus 
the number of times it occurs in X, whence Y = Z. 

This argument would prove (6.2) similarly if we knew that the proposition 
stated at the end of §5 was true; in any event, it holds for lattices, since they 
have unique decompositions into irreducible multiplicative factors. 

If it held in general, would it imply that one could, by introducing negatives 
and quotients as ideal elements, extend our arithmetic to form an abstract 
field of elements of the symbolic form (X — Y)/Z? This seems highly 
improbable. 


7. Lexicographic combinations. Let X be any partially ordered system
whose members are themselves (not necessarily distinct) partially ordered
systems $Y_x$.

We shall define $\sum_X Y_x$ (in words, the lexicographic sum of the $Y_x$ relative to
the ordering X) as the system whose members are the $y_x \in Y_x$ [$x \in X$], in which
$y_x \le y'_x$ preserves its meaning in $Y_x$, and $y_x < y_{x'}$ [$x \ne x'$] means that $x < x'$.


⁷ On the lattice theory of ideals, Bull. Am. Math. Soc., vol. 40 (1934), p. 616, Theorem 2
and its corollaries.










Similarly, we shall define $\prod_X Y_x$ (in words, the lexicographic product of the
$Y_x$ relative to the ordering X) as the system whose members are the functions f
carrying each $x \in X$ into an $f(x) \in Y_x$, and in which $f \le g$ means that for every $x_0$,
either $f(x_0) \le g(x_0)$ or there exists an $\bar{x} < x_0$ such that $f(\bar{x}) < g(\bar{x})$ and $f(x) \le
g(x)$ for all $x < \bar{x}$.

In case X is totally unordered, $\sum_X Y_x$ and $\prod_X Y_x$ are the cardinal sums and
products of Hausdorff extended as in §3. In case the $Y_x$ are sequences arranged
in the sequential order X, $\sum_X Y_x$ and $\prod_X Y_x$ are the ordinal sums and products
of Hausdorff (loc. cit.).

If all the $Y_x$ are isomorphic (that is, letting them all be a particular partially
ordered system Y) we get a new binary operation of exponential type. For
although the lexicographic sum $\sum_X Y$ of the occurrences of Y relative to the
ordering X is simply the lexicographic product $\prod_{X<Y} X \cdot Y$ of X and Y relative
to the ordering X < Y, the lexicographic product $\prod_X Y$, which we may denote
lex $Y^X$, is a new system which could not easily be defined otherwise. Its elements
are the different "words" f with letters from X, one in each position of Y,
and ordered in lexicographic order.

Lexicographic combination is non-commutative, and seems to have no interest 
aside from the fact that lexicographic combinations of simply ordered (or well- 
ordered) systems are always themselves simply ordered (resp. well-ordered). 
This property makes them useful in ordering words in dictionaries. 
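
The dictionary ordering can be tested against the general rule for the lexicographic product, here read as: f ≤ g when, at every position where f exceeds or is incomparable with g, some earlier position already has f strictly below g with nothing out of order before it. The sketch below is an illustration under that reading; `lex_leq` and `word_leq` are hypothetical names, and words are assumed to be stored as dictionaries from position to letter.

```python
from itertools import product

def lex_leq(f, g, positions, pos_less, letter_leq):
    """f <= g in the lexicographic product, for words given as dicts position -> letter."""
    def letter_less(a, b):
        return letter_leq(a, b) and a != b

    for x0 in positions:
        if letter_leq(f[x0], g[x0]):
            continue
        # otherwise an earlier position must already decide in favour of g
        decided = any(
            pos_less(xb, x0)
            and letter_less(f[xb], g[xb])
            and all(letter_leq(f[x], g[x]) for x in positions if pos_less(x, xb))
            for xb in positions
        )
        if not decided:
            return False
    return True

def word_leq(u, v):
    """Compare two equal-length strings, positions and letters both simply ordered."""
    positions = range(len(u))
    return lex_leq(dict(enumerate(u)), dict(enumerate(v)),
                   positions, lambda a, b: a < b, lambda a, b: a <= b)

# For simply ordered positions and letters this agrees with dictionary order.
words = ["".join(w) for w in product("ab", repeat=3)]
agrees = all(word_leq(u, v) == (u <= v) for u in words for v in words)
```

When both the positions and the letters are simply ordered, any two words are comparable, in line with the remark that lexicographic combinations of simply ordered systems are simply ordered.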


HARVARD UNIVERSITY.
















THE IMBEDDING OF RIEMANN SPACES IN THE LARGE 
By CARL B. ALLENDOERFER


1. The “classical” theory which describes the properties of an n-dimensional 
Riemann space immersed in an (n + p)-dimensional Euclidean space has its 
roots in the work of Voss and Ricci,¹ and is a natural generalization of the
properties of a two-dimensional surface in ordinary three-space. 

A more recent theory is that of W. Mayer, the latest refinement of which
appeared in the Transactions of the American Mathematical Society in 1935
(vol. 38, pp. 267-309). We shall refer to this paper as "M". This theory combines
the generalization of the properties of an ordinary surface with the generalization
of those of a curve in an n-dimensional Euclidean space. Thus the theory
speaks of the "first normal space" of an n-dimensional Riemann space as an
analogue of the first ("principal") normal of a curve. Second and higher
normal spaces also occur as extensions of the notions of the second and higher 
order normals of a curve. Associated with each normal space is a fundamental 
form, the form for the r-th normal space being of the 2(r + 1)-th degree. These 
forms give a complete description of the geometry of the normal spaces. Mayer 
has shown that the coefficients of these forms are not entirely independent, 
but satisfy certain relations. The first object of this paper is to prove some 
additional relations of this nature and to determine a set of them which is the 
necessary and sufficient condition on a number of forms that they describe a 
Riemann space which is actually imbedded in a Euclidean space. These results 
are stated in Theorem VII. In the course of the argument (§5) we prove a new 
set of identities in the curvature tensors of order higher than the first which are 
analogous to those for the classical Riemann tensor. 

The previous treatment of Mayer’s theory had the further defect that it was 
purely local, referring to an unspecified domain about a suitable point. In this 
paper the differential equations which occur are investigated with the purpose 
of finding the maximum region within which they may be integrated. We 
define singular points and show that the results of Theorem VII hold for any 
simply connected portion of the space which does not contain a singular point. 


Received December 1, 1936. 

¹ A. Voss, Zur Theorie der Transformation quadratischer Differentialausdrücke und der
Krümmung höherer Mannigfaltigkeiten, Math. Ann., vol. 16 (1880), pp. 129-179.

G. Ricci, Formole fondamentali nella teoria generale di varietà e della loro curvatura, Rend.
dei Lincei, vol. 11 (1902), pp. 355-362.

A modern exposition of the theory is given by Eisenhart, Riemannian Geometry, 1925, 
Chapter IV. 















This treatment is in the spirit of recent work by T. Y. Thomas² and W. Mayer
in similar connections, and many of the methods used here are due to those
writers.


2. Riemann manifolds. We shall consider a Riemann manifold, $R_n$, i.e., a
coördinate manifold endowed with the Riemann metric


(2.1) $ds^2 = g_{\alpha\beta}\, dx^\alpha dx^\beta$,


where the $g_{\alpha\beta}(x)$ are continuous functions having continuous derivatives to a
definite order. This order will be specified in the next section. We assume
that the determinant $|g_{\alpha\beta}|$ is positive definite. Let $y^i$ ($i = 1, \cdots, n + p$)
be the rectangular Cartesian coördinates of an (n + p)-dimensional Euclidean
space $E_{n+p}$. Then $R_n$ is said to be imbedded in $E_{n+p}$ if the equations

(2.2) $g_{\alpha\beta} = \frac{\partial y^i}{\partial x^\alpha} \frac{\partial y^i}{\partial x^\beta}$

have a solution

(2.3) $y^i = \varphi^i(x)$

in each coördinate neighborhood of $R_n$. We shall understand hereafter that
the coördinate system x is a typical coördinate system, not some special one,
and consequently an invariant equation such as (2.2) will be understood to hold
in all coördinate neighborhoods without further mention of this fact.
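
To make the imbedding condition concrete, one may compute numerically the metric that a given set of functions induces, namely the sum over i of the products of first partial derivatives of the y's. The sketch below is an illustration only: the unit sphere in three-space is an assumed example, `embedding` and `induced_metric` are hypothetical names, and the derivatives are approximated by central differences.

```python
import math

def embedding(theta, phi):
    """Assumed example: the unit sphere in Euclidean 3-space."""
    return (math.sin(theta) * math.cos(phi),
            math.sin(theta) * math.sin(phi),
            math.cos(theta))

def induced_metric(y, x, step=1e-6):
    """g[a][b] = sum over i of (dy^i/dx^a)(dy^i/dx^b), by central differences."""
    n = len(x)
    jac = []  # jac[a][i] approximates dy^i / dx^a
    for a in range(n):
        hi, lo = list(x), list(x)
        hi[a] += step
        lo[a] -= step
        y_hi, y_lo = y(*hi), y(*lo)
        jac.append([(p - q) / (2 * step) for p, q in zip(y_hi, y_lo)])
    return [[sum(u * v for u, v in zip(jac[a], jac[b])) for b in range(n)]
            for a in range(n)]

g = induced_metric(embedding, (1.0, 0.5))
```

At the point (θ, φ) = (1.0, 0.5) the result is close to the familiar form with coefficients 1, 0 and sin²θ, so the sphere with that metric is imbedded in three-space in the sense just defined.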


3. The normal spaces. For each value of α and for linear transformations
of the y's, the quantities $\partial y^i / \partial x^\alpha$ are the components of a vector field defined for all
values of $x \subset U$. The totality of vector fields defined by $\sigma^\alpha\, \partial y^i / \partial x^\alpha$ (where $\sigma^\alpha$
are arbitrary scalars in $E_{n+p}$) is called the first osculating space $J_1$. In a similar
manner, the set of all vector fields defined by

(3.1) $\sigma^{\alpha_1} \frac{\partial y^i}{\partial x^{\alpha_1}} + \sigma^{\alpha_1 \alpha_2} \frac{\partial^2 y^i}{\partial x^{\alpha_1} \partial x^{\alpha_2}} + \cdots + \sigma^{\alpha_1 \cdots \alpha_h} \frac{\partial^h y^i}{\partial x^{\alpha_1} \cdots \partial x^{\alpha_h}}$

(where again the σ's are scalars) is called the h-th osculating space $J_{12 \cdots h}$.
The invariance of these spaces under transformations of the x's is proved in
M, p. 270.

The totality of vector fields belonging to $J_{12 \cdots h}$ and normal to $J_{12 \cdots h-1}$ is
called the (h − 1)-th normal space, $J_h$. We shall denote the projection of the
vector field

$\frac{\partial^h y^i}{\partial x^{\alpha_1} \cdots \partial x^{\alpha_h}}$


² T. Y. Thomas, Riemann spaces of class one and their characterization, Acta Mathematica,
vol. 67 (1936), pp. 169-211; Fields of parallel vectors in the large, Compositio Mathematica,
vol. 3 (1936), pp. 453-468.










into the normal space $J_h$ by $Y^i_{\alpha_1 \cdots \alpha_h}$. Since the vector fields belonging to each
normal space are normal to those belonging to every other normal space, we
have


(3.2) $\sum_i Y^i_{\alpha_1 \cdots \alpha_h} Y^i_{\beta_1 \cdots \beta_k} = 0$ (h ≠ k).


Since there are at most p independent normal vectors, it is clear that there
are at most $(m - 1) \le p$ normal spaces. It is therefore of significance to speak
of the "last" normal space, $J_m$. The index m will have this significance throughout
the paper.

Let us now assume that the functions $\varphi^i(x)$ occurring in (2.3) are of class $C^{m+1}$,
that is, they have continuous derivatives of the (m + 1)-th order. The coefficients
of the first fundamental form $g_{\alpha\beta}$ given by (2.4) are therefore of class
$C^m$. We shall also denote them by $E_{\alpha|\beta}$ (= $g_{\alpha\beta}$). The coefficients of the h-th
fundamental form are defined by

(3.3) $E_{\alpha_1 \cdots \alpha_h | \beta_1 \cdots \beta_h} = \sum_i Y^i_{\alpha_1 \cdots \alpha_h} Y^i_{\beta_1 \cdots \beta_h}$

for h = 1, $\cdots$, m. From their definitions it is clear that $E_{\alpha_1 \cdots \alpha_h | \beta_1 \cdots \beta_h}$
are of class $C^{m-h+1}$. They are also tensors under transformations of the x's,
for it follows from their law of transformation that $Y^i_{\alpha_1 \cdots \alpha_h}$ are tensors. Because
of (3.3) the matrix³


$E_h = \| E_{\alpha_1 \cdots \alpha_h | \beta_1 \cdots \beta_h} \|$


is positive semi-definite and its rank is the same as that of the matrix


$Y_h = \| Y^i_{\alpha_1 \cdots \alpha_h} \|$,

where i denotes the rows and combinations of $\alpha_1 \cdots \alpha_h$ denote the columns.
In fact (cf. M, p. 274) the equations

(3.4) $E_{\alpha_1 \cdots \alpha_h | \beta_1 \cdots \beta_h}\, \theta^{\beta_1 \cdots \beta_h} = 0$

and

(3.5) $Y^i_{\beta_1 \cdots \beta_h}\, \theta^{\beta_1 \cdots \beta_h} = 0$


have the same solutions at any point P.
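
The statement that a matrix of sums of products has the same rank and the same null vectors as the matrix from which it is formed can be checked on a small numerical case. The sketch below is an illustration only: `rank` and `gram` are hypothetical helpers, the entries of Y are assumed integers, and exact rational arithmetic is used so that ranks are not blurred by rounding.

```python
from fractions import Fraction

def rank(matrix):
    """Rank by Gaussian elimination over the rationals."""
    rows = [[Fraction(v) for v in row] for row in matrix]
    ncols = len(rows[0]) if rows else 0
    r = 0
    for c in range(ncols):
        pivot = next((i for i in range(r, len(rows)) if rows[i][c] != 0), None)
        if pivot is None:
            continue
        rows[r], rows[pivot] = rows[pivot], rows[r]
        for i in range(len(rows)):
            if i != r and rows[i][c] != 0:
                factor = rows[i][c] / rows[r][c]
                rows[i] = [a - factor * b for a, b in zip(rows[i], rows[r])]
        r += 1
    return r

def gram(Y):
    """E[a][b] = sum over rows i of Y[i][a] * Y[i][b]."""
    cols = list(zip(*Y))
    return [[sum(p * q for p, q in zip(u, v)) for v in cols] for u in cols]

# Assumed example: two rows (the index i), three columns (the index sets).
Y = [[1, 2, 3],
     [0, 1, 1]]
E = gram(Y)
theta = [1, 1, -1]  # third column = first + second, so Y annihilates theta
Y_theta = [sum(a * t for a, t in zip(row, theta)) for row in Y]
E_theta = [sum(a * t for a, t in zip(row, theta)) for row in E]
```

Both `Y_theta` and `E_theta` vanish, and `rank(Y)` equals `rank(E)`, in agreement with the equivalence of (3.4) and (3.5) in this example.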

A point $P \subset R_n$ will be called a regular point of the imbedding if there exists
a neighborhood $N(P) \supset P$ within which each $Y_h$ is of constant rank. This
implies that each $E_h$ is of constant rank in N(P), and conversely. A point
which is not regular will be called singular. Consider now the matrix

$$E = \begin{Vmatrix} E_{\alpha|\beta} & & \\ & \ddots & \\ & & E_{\alpha_1 \cdots \alpha_m | \beta_1 \cdots \beta_m} \end{Vmatrix}.$$

³ The rows are given by combinations of the indices $\alpha_1 \cdots \alpha_h$, and the columns are
given by combinations of $\beta_1 \cdots \beta_h$.










It is clear that E is of constant rank in N(P). Suppose, conversely, that in a
neighborhood M(P) the matrix E is of constant rank. Now there exists a
neighborhood $\bar{M}(P) \subset M(P)$ within which each $E_h$ has a rank not lower than
its rank at P. But since the rank of E is the sum of the ranks of the $E_h$, and since
the rank of E is constant in M(P), it follows that each $E_h$ is of constant rank
in $\bar{M}(P)$, i.e., P is a regular point. Hence we may make the equivalent

DEFINITION. A point $P \subset R_n$ is regular if there exists a neighborhood N(P)
within which the matrix E is of constant rank. Other points are called singular
points.

It will become evident in the next section that the differential equations we
consider leave the rank of E constant in any domain in which they can be integrated.
Hence we see that our singular points are also singularities of the
system of differential equations. We shall show them to be the only such
points.

It is evident at once that the set of singular points is nowhere dense in $R_n$
and is a variety of dimension lower than n. Hence there will exist a neighborhood
of every regular point which contains only regular points, and in fact this
neighborhood can be taken to be simply connected. We shall call such a domain
the neighborhood U, and shall limit our discussion to regions of this nature.





4. The Frenet equations. It can be proved (M, p. 278) that at any point
$P \subset U$ there exist Γ's which satisfy the Frenet equations





i) ‘ 
oe tal) ew BE 
—— (y’) 1 
ts] ri i ri i 
ax? (Y.,) _ Toe Ys, i Vases) 
oi igen 
CG) ri **Ba-1 31° i 
axarti (} @\- a) _ . “H+ Y5,-. *Ba-1 + May: ies Ys, **Bh + Vay---antiy 
a ri ) $i°-"Bm-1 yi 3 ri 
arent (Y ay'an)? = Pay:s'am+i **Bm-1 + re; - Yp,-+-Bn° 
From (3.2), (3.3) and (4.1) it follows that for every point we have 
(4.2) Es, *Balyi-- me ee “eae = [ Sure, (Yo wll Teme, 
and 
P 8)-**Ba—1 rs] ‘ vi 
(4.3) Bg, twalve->-tang Bae---aar & | tal Paitin 


On account of (3.2) we may write (4.3) in the form 


, By***Ba-1 ri yi 
(4.4) E3,. Sh-alea-+-rana Pay---aar - —Voy-an a5 (Var 1a-a: 



















Since by hypothesis the right hand sides of (4.2) and of (4.4) considered as
functions of the x's are of class $C^{m-h}$ and $C^{m-h+1}$ respectively, we have the fact
that the left sides have this same property.

Since we have assumed that each matrix $E_h$ is of constant rank $r_h$ in U, to
every point $P \subset U$ there corresponds at least one non-vanishing $r_h$-th order
minor, $D_h$, of each matrix $E_h$. Because of the continuity of the E's there exists
a neighborhood $V(P) \supset P$ which is contained in U and within which no $D_h$
vanishes.

Now we know that the Γ's actually exist, and so it is possible to solve (4.2)
and (4.4) for the Γ's as functions of the x's within each V(P). The solutions
in general will be determined to within additive functions $\theta^{\beta_1 \cdots \beta_h}$ and $\theta^{\beta_1 \cdots \beta_{h-1}}$,
respectively, which are solutions of


(4.5) $E_{\beta_1 \cdots \beta_h | \gamma_1 \cdots \gamma_h}\, \theta^{\beta_1 \cdots \beta_h} = 0$, $E_{\beta_1 \cdots \beta_{h-1} | \gamma_1 \cdots \gamma_{h-1}}\, \theta^{\beta_1 \cdots \beta_{h-1}} = 0$.


The θ's may be chosen so that $\Gamma^{\beta_1 \cdots \beta_h}_{\alpha_1 \cdots \alpha_{h+1}}$ and $\Gamma'^{\beta_1 \cdots \beta_{h-1}}_{\alpha_1 \cdots \alpha_{h+1}}$ are of class $C^{m-h}$ and
$C^{m-h+1}$ respectively in V(P).
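If $\Gamma^{\beta_1 \cdots \beta_h}_{\alpha_1 \cdots \alpha_{h+1}}$ are solutions of (4.2) in V(P) and if $\bar{\Gamma}^{\beta_1 \cdots \beta_h}_{\alpha_1 \cdots \alpha_{h+1}}$ are solutions in
another neighborhood V(P'), it is clear that in $V(P) \cap V(P')$

$(\Gamma^{\beta_1 \cdots \beta_h}_{\alpha_1 \cdots \alpha_{h+1}} - \bar{\Gamma}^{\beta_1 \cdots \beta_h}_{\alpha_1 \cdots \alpha_{h+1}})\, E_{\beta_1 \cdots \beta_h | \gamma_1 \cdots \gamma_h} = 0.$

A similar result holds for $\Gamma'^{\beta_1 \cdots \beta_{h-1}}_{\alpha_1 \cdots \alpha_{h+1}}$. Mayer has proved (M, p. 278) that for
transformations of the x's, $\Gamma'^{\beta_1 \cdots \beta_{h-1}}_{\alpha_1 \cdots \alpha_{h+1}}$ are tensors, and that $\Gamma^{\beta_1 \cdots \beta_h}_{\alpha_1 \cdots \alpha_{h+1}}$ have a
transformation law like that of the Christoffel symbols.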
If we form the integrability conditions of (4.1) in any V(P), we obtain 
*Oh—-2 


pr" *Ph— o1**on—2 Pit *Ph— Per 2 é : 
(4.6) TS. +--@ar Faves he Pas---aae Tne Ver---en-s _ 0; 


Ph-12@ 


O,°**Oh—] 171° —— 1 
(Fa “@at _ Qi" ahe + re *Ph-1 prt: eat = | eae ans. 
“ear > pE°** @y "ano * pi **Ph—1 





Ox® Ox 
(4.7) 
+ eye eae Pope sspne) — Tay-- “ene Daa en ee Me 
yp ae RE oe . ae 
( = wise = = + Pe > Dy ° io wee FS on aan onthe. 

(4.8) 

+ Tay---aar  — Te,- aon 6" + re “aare = Eine] — = 0, 
or 


(EG ase > FS hee ~ Bel ane @ & 
where B’s are thus defined. When h = m we have 
(4.9) | lt. I, a a 
Finally we have 


620 Cs Em Tet FS cate — Fe --ceed y ey O 










In these equations the index h has the range 1, $\cdots$, m, it being understood
that no terms involving zero or negative indices are written. The entire set
may be written in a more convenient notation by writing $s_h$ for the set of h
symmetric indices $\sigma_1 \cdots \sigma_h$. A single latin index without a subscript stands
for a single greek index. Thus, for example, we write T@,..%,, = Tuh,. Here- 
after we shall pass from one notation to the other without comment. If (4.6) 
to (4.10) be multiplied by appropriate Y’s and the summations performed (for 
example multiply (4.6) by Y%,...s,.,), the set can be written in the shorter 
notation as follows: 








(4.11) Wars’ Nance * Kee Pencctd net, @ @ 
Pa , Poy rho 8h— r —8h-1 
? 2" i er! + Daye i oe Taw’ I ranit 
(4.12) 
+ Tone ra sit Tove Pa" Be = 0; 
(4.13) (Bariw + Tae — Tarwe) Bs, jo, = 0 (h _ l, ies m — 1); 
(4.14) } oe E...\bm = 0; 


(4.15) (Tagrie Se — Pegrie St + Perite — Peprios) Ba-iriy, = 0. 
If we differentiate (3.2) and (3.3), substitute from (4.1), and again make use of 
(3.2) and (3.3), we have forh = 1, ---,m 


(4.16) Bag—ytion + Tete! Bay—sen-1 = 0; 
- fe) 
(4.17) art (Ea, i0,) — rm, Ea, iv, = rs, Ea, eg = 0. 


The entire set (4.11) to (4.17) hold in every V(P). 

THEOREM I. If $y^i = \varphi^i(x)$ are functions of class $C^{m+1}$ which define a Riemann
space imbedded in a given $E_{n+p}$ throughout a neighborhood U, then

1. The tensors $E_{\alpha_1 \cdots \alpha_h | \beta_1 \cdots \beta_h}$ (h = 1, $\cdots$, m) are defined as functions of
class $C^{m-h+1}$ in U; the matrix $E_1$ is positive definite in U, and the matrices $E_h$ (h =
2, $\cdots$, m) are positive semi-definite of constant rank $r_h$ in U and $r_2 + \cdots + r_m
= p$.

2. The quantities $\Gamma'^{\beta_1 \cdots \beta_{h-1}}_{\alpha_1 \cdots \alpha_{h+1}}(x)$ and $\Gamma^{\beta_1 \cdots \beta_h}_{\alpha_1 \cdots \alpha_{h+1}}(x)$ are defined to within a solution
of (4.5) in each $V(P) \subset U$ and are of class $C^{m-h+1}$ and $C^{m-h}$, respectively.

3. The E's, the Y's, and their derivatives satisfy (4.11) to (4.17) in each neighborhood
V(P).




5. Some identities and formal relations. We interrupt the geometrical argu- 
ment at this point to consider some formal relations which we shall need later. 


The expressions 


Bairitw = —_ By tw Ea, ia (h == l, eee m) 
















are called the components of the h-th curvature tensor, the first curvature tensor 
being in fact the ordinary Riemann tensor. From (4.13) and (4.16) it follows 
that 


(5.1) Baging|tw = Lapw|pyt —~ Lagt|pyw (k = 1, 18s a 1). 


These equations are generalizations of the Gauss equations of the classical 
theory, so we shall continue to call them the Gauss equations. Since the right 
sides of (5.1) are of class C"™, it follows that Ba,jp,\:w are of class C"™™*. In 
fact, we see that the curvature tensors are defined as functions of this class 
throughout U in spite of the fact that the I’s from which they are formed are 
defined only within each V(P) C U. It is also clear that the arbitrariness 
which occurred in the definition of the I'’s has disappeared from the B’s. Be- 
cause of (5.1) there exists a set of relations in the B’s which holds for all z C U 
and which is a generalization of that which holds for the Riemann tensor. By 
examining the definition of the B’s, it can be shown that these relations are 
actually identities in the z’s. They are the following: 


(5.2) Bag pg lew + Bayi pglwe = 0; 

(5.3) Baxi pyitw + Boy ia; \tw _ 0; 

(5.4) Baxi m-yvitw a Baxipy-ywlvt + Bagipn-stivv _ 0; 

(5.5) P(Ba, ...0ai6,--taira) = 0, 

where P denotes the sum of terms formed by the cyclic interchange of (a; --- ax 
8: --- Ber). We indicate by a bar over an index that it is not to be included 
in the permutation. The final identity is 

(5.6) S(Ba,.--a4)8,---Bglro) = 0, 


where S indicates the sum of k + 1 terms, of which the first is the term written 
above, and the q-th of which is obtained from the (q¢q — 1)-th by interchanging 
@-1 With the last index of the (¢q — 1)-th term and 8,_, with the next to last 
index of the same term, and then interchanging a,_; and 8,-; in the result. In 
case k = 1 this becomes the ordinary identity 


Bajsiro + Bajrtas = 0. 
For k = 2 we have 
Bayas) 3; Bq|rw + Bessie felons By + Bap, ira; 10 62 _ 0. 


In order to generalize the Bianchi identity we define a generalized covariant 
derivative which will be denoted by a semi-colon and which is a special case of 
those considered by A. W. Tucker.‘ The rule to be used is shown by the follow- 


4A. W. Tucker, On generalized covariant differentiation, Annals of Math., vol. 32 (1931), 
pp. 451-460. 










ing example, where t; and 7, are compound indices of the same or different orders: 
tf 
5 t aT, ss 
(5.7) i ~ a Tr roe rat yt A Mae 


The ordinary rules of covariant differentiation apply in this case, and for exam- 
ple we may write (4.17) in the form 

Bagi by;t = 0. 
When A = 1 and f = 1, we have ordinary covariant differentiation with respect 
to T'3,; this will be denoted by a comma. The generalized identity may be proved 
by direct calculation from (4.11)-(4.17) and has the form 


(5.8) Bayi ppitw;e + Bayi nsivtze + Bayi pa iwvt = 0. 


If (5.6) is differentiated covariantly and each term in the result is replaced 
by the corresponding two terms given by (5.8), we have 


(5.9) T(Ba,..-a4| 8; ---B,l700) = 9, 
where J denotes the sum of terms obtained by the cyclic interchange of 
(ay -++ an OB; «++ Brw). 
By differentiating (5.5) we have 
P(Ba,.--a,)8;-+- Biro) = 0, 
the cyclic interchange omitting w and \. Adding to this the identity 
P( Bay ---an| By Baliria) = 0, 
and using (5.8), we have 
(5.10) P(Bay.--ay| 8; ---Bylaiie) = 0. 


This may be thought of as another generalization of Bianchi’s identity. 

It is also of importance to show that (4.12) is a consequence of the other 
equations of the set (4.11)-(4.17). Differentiating (4.16) and eliminating 
derivatives of the E’s by (4.17), we have 


rh-1 
OP o,-iwe Th-1 Sh-1 


al sh- 
Ey \es-1 _—<_« —Piprie (Poste Bays lea-1 +> Pay-ie EB. 1rn-1) 


18h 7 Sh 
=] an—ytv E..in-10 = Ta;-,00 EBayion-yt . 


Substituting this into (4.12), we obtain an expression which vanishes identically 
because of (4.15) and (4.16). Thus (4.12) is a consequence of (4.15), (4.16), 
and (4.17). 

A final relation is obtained by substituting in (4.11) from (4.16). The result is 


Th-1 Th-1 
Pye Ey, 1a,-20 — Poi Ey y\an-2t = 0. 













Because of (4.16) this becomes 
B psin—100 _ B nica-ste = 0, 
which vanishes identically by the very definition of the E’s. Hence (4.11) 
is a consequence of (4.16). These last two relations are given by Mayer (M, 
p. 293) but his proof is from a somewhat different point of view. 
COROLLARY. If the matrix $E_h$ is of rank one, the h-th curvature tensor vanishes
identically.
For since $E_h$ is of rank one at any point $P \subset U$, we may write
$E_{a_h | b_h} = \lambda_{a_h} \lambda_{b_h}$.
Then (5.3) may be written 
(5.11) Pate Asp Na, + Bohiw As, Apa = 0. 
If we put a, = p, , the result is 
Bote Ln Xp» = 0. 
But at least one ,, (say \,,) is not zero. Then the corresponding Bohiw As, = 9, 


and consequently for any a, Bor.Es,)2, = 0. But from this result and (5.11) 
it follows that 


(5.12) oe hea de, = 0, thiw As, = 0. 


Consequently all $B_{a_h | b_h | tw} = 0$ in U. This result has been obtained in a different
manner by Burstin.⁵


6. Determination of the Γ's in terms of the E's. We now turn our attention
to equations (4.11)-(4.17) and deduce from them a set of necessary conditions
on the coefficients of the fundamental forms. We know that in each V(P)
there exists a solution of (4.16), i.e.,


(6.1) Teese Eayie, _ — Baytion+15 
where these are considered as linear equations in the I’s. Hence we have the 
Matrix Conpition I. The augmented matrices Ex, = || EBayja,} Baxe\o,,, || 


are of rank r, in U for every choice of t and by. . 

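Similar linear equations in $\Gamma^{s_h}_{a_h t}$ may also be derived. Before proceeding to
the general case, we illustrate the method by taking h = 1. From (4.17) it
follows that

(6.2) $2\Gamma^{\gamma}_{\alpha\beta} E_{\gamma|\delta} = \frac{\partial E_{\alpha|\delta}}{\partial x^\beta} + \frac{\partial E_{\beta|\delta}}{\partial x^\alpha} - \frac{\partial E_{\alpha|\beta}}{\partial x^\delta}$.

Since the determinant $|E_{\gamma|\delta}| \ne 0$, these may always be solved for $\Gamma^{\gamma}_{\alpha\beta}$. Thus
the $\Gamma^{\gamma}_{\alpha\beta}$ turn out to be the Christoffel symbols of the second kind.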
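
The case h = 1 can be carried out numerically for a particular first fundamental form. The sketch below is only an illustration: it takes the classical relation for the Christoffel symbols of the second kind (twice the symbol, contracted with the metric, equals the sum of two first derivatives of the metric minus a third), assumes the two-dimensional metric of the unit sphere as example, approximates derivatives by central differences, and inverts the 2 × 2 metric explicitly. The names `metric` and `christoffel` are hypothetical.

```python
import math

def metric(x):
    """Assumed example: first fundamental form of the unit sphere, x = (theta, phi)."""
    return [[1.0, 0.0], [0.0, math.sin(x[0]) ** 2]]

def christoffel(metric, x, step=1e-5):
    """gamma[c][a][b] solved from 2*Gamma^c_ab g_cd = d_b g_ad + d_a g_bd - d_d g_ab."""
    n = len(x)
    g = metric(x)
    # dg[k][a][b] approximates the derivative of g_ab with respect to x^k
    dg = []
    for k in range(n):
        hi, lo = list(x), list(x)
        hi[k] += step
        lo[k] -= step
        gh, gl = metric(hi), metric(lo)
        dg.append([[(gh[a][b] - gl[a][b]) / (2 * step) for b in range(n)]
                   for a in range(n)])
    # explicit inverse of the 2x2 metric (this sketch assumes n == 2)
    det = g[0][0] * g[1][1] - g[0][1] * g[1][0]
    inv = [[g[1][1] / det, -g[0][1] / det],
           [-g[1][0] / det, g[0][0] / det]]
    gamma = [[[0.0] * n for _ in range(n)] for _ in range(n)]
    for c in range(n):
        for a in range(n):
            for b in range(n):
                gamma[c][a][b] = 0.5 * sum(
                    inv[c][d] * (dg[b][a][d] + dg[a][b][d] - dg[d][a][b])
                    for d in range(n))
    return gamma

G = christoffel(metric, (1.0, 0.3))
```

For the sphere the only non-vanishing symbols are those with one upper θ and two lower φ's, equal to −sin θ cos θ, and those with upper φ and mixed lower indices, equal to cot θ; the computed values agree with these to within the accuracy of the difference quotients.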


⁵ C. Burstin, Beiträge zur mehrdimensionalen Differentialgeometrie, Monatshefte für
Math. und Physik, vol. 36 (1929), p. 114.













For h = 2 we have from (4.17) and (4.15): 


(6.3) QE op\y8 Vase = 2Eeoiya(T ae 53 + U3 5%) + Easizs.e + Esyise.a 
jt 

+ Eysjeas — Escias.y — Eraiay.s = Prsiases 
where @5s\a3% are thus defined. We know that these equations, considered as 
linear in the [’s, have a solution in every V(P). Thus we have the following 

Matrix Conpition Il. For every choice of (aBe) the augmented matrices 
Es = || Egos 3 y8\ade || are of rank re in every V(P). 

It will be noticed that these conditions are closely related to the Codazzi 
equations of the classical theory. 

In order to extend these results to a general h, we define the operator Q to 
indicate the sum of terms obtained by cyclic interchange of (a - «+ a,61 +--+ Bar), 
a plus sign being attached to a term if the last index in the preceding bracket is 
7 or any a, and a negative sign if it is any 8. Thus the general case of (6.2) 
and (6.3) is 


- viet ya—18 Yi'*Ya-1 34 
2E,,. Ya-18 d+ Suma © enunes = | 


58 ont YA=1 
+ Ter 8a; ---ea-1) EB, .--14=1810, ++ Bama + o( 


dE, es -a@n—1€18) ore “Bn—1d 
Ox" 





(6.4) 


: i'*"Hh—1 7 1° * “Bal 
Ea, “*@a—1e wr--wa—ad By -+-By—ar EB, .--sa-10101-+-a-rd Tay ---ea—it 


, we y 1p 
a } *@a—10 Be és enh Eon = | 3,---3) Th). 


In our briefer notation this may be written 


7 an +ca—jd Ch- d d Ch- 
(6.5) 2B ey-,4 br—ial oni pt = 2(Tay— ie 6 p+ "pt 6a, Bex—sdida—re + Q(Eay-svibn—rait)s 
or 


2E cx id, ro = Pdbplants 


where ¢,\a,¢ are so defined and we have written c, for c,..d, ete. Since we 
know that the [’s exist which satisfy these equations, we get ' 
Matrix Conpition II (General case). The augmented matrices Kk, = 
Bex \o,5 Gb, \aye || are of rank r, in every V(P) for each choice of (at). 

As we have stated them, the matrix conditions I involve E,,), and Ea,.,\5,., 
only. On the contrary, conditions II involve their first derivatives, and T'3}7},, 
and thus are not expressed as conditions on the fundamental forms. However, 
r.7} is given by (6.5) in terms of Z,,_,\»,-, and '4}-3,. By repeating this process 
we finally can eliminate all Γ's and may regard the matrix conditions II as involving
only the fundamental forms.

In a similar manner it follows that the Gauss equations (4.13) and (4.14) 
can be expressed in terms of the coefficients of the fundamental forms and their 
derivatives only. It is conceivable that in eliminating the Γ's by means of (6.5)
the arbitrary θ's might enter into the resulting expressions. Direct examination
of the formulas involved, however, shows that this is not the case. We have 


thus proved 



















THEOREM II. The matrix conditions I and II and the equations (4.13) and
(4.14) are conditions on the coefficients of the fundamental forms which are necessarily
satisfied if the forms define an imbedding of a Riemann space.

We now turn to the converse problem and determine conditions which are 
sufficient for these forms to define such an imbedding. We assume at the 
outset that the E,,», satisfy (1) of Theorem I and that they also satisfy the 
conditions stated in Theorem II. We define a set of Γ's in each V(P) by the
following recursive process. For h = 1, (6.2) define I's, uniquely as functions 
of class C"™’. These actually satisfy all equations in (4.11)-(4.17) for h = 1, 
for the only equations which occur in this case are (4.15) and (4.17), and these 
are indeed satisfied identically because of (6.2). 

For h = 2, we define It’s, by means of (6.3). This is possible since the right 
- hand sides contain only the given E4g),5 , their derivatives, and a previously 
determined set of T'3,. Since the matrix E2 is of rank rz, a solution of (6.3) exists 
and is determined to within an additive function 62, which is a solution of (4.5). 
These solutions are defined as functions of x in each V(P), and the additive 
functions can be so chosen that they are of class C”*. In a similar fashion 
I'sys are defined as functions of class Cc” in U by equations (6.1). It remains to 
investigate whether the I'’s so defined actually satisfy (4.11)-(4.17) when h = 2. 
If we substitute for T%%, in (4.15) and then use (4.13), we obtain 


Boa be. + Bysse.a + Bay ge8 = 0. 


But this is Bianchi’s identity, and so our Γ's satisfy (4.15). Similarly the result
of substituting for I's}; in (4.17) vanishes because of (4.13) and the identity (5.9). 
Since (4.16) are satisfied by the very definition of T's,;, and since (4.11) and 
(4.12) are consequences of the other equations, it follows that if E£.;3 and Ea,jys 
satisfy the Gauss equations and if E is of rank re, it is possible to find I'g,; and 
V's. Which satisfy (4.11)-(4.17) for h = 2. 

For a general value of h we solve (6.1) and (6.5) for T'}}7' and Ty, in terms of 
the coefficients of the fundamental forms and previously determined Γ's. And
as for the case h = 2, it can be shown that these new Γ's satisfy (4.11)-(4.17)
as a consequence of the identities of §5. Hence we have 

THEOREM III. Let there be given a set of tensors $E_{\alpha_1 \cdots \alpha_h|\beta_1 \cdots \beta_h}$ (h = 1, ⋯, m)
which are defined as functions of class C™"*' in U, where E, is positive definite in U, 
and E, (h = 2,---,m) is positive semi-definite of constant rank r, in U and 
ro +--+ + rn = p. Then quantities 3)... and V5)...9,2°"' can be found 
which will satisfy the equations (4.11)-(4.17) in every V(P) C U if and only 
if the Gauss equations are satisfied in U, the m-th curvature tensor vanishes in 
U, and the matrices E, and Ej, are each of rank r,in U. The T§)..3%% so defined 
are of class C”™ and the §..{4~" are of class C”~"** respectively. 


7. Solution of the Frenet equations in a restricted neighborhood. We now 
suppose that a set of E’s are given which satisfy the conditions of the preceding 
theorem. In a definite V(P) we define the Γ's by the method just described.










Consider further a spherical neighborhood W(P) covered by a single coördinate
system and such that V(P) ⊃ W(P) ⊃ P, i.e., one such that its map in the
arithmetic space is the interior of a sphere. We now seek to integrate the mixed 
system composed of the Frenet equations (4.1) and the algebraic equations (3.2) 
and (3.3) in the neighborhood W(P). 

For convenience let us denote (4.1) by 

does — — t=1,---,n+p; 

(7.1) aos WO) ¥ a5 *** 5 SS = One Me h=1,---,m; 
Ke = Qy-y yy ys, 


Let the coördinates of P be $x_0$, and assign the values $(y^i)_0$, $(Y^i_{a_h})_0$ to the unknowns
at P such that (3.2) and (3.3) are satisfied when $x = x_0$. It is clear that this
choice of initial values is always possible, for the matrix E is by hypothesis a
symmetric, positive semi-definite matrix of rank n + p. Therefore a canonical
representation of the form (3.2) and (3.3) is possible at P.⁶

Moreover, any two possible sets of initial values $(Y^i_\alpha)_0$ are connected by the
relation
(7.2) $(Y^i_\alpha)_0 = a^i_j (Y'^j_\alpha)_0$,
where the a's form an orthogonal matrix of constants. The values of $(y^i)_0$ are
completely arbitrary.

Picking out one such set of initial values we now seek to find the corresponding
values of the unknowns at another point Q ⊂ W(P) of coördinates $x_1$. To do
this we join P and Q with an arbitrary curve C, lying in W(P), defined by
$x^\alpha = f^\alpha(t)$, where $f^\alpha(0) = x_0^\alpha$, $f^\alpha(1) = x_1^\alpha$, and the $f^\alpha(t)$ are single valued and of class
C′ for 0 ≤ t ≤ 1. There always exists such a curve, for in particular we may
take $f^\alpha(t) = x_0^\alpha + (x_1^\alpha - x_0^\alpha)t$. Now integrate


- ly' ~~ dx" dy: ; dx’ 
(7.3) - = Y peers He = Pearle) Yi S 





along C for the above set of initial values. The quantities $y^i(t)$, $Y^i_{a_h}(t)$ thus
defined satisfy (3.2) and (3.3) for t = 0 by hypothesis, and indeed they satisfy
(3.2) and (3.3) all along C. For let


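(7.4) $[\alpha_1 \cdots \alpha_h\, \beta_1 \cdots \beta_h] = Y^i_{\alpha_1 \cdots \alpha_h} Y^i_{\beta_1 \cdots \beta_h} - E_{\alpha_1 \cdots \alpha_h|\beta_1 \cdots \beta_h}$,


(7.5) $[\alpha_1 \cdots \alpha_h\, \beta_1 \cdots \beta_k] = Y^i_{\alpha_1 \cdots \alpha_h} Y^i_{\beta_1 \cdots \beta_k}$ (h ≠ k).
Then (3.2) and (3.3) may be written
(7.6) $[\alpha_1 \cdots \alpha_h\, \beta_1 \cdots \beta_l] = 0$ (h, l = 1, ⋯, m).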


Differentiating (7.4) and (7.5) along C and then applying (7.4) and (7.5) to 
the result, we find that the derivatives of the bracket expressions are linear 
combinations of the bracket expressions themselves, the coefficients involving 


6 Cf. Duschek-Mayer, Lehrbuch der Differentialgeometrie, II, p. 25.













the Γ's and $dx^\alpha/dt$, plus terms which vanish because of (4.16) and (4.17). But
this shows at once that (7.6) is satisfied all along C. 
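
The mechanism of this argument can be illustrated numerically on a toy system. The Python sketch below is not the system (7.3); it assumes a 3 × 3 frame Y obeying dY/dt = A(t)Y with a hypothetical skew-symmetric coefficient matrix A(t), which stands in for the Γ's contracted with dx/dt, and takes the entries of B = YYᵀ − I as the bracket expressions. Since dB/dt = AB + BAᵀ is linear and homogeneous in B, brackets that vanish at t = 0 should vanish all along the curve, up to the error of the integrator.

```python
import math

def coeff(t):
    # Hypothetical skew-symmetric coefficients (an assumption for illustration).
    a, b, c = math.cos(t), 0.5 * t, math.sin(2.0 * t)
    return [[0.0, a, b],
            [-a, 0.0, c],
            [-b, -c, 0.0]]

def matmul(p, q):
    return [[sum(p[i][k] * q[k][j] for k in range(3)) for j in range(3)]
            for i in range(3)]

def axpy(y, k, s):
    # Return y + s*k entrywise.
    return [[y[i][j] + s * k[i][j] for j in range(3)] for i in range(3)]

def rhs(t, y):
    # Right-hand side of the toy frame equations dY/dt = A(t) Y.
    return matmul(coeff(t), y)

def integrate(y0, steps=2000):
    # Classical fourth-order Runge-Kutta on 0 <= t <= 1.
    h = 1.0 / steps
    y, t = y0, 0.0
    for _ in range(steps):
        k1 = rhs(t, y)
        k2 = rhs(t + h / 2, axpy(y, k1, h / 2))
        k3 = rhs(t + h / 2, axpy(y, k2, h / 2))
        k4 = rhs(t + h, axpy(y, k3, h))
        for k, w in ((k1, 1.0), (k2, 2.0), (k3, 2.0), (k4, 1.0)):
            y = axpy(y, k, h * w / 6.0)
        t += h
    return y

def max_bracket(y):
    # Largest entry of B = Y Y^T - I in absolute value.
    yt = [[y[j][i] for j in range(3)] for i in range(3)]
    g = matmul(y, yt)
    return max(abs(g[i][j] - (1.0 if i == j else 0.0))
               for i in range(3) for j in range(3))

identity = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
y_end = integrate(identity)
bracket_error = max_bracket(y_end)
print(bracket_error)
```

With these assumed coefficients the printed value should be at the level of the integration and rounding error.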

THEOREM IV. The solutions $Y^i_{a_h}(t)$ of (7.3) whose initial values satisfy (3.2)
and (3.3) for $x = x_0$ satisfy (3.2) and (3.3) for all points of C.

We have assumed that the conditions (4.11)-(4.17) are satisfied in W(P). 
Moreover, we know from §3 that if (4.11)-(4.15) hold, then the integrability 
conditions (4.6)-(4.10) are satisfied provided that (3.2) and (3.3) are true. 
Hence we have 

THEOREM V. The integrability conditions (4.6)-(4.10) of the Frenet equations
are satisfied along the curve C by the solutions of (7.3), whose initial values satisfy
(3.2) and (3.3) for $x = x_0$.

In order to prove that the values of the $y^i$ and $Y^i_{a_h}$ thus defined at Q are genuine
functions of the x's, it is necessary to show that these values are independent
of the curve C along which we have integrated. Join P and Q with some other
curve $C_1$ of class C′ lying in W(P) whose equation is $x^\alpha = f_1^\alpha(t)$, where the $f_1^\alpha$ have
the same properties as the $f^\alpha$.⁷ Since W(P) is a spherical neighborhood, C and $C_1$
can be imbedded in a one-parameter family of curves $x^\alpha = G^\alpha(t, p)$, joining
P and Q and lying in W(P). The $G^\alpha(t, p)$ are assumed to possess the following
continuous derivatives

$\frac{\partial G^\alpha}{\partial t}$, $\frac{\partial G^\alpha}{\partial p}$, $\frac{\partial^2 G^\alpha}{\partial t\,\partial p}$, $\frac{\partial^2 G^\alpha}{\partial p\,\partial t}$

for 0 ≤ t ≤ 1 and 0 ≤ p ≤ 1 and to be such that $G^\alpha(t, 0) = f^\alpha(t)$ and $G^\alpha(t, 1) =
f_1^\alpha(t)$; $G^\alpha(0, p) = x_0^\alpha$; and $G^\alpha(1, p) = x_1^\alpha$. In particular, we may put
$G^\alpha(t, p) = f^\alpha(t) + p[f_1^\alpha(t) - f^\alpha(t)]$.
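
The particular family G(t, p) = f(t) + p[f₁(t) − f(t)] can be sketched in a few lines. The example assumes a two-dimensional arithmetic space, takes the unit ball as the map of W(P), and uses a hypothetical second curve f1; none of these choices come from the text. It checks that every curve of the family keeps the end points fixed and, by convexity of the ball, stays inside it.

```python
import math

def f(t, x0, x1):
    # Straight segment from x0 to x1, i.e. f(t) = x0 + (x1 - x0) t.
    return [a + (b - a) * t for a, b in zip(x0, x1)]

def f1(t, x0, x1):
    # A second, hypothetical curve with the same end points:
    # the segment plus a bump that vanishes at t = 0 and t = 1.
    base = f(t, x0, x1)
    bump = 0.2 * math.sin(math.pi * t)
    return [base[0] + bump, base[1] - bump]

def G(t, p, x0, x1):
    # One-parameter family G(t, p) = f(t) + p [f1(t) - f(t)].
    u, v = f(t, x0, x1), f1(t, x0, x1)
    return [a + p * (b - a) for a, b in zip(u, v)]

def norm(x):
    return math.sqrt(sum(c * c for c in x))

x0, x1 = [-0.3, 0.1], [0.4, 0.2]   # both inside the unit ball
grid = [k / 20.0 for k in range(21)]
# Largest distance from the centre over the sampled family.
max_radius = max(norm(G(t, p, x0, x1)) for t in grid for p in grid)
print(max_radius)
```

Any convex neighborhood would serve equally well here; the ball is only the case named in the text.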
Consider the equations 


. 4 aG” ee aG” 
(7.7) — Yi(G(p, t)] ‘at’ ? dt 


= any [G(p, t)) Yi Ot 
along any curve of the family, i.e., for a particular value of p. If we take for the
values of the unknowns at P (corresponding to t = 0) the quantities which
served as initial values for our integration of (7.3), the equations (7.7) define
a set of functions $y^i(t, p)$, $Y^i_{a_h}(t, p)$ for all values of t and p in the specified ranges.
Because of Theorem IV they also satisfy


(7.8) EaywlG(t, p)] = Yal@t, YG, p)I.- 


For t = 1 we have the values of $y^i(1, p)$ and $Y^i_{a_h}(1, p)$ at Q as defined by the
curve of the family of parameter p. We now show that these values are actually
independent of p.


7 The following treatment is based on that given by T. Y. Thomas, Systems of total 
differential equations defined over simply connected domains, Annals of Math., vol. 35 (1934), 
pp. 730-734. Thomas’ treatment, however, applies only to completely integrable systems 
which do not involve additional algebraic equations. 













Consider the equations 


" oy’ < aG* ay: , ae a 
(7.9) — « ¥iio) =: % =» (g) YG) — 
' at a ay = Pur GEG) 

{ , a" , 

i os Fi — + 

| Op ap 
(7.10) el 

aYs aG'(t,p) 

= 7s o 


k ri 
—a 32 iy G) Yi (@) - ah 
| “ap Pony (G) Yi ( apt om 
where $\sigma^i$ and $\sigma^i_{a_h}$ are defined by the last equations. If we differentiate (7.9)
with respect to p, and (7.10) with respect to ¢, and equate the right hand sides 
of the resulting equations, we obtain 


doo OG gk OG AE 

a & — + Fi — —; 

fed Rl 
(7.11) , 
Oo, i ke OG aG” aG” 


= Gitar a + PanVi seg, 


at 
where 5,3 and 5, ,; are the bracketed expressions on the left sides of (4.6)-(4.10). 
From Theorem V we see that 
$5,,Yi = 0, $%,,3Y; = 0 
for all allowable ¢ and p. 

Now for t = 0, we have $y^i$, $Y^i_{a_h}$, and $G^\alpha(0, p)$ by hypothesis independent of p;
hence from (7.10) it follows that $\sigma^i = 0$ and $\sigma^i_{a_h} = 0$ for t = 0. Since (7.11)
is then a system of linear homogeneous differential equations in t for the σ's, we have
that $\sigma^i = 0$ and $\sigma^i_{a_h} = 0$ for all allowable values of t and p, and
in particular for t = 1. We have also by hypothesis that the $G^\alpha(1, p)$ are independent
of p, and hence from (7.10) it follows that the solutions of (7.7) at t = 1 are
independent of p.

We therefore define $y^i(x)$, $Y^i_{a_h}(x)$ as functions of x in W(P) whose values at any
point Q ⊂ W(P) are obtained from a given set of initial values satisfying (3.2) and
(3.3) at P by integrating along any curve of class C′ joining P and Q and lying in
W(P). These functions will satisfy (3.2) and (3.3) in W(P).

The proof that these functions are unique and that they are actually solutions
of the differential equations is given by Thomas.⁷ It is also shown there that if
Q ⊂ W(P) and if initial values are chosen satisfying (3.2) and (3.3) at Q, there
exists a unique solution of the mixed system in W(P) having the given initial
values at Q. To find this solution, it is only necessary to integrate along a curve
from Q to P, thus determining values of the functions at P. With these as
initial values the system can be integrated in W(P), and it is shown that the
resulting solution has the required initial values at Q. We shall need this
property in the next section.

THEOREM VI. Let (1) there be given a set of functions $E_{\alpha_1 \cdots \alpha_h|\beta_1 \cdots \beta_h}$ (h =
1, ⋯, m) of class $C^{m-h+1}$ in a spherical neighborhood W(P) such that $E_1$ is positive
definite in W(P); $E_h$ is positive semi-definite of constant rank $r_h$ (h = 2, ⋯, m)
in W(P) and $r_2 + \cdots + r_m = p$; (2) the matrices $E_h$ and $\bar E_h$ be of rank $r_h$ in W(P);
(3) the $E_{\alpha_1 \cdots \alpha_h|\beta_1 \cdots \beta_h}$ and their derivatives satisfy the Gauss equations in W(P); (4)
the m-th curvature tensor vanish in W(P).

Then and only then do the Frenet equations (4.1) and their associated algebraic
equations (3.2) and (3.3) have a unique solution $y^i(x)$, $Y^i_{\alpha_1 \cdots \alpha_h}(x)$ for all x ⊂
W(P) for each set of initial values $(y^i)_0$ and $(Y^i_{\alpha_1 \cdots \alpha_h})_0$ which satisfy (3.2) and
(3.3) at any point Q ⊂ W(P). The Riemann space defined by $E_{\alpha|\beta}$ in W(P) is
thus imbedded in a Euclidean $E_{n+p}$.


8. Solution of the Frenet equations in an open simply connected domain.
Next we consider an open simply connected domain U of a coördinate manifold.
Suppose that functions $E_{a_h|b_h}$ (h = 1, ⋯, m) satisfying the hypothesis of the
last theorem are defined throughout U. Then each point P ⊂ U is contained
in at least one spherical neighborhood W(P) ⊂ U within which the Γ's may be
defined as in §6. If two such neighborhoods W(P) and W(P′) have points in
common, from §4 we see that the $(\Gamma)_P$ defined in W(P) and the $(\Gamma)_{P'}$ defined in
W(P′) satisfy the equations


Ea, l(Te.,)e — (Te,,)e-] = 0, 
Run.city Tae = (Pees!)p-] = 0 


in W(P) ∩ W(P′). We shall now show that the conclusions of the theorem of
§7 can be extended to the entire neighborhood U.

Pick out any point P ⊂ U of coördinates $x_0$ and assign initial values $(y^i)_0$,
$(Y^i_{a_h})_0$ to the unknowns at P which satisfy (3.2) and (3.3) at P. In order to
determine the corresponding values at another point Q ⊂ U, join P and Q with a
simple curve C of class C′. This is possible, since U is connected. Because
of the Heine-Borel Theorem, C can be covered by a finite set of the spherical
neighborhoods $W_1, \cdots, W_f$, where $W_1 = W(P)$ and $W_f = W(Q)$. Let
$V_1, \cdots, V_f$ be points on C such that $V_g \subset W_g \cap W_{g+1}$ (g = 1, ⋯, f − 1)
and $V_f = Q$. Integrating along C from P to $V_1$, we obtain values $(y^i)_1$ and
$(Y^i_{a_h})_1$ at $V_1$ corresponding to the given initial values. Then taking these values
at $V_1$ as initial values integrate in $W_2$ from $V_1$ to $V_2$ and continue this process
until finally by integration from $V_{f-1}$ to Q values $(y^i)_Q$ and $(Y^i_{a_h})_Q$ are defined
at Q.

Suppose next that P and Q are joined by another simple curve of class C′,
say $C_1$, which passes through the same set of neighborhoods $W_1, \cdots, W_f$ in their
natural order. Select points $X_1, \cdots, X_f$ on $C_1$ such that $X_g \subset W_g \cap W_{g+1}$
(g = 1, ⋯, f − 1) and $X_f = Q$. Now by the above process define values
$(y^i)_Q$ and $(Y^i_{a_h})_Q$ at Q corresponding to the curve $C_1$ for the given initial values
at P.

Let $V_g$ and $X_g$ (g = 1, ⋯, f − 1) be joined by some curve of class C′, say
$S_g$, lying in $W_g \cap W_{g+1}$. Integrating (7.3) along $S_g$ from $V_g$ to $X_g$ for the
values of the Γ's, $(\Gamma)_g$, defined in $W_g$, and for initial values satisfying (3.2)


(8.1) 










and (3.3), we obtain a set of $y^i(t)$ and $Y^i_{a_h}(t)$ for each point of $S_g$. Now (7.3)
are actually equations of the form


dy’ _ yi dx* 
— ee 
(8.2) , 
@ pyri ° ri 1bp vi rar 2 
dt (Fo = [(ras'),} ba—1 + (I ae ba + om “dt . 


Now substitute the $y^i(t)$ and $Y^i_{a_h}(t)$ obtained by the above integration along $S_g$
into the equations (8.2) in which the $(\Gamma)_g$ are replaced by the $(\Gamma)_{g+1}$ defined in
$W_{g+1}$, and subtract the resulting equations from (8.2). We obtain


(8.3) [Peay — (een Vi, S + [eee — (Peden Yi, F = 0. 
Because of (8.1) and the fact that (3.2) and (3.3) are satisfied along $S_g$, we see
that (8.3) are satisfied identically in t. Hence integration along any $S_g$ gives
the same result, whether $S_g$ is regarded as a curve of $W_g$ or as a curve of $W_{g+1}$.

Next we observe that the values of the unknowns at $X_1$ are the same whether
the integration has proceeded from P to $X_1$ along $C_1$, or from P to $V_1$ along C
and then along $S_1$ from $V_1$ to $X_1$; for all these curves lie in W(P). We indicate
this symbolically by

(8.4) $PX_1 = PV_1X_1$.

Now suppose, in order to establish an induction, that the values at $X_{g-1}$ are the
same whether we have proceeded along $C_1$ to $X_{g-1}$ or along C to $V_{g-1}$ and then
along $S_{g-1}$ from $V_{g-1}$ to $X_{g-1}$. That is, we assume that

(8.5) $PX_{g-1} = PV_{g-1}X_{g-1}$.

Then the values at $X_g$ as found by proceeding along $C_1$ from P to $X_g$ are the same
as those obtained from considering the curve C from P to $V_g$ and then the curve $S_g$
from $V_g$ to $X_g$. That is,

(8.6) $PX_g = PV_gX_g$.

For, to values at $X_{g-1}$ given by integration along $C_1$ from P to $X_{g-1}$, the corresponding
values at $X_g$ are the same for the integration along $X_{g-1}X_g$ as for the
integration along $X_{g-1}V_{g-1}V_gX_g$, since all curves involved lie in $W_g$; i.e.,

(8.7) $X_{g-1}X_g = X_{g-1}V_{g-1}V_gX_g$.

Combining (8.7) and (8.5), we establish (8.6). Since the hypothesis of the
induction holds for g = 1 because of (8.4), the relation (8.6) holds for any g,
and in particular for g = f, i.e., at Q. Therefore the values $(y^i)_Q$ and $(Y^i_{a_h})_Q$
determined by the curve C covered by $W_1, \cdots, W_f$ are the same as those determined
by any other simple curve of class C′ lying in $W_1, \cdots, W_f$ and passing through
them in their natural order.
















Finally, consider any simple curve $C_2$ of class C′, lying in U, joining P and Q,
and distinct from C. As above $C_2$ may be covered with a finite number of
neighborhoods and gives rise to a set of values $(y^i)_Q$ and $(Y^i_{a_h})_Q$ corresponding to
the given set of values at P. Suppose that C and $C_2$ are imbedded in a one-parameter
family of curves joining P to Q and such that for the parameter value
p = 0 we have the curve C and for p = 1 we have $C_2$. Assume also that the coördinates
$x^\alpha$ on curves of the family are continuous functions of p for 0 ≤ p ≤ 1.
Hence if some member of the family, say $C_{\bar p}$, of parameter $\bar p$ is covered by a
finite set of neighborhoods, $Y_1, \cdots, Y_f$, there exists a δ > 0 such that for
$|p - \bar p| < \delta$ all curves of parameter p lie in $Y_1, \cdots, Y_f$ and pass through them
in their natural order. Because of this property and the last italicized statement,
the $[y^i(p)]_Q$ and $[Y^i_{a_h}(p)]_Q$ are differentiable with respect to p for 0 ≤ p ≤ 1,
and in fact the derivatives are zero for each value of p. Hence the unknowns
are uniquely determined at Q for given initial values satisfying (3.2) and (3.3)
at P by integrating along any simple curve of class C′ joining P to Q and lying
in U.

THEOREM VII. Theorem VI remains true if the spherical neighborhood is replaced
by an open simply connected domain.

From (7.2) we have the fact that any two sets of initial values of $Y^i_\alpha$ which
satisfy (3.2) and (3.3) must have the relation

(8.8) $(Y^i_\alpha)_0 = a^i_j (Y'^j_\alpha)_0$,
where $\| a^i_j \|$ is orthogonal. Hence the corresponding y's satisfy
(8.9) $y^i = a^i_j y'^j + b^i$,

where the $b^i$ are constants determined by the given initial values of the $y^i$ and $y'^i$.
Therefore, the above imbedding is determined to within a motion in the Euclidean
space.
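
The last statement can be checked on a small example. The sketch assumes a two-dimensional Euclidean space, a rotation as the orthogonal matrix of constants, and hypothetical sample points; it applies y → a y + b as in (8.9) and compares mutual distances before and after.

```python
import math

def rotation(theta):
    # An orthogonal 2x2 matrix of constants (hypothetical example values).
    c, s = math.cos(theta), math.sin(theta)
    return [[c, -s], [s, c]]

def move(a, b, y):
    # y -> a y + b, the relation (8.9).
    return [sum(a[i][j] * y[j] for j in range(2)) + b[i] for i in range(2)]

def dist(u, v):
    return math.sqrt(sum((p - q) ** 2 for p, q in zip(u, v)))

a = rotation(0.7)
b = [1.5, -2.0]
points = [[0.0, 0.0], [1.0, 0.0], [0.3, 2.0], [-1.2, 0.4]]
moved = [move(a, b, y) for y in points]

# Largest change in any mutual distance; zero up to rounding for a motion.
max_change = max(abs(dist(points[i], points[j]) - dist(moved[i], moved[j]))
                 for i in range(len(points)) for j in range(i))
print(max_change)
```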


PRINCETON UNIVERSITY. 











REMARK ON THE THEOREM OF GREEN 
By S. BOCHNER


The theorem of Green is an identity between an integral over a compact
region R of an orientable locally Euclidean separable space S of class two¹ whose
dimension will be denoted by n, and an integral over the boundary B of R, in
case B is formed by “hypersurfaces”:


(1) $\int_R \operatorname{div} \lambda \, dv = \int_B \lambda^i \nu_i \, d\omega.$


If R is closed, that is, compact and equal to S, thus having “no boundary”,
the integral over the boundary is to be put equal to zero:


(2) $\int_S \operatorname{div} \lambda \, dv = 0.$


If R is contained in one coördinate neighborhood, the proof of (1) is comparatively
simple, provided the boundary B is sufficiently smooth with respect to
the coördinate system.² But the passage from the local case to a domain R
in the large is rather laborious. It requires a cellular subdivision of R into
sufficiently small subregions whose boundary is sufficiently smooth, an application
of the local theorem to each subregion, and finally a justification of the
mutual cancellation of the boundary terms arising from the artificial cellular
partitions. Now this procedure is much too complicated and heavy in the
case of formula (2) or in case of formula


(3) $\int_R \operatorname{div} \lambda \, dv = 0 \qquad (\lambda^i = 0 \text{ on } B).$
We want to show that these two formulas can be deduced in a much simpler 
fashion, even avoiding any complication that might be inherent to the local 
theorem itself. In particular we shall eliminate from the proof the concept of 
[(n — 1)-dimensional] volume on the boundary B. 

Our space S being of class two, we can consider on it tensors of class one 
and tensor densities of class one. In particular, we assume the existence of a 
positive (non-vanishing) scalar density which, as in the special case of Riemann 
spaces, will be denoted by $\sqrt{g}$; in going over from coördinates $(x_1, \cdots, x_n)$
to coördinates $(y_1, \cdots, y_n)$, the quantity $\sqrt{g}$ is to be multiplied by the jacobian

Received December 19, 1936. 

1 That is, a space allowing local coördinate transformations with continuous first and
second partial derivatives and a positive (non-vanishing) Jacobian.

2 A. Duschek and W. Mayer, Differentialgeometrie, 1930, vol. II, p. 237.





















$|\partial x/\partial y|$.³ With such a scalar density we can form an invariant volume element
(4) $dv = dv_x = \sqrt{g}\, dx_1 \cdots dx_n.$
It gives rise to a regular Lebesgue measure and a Lebesgue integral as in the
Euclidean case $\sqrt{g} = 1$; the only material difference being that in the case of
every closed space S the total measure of S is finite.

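If $\lambda^i$ is a contravariant vector, the quantity
(5) $\operatorname{div} \lambda = \frac{1}{\sqrt{g}} \frac{\partial(\sqrt{g}\,\lambda^i)}{\partial x_i}$

is an absolute scalar. This follows easily from the fact that the (n − 1)-dimensional
minors $A_{ij}$ (with appropriate signs) of the matrix

(6) $\left\| \frac{\partial x_i}{\partial y_j} \right\|$
satisfy the relation
(7) $\frac{\partial A_{ij}}{\partial y_j} = 0 \qquad (i = 1, \cdots, n).$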


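Hence we can form the integral of div λ over any bounded measurable set A.
If A is contained in a coördinate neighborhood, then


(8) $\int_A \operatorname{div} \lambda \, dv = \int_A \frac{\partial(\sqrt{g}\,\lambda^i)}{\partial x_i} \, dx_1 \cdots dx_n.$


Suppose now that A is a rectangle $a_\nu < x_\nu < b_\nu$, ν = 1, ⋯, n, and that $\lambda^i$
vanishes on the boundary of A. In this case, for each i,


$\int_A \frac{\partial(\sqrt{g}\,\lambda^i)}{\partial x_i} \, dx_1 \cdots dx_n = 0$ (no summation),


and therefore

(9) $\int_A \operatorname{div} \lambda \, dv = 0.$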
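
A discrete version of this step may help to see why the integral vanishes. The sketch below assumes n = 2, the unit square as the rectangle A, and a hypothetical density and vector field chosen only so that the vector vanishes on the boundary. The derivative of √g λ^i across each cell is replaced by a difference of face values, so that the sum over cells telescopes in each coordinate direction to boundary values, as the integration with respect to x_i does in the text.

```python
import math

N = 64            # cells per side; a power of two keeps the grid points exact
H = 1.0 / N

def sqrt_g(x1, x2):
    # Hypothetical positive scalar density (an assumption).
    return 1.0 + x1 * x1

def lam(x1, x2):
    # Hypothetical contravariant vector vanishing on the boundary
    # of the rectangle 0 < x1 < 1, 0 < x2 < 1.
    w = (math.sin(math.pi * x1) * math.sin(math.pi * x2)) ** 2
    return (w * (1.0 + x2), w * math.cos(3.0 * x1))

def flux(i, x1, x2):
    # The quantity sqrt(g) * lambda^i appearing in (5) and (8).
    return sqrt_g(x1, x2) * lam(x1, x2)[i]

def integral_of_divergence():
    total = 0.0
    for r in range(N):
        for s in range(N):
            xm, ym = (r + 0.5) * H, (s + 0.5) * H
            # Difference quotients across the cell, times the cell area.
            d1 = (flux(0, (r + 1) * H, ym) - flux(0, r * H, ym)) / H
            d2 = (flux(1, xm, (s + 1) * H) - flux(1, xm, s * H)) / H
            total += (d1 + d2) * H * H
    return total

value = integral_of_divergence()
print(value)
```

The printed value should differ from zero only by rounding, whatever density is chosen, since only the boundary values of the vector enter.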


Also, the integral of div λ is 0 over any open set on which $\lambda^i$ vanishes. Hence
we obtain the first result that (2) is true if $\lambda^i$ vanishes outside some “rectangle”
A. If S is closed, and hence bicompact, we can cover it by a finite number of
neighborhoods $U_1, \cdots, U_m$, whose closures are contained in “rectangles”
$A_1, \cdots, A_m$ respectively. Corresponding to each μ, μ = 1, ⋯, m, we can
easily find a neighborhood $V_\mu$ between $U_\mu$ and $A_\mu$ and a non-negative scalar
function $\varphi_\mu$ of class one in $A_\mu$ which is ≥ 1 in $U_\mu$ and 0 outside $V_\mu$. Completing
$\varphi_\mu$ by values 0 outside $A_\mu$ we have, throughout S, $\varphi_1 + \cdots + \varphi_m \ge 1$. Hence the
functions of class one


3 O. Veblen, Invariants of Quadratic Differential Forms, 1927, pp. 19-25.










(10) $\psi_\mu = \frac{\varphi_\mu}{\varphi_1 + \cdots + \varphi_m}$


have the following properties: each vanishes outside some rectangle and their
sum is 1. For fixed μ we multiply the vector $\lambda^i$ by the scalar $\psi_\mu$. The resulting
vector $\lambda^i_{(\mu)}$ vanishes outside a rectangle, and hence satisfies (2). But div λ =
$\sum_{\mu=1}^{m} \operatorname{div} \lambda_{(\mu)}$, and this completes the proof of (2).
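
The normalisation (10) is easy to try out. The sketch assumes S to be a circle described by an angle, covered by four overlapping arcs in place of the rectangles, with hypothetical bumps of class one. Here the sum of the φ's is only bounded below by a positive constant rather than by 1, which is all that the division in (10) needs.

```python
import math

M = 4                      # number of arcs covering the circle
HALF_WIDTH = 1.2           # half-length of each arc; arcs overlap since 1.2 > pi/4

def arc_distance(x, centre):
    # Distance along the circle between the angles x and centre.
    d = abs(x - centre) % (2.0 * math.pi)
    return min(d, 2.0 * math.pi - d)

def phi(mu, x):
    # Non-negative bump of class one, positive inside arc mu, zero outside.
    centre = 2.0 * math.pi * mu / M
    u = arc_distance(x, centre) / HALF_WIDTH
    return (1.0 - u * u) ** 2 if u < 1.0 else 0.0

def psi(mu, x):
    # Formula (10): phi_mu divided by the sum of all the phi's.
    return phi(mu, x) / sum(phi(nu, x) for nu in range(M))

samples = [2.0 * math.pi * k / 360.0 for k in range(360)]
# Largest deviation of psi_1 + ... + psi_m from 1 over the sample points.
max_defect = max(abs(sum(psi(mu, x) for mu in range(M)) - 1.0) for x in samples)
print(max_defect)
```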
Formula (3) can be proved in a similar way provided $\lambda^i$ is strictly regular up
to the boundary, this expression meaning that, if completed by values 0 outside
R, it is a vector of class one throughout S, so that (3) may be written


(11) $\int_S \operatorname{div} \lambda \, dv = 0 \qquad (\lambda^i = 0 \text{ outside } R + B).$


In fact, we can construct non-negative functions of class one $\varphi_1, \cdots, \varphi_m$,
each vanishing outside some rectangle and whose sum is ≥ 1 for all points of
R + B. The functions (10) will be of class one in some neighborhood U of
R + B, although perhaps not throughout S. But $\lambda^i$ vanishes outside R + B,
and therefore the vectors $\lambda^i_{(\mu)}$ are again of class one, and we can again prove (3)
from


(12) $\int_S \operatorname{div} \lambda \, dv = \sum_{\mu=1}^{m} \int_S \operatorname{div} \lambda_{(\mu)} \, dv.$

We can make further statements if S is a Riemann space with a positive-
definite fundamental tensor $g_{ij}$ having the following property. Corresponding
to any point $x^0$ of S there exists a neighborhood $U(x^0)$ such that for any points x, y
of $U(x^0)$ the function Ω(x, y), which is the square of the geodesic distance between
x and y, has continuous first partial derivatives in $(x_1, \cdots, x_n; y_1, \cdots, y_n)$
and that, for some constant $C = C(x^0)$,
(13) $\| \operatorname{grad}_x \Omega \|^2 = g^{ij}(x) \frac{\partial \Omega}{\partial x_i} \frac{\partial \Omega}{\partial x_j} \le C\,\Omega(x, y).$

The neighborhood $U(x^0)$ entering our condition can be assumed to be a
sphere with center at $x^0$: $\Omega(x, x^0) < \rho$, ρ > 0. If $x^0$ traverses a set A whose
closure is compact, then by the Heine-Borel theorem, this radius ρ and the
constant $C(x^0)$ may be chosen uniformly with respect to A: ρ = ρ(A), C =
C(A). If 0 <r < ry = 4 (A), then A, will denote the open set consisting of 
all points x for which Q(x, z°) < r for some 2? C A. Let g(t) be any non- 
negative function of the real variable ¢ which has a continuous first derivative 
in 0 < t S 3, is positive forO < t S f}andOforl St <3. If 3r < 7m, the 
expression 


(14) P,(z) = [ o( M2) ae 4 [ oY) ae, 



















defines a non-negative function of class one in $A_{3r}$ which has the value 1 on A
and the value 0 outside $A_{2r}$. Completing $P_r(x)$ by the values 0 outside $A_{3r}$,
we obtain a non-negative function of class one throughout S which is ≤ 1
everywhere, has the value 1 on A and the value 0 outside $A_{2r}$. Owing to the
nature of our function φ(t), the denominator is bounded from below by a constant
multiple of $r^n$ uniformly for all points x in $A_{3r}$, and both the numerator
and denominator are bounded from above by a constant multiple of $r^n$, for the
given function φ(t), and, likewise, if this function is replaced by φ′(t). Forming
now the partial derivatives of $P_r(x)$ and using (13), we find that


XQ 


(15) || grad, P(x) || < =f uniformly for z C A,. 

We are now in a position to drop completely in formula (3) the requirement
that $\lambda^i$ be strictly regular up to the boundary and to interpret the boundary
condition “$\lambda^i = 0$ on B” as a very light condition requiring that $\lambda^i(x)$ have limit
values 0 as x approaches the boundary B from within the region R.

THEOREM. If R is an open region whose closure R + B is compact and $\lambda^i$
is a contravariant vector of class one in R, then


(16) $\int_R \operatorname{div} \lambda \, dv = 0$
holds, provided

(17) $\int_R |\operatorname{div} \lambda| \, dv < \infty$
and 

(18) lim ~ E Vaqhw dv = 0. 


Proof. Condition (17) simply states that our integral shall also be absolutely
convergent and condition (18) requires that the “length” of $\lambda^i(x)$ shall “in
average” tend to zero as x tends to the boundary, in the sense that the integral
of $\| \lambda^i(x) \|$ over the set of points having a distance less than r from the boundary
shall, as r tends to 0, be small in relation to the quantity r.

Now we shall give the proof itself. We consider the region R, the boundary
B, and the exterior $\bar R$. In $B + \bar R$ we complete $\lambda^i$ by values 0, and we multiply
$\lambda^i(x)$ by the function


(19) $Q_r(x) = 1 - P_r(x),$


the function $P_r(x)$ being formed with respect to the compact set B. It is easily
seen that $\lambda^i_{(r)} = Q_r \cdot \lambda^i$ is strictly regular up to the boundary and vanishes on B.
Hence (16) holds for $\lambda^i_{(r)}$. All we have to prove now is the limit relation


(20) $\int_R \operatorname{div}(\lambda - \lambda_{(r)}) \, dv = \int_{R \cdot B_{2r}} \operatorname{div}(P_r \lambda) \, dv \to 0$ as r → 0.










But this follows from
$\operatorname{div}(P_r \cdot \lambda) = P_r \cdot \operatorname{div} \lambda + \operatorname{grad} P_r \cdot \lambda$


in connection with (15), (17), (18) and $|P_r(x)| \le 1$.

In the theorem we have just proved, the boundary may in part or totally
consist of points which are limit points of R and not of $\bar R$; in other words, the
theorem also takes care of the case in which the vector has singularities along
lower dimensional manifolds. In particular it can be concluded that in Green’s
theorems (1), (2), (3) the vector $\lambda^i$ may cease to be of class one along sufficiently
smooth manifolds of dimension ≤ n − 2 as long as its length stays bounded in
the neighborhood of these manifolds.

We can also make a statement regarding equation (1) itself. Let S be closed,
and suppose that on some given boundary B it is possible to define an area
element dω and a vector $\nu_i$ such that (1) holds for every vector $\lambda^i$ which is defined
and of class one in a neighborhood of R + B. Then the same is true for
the exterior $\bar R$ of R + B, which means that


(21) $\int_{\bar R} \operatorname{div} \lambda \, dv = -\int_B \lambda^i \nu_i \, d\omega$


for every function $\lambda^i$ which is defined and of class one in a neighborhood of
$\bar R + B$, provided that the (n-dimensional) volume of the set B is 0. In fact,
(21) follows immediately from (1) and (2) whenever $\lambda^i$ is defined and of class
one throughout S. But the values of the two integrals in (21) depend only
on the values of $\lambda^i$ on $\bar R + B$. Hence (21) will hold for a vector $\lambda^i$ which, without
altering its values on $\bar R + B$, can be modified into a vector which is of class
one throughout S. Now if $\lambda^i$ is of class one in a neighborhood of $\bar R + B$, this
modification can be obtained by multiplying $\lambda^i$ with a function $P_r(x)$ that belongs
to the compact set $\bar R + B$ and by putting it 0 in other points.


PRINCETON UNIVERSITY. 











ANALYTIC MAPPING OF COMPACT RIEMANN SPACES INTO 
EUCLIDEAN SPACE 


By S. BOCHNER


Recently H. Whitney¹ has proved that an n-dimensional separable coördinate
space S of class $C_q$ (1 ≤ q ≤ ∞) can be mapped topologically onto the Euclidean
$E_{2n+1}$ in such a manner that the mapping functions


(1) $t_\nu = t_\nu(x_1, \cdots, x_n) \qquad (\nu = 1, \cdots, 2n + 1)$
belong to class $C_q$ on S and have a Jacobian of rank n throughout S; the quantities
$x_1, \cdots, x_n$ in (1) are local coördinates on S varying with the neighborhood.


It is not known whether for an analytic space S the mapping functions (1) 
can be chosen analytic. It is the purpose of the present paper to point out that 
they can be so chosen provided S is compact and has an analytic Riemann 
metric. 

The line of reasoning is very simple. With the fundamental tensor g;;(x), 
we form the Laplacian 


1 a i is 
(2) se = 2 (019 4) = oe. 
* 2 


In the Hilbert space H of all square integrable functions on S, the Laplacian 
is essentially the inverse of a completely continuous operator. Therefore the 
solutions φ of the equation


(3) $\Delta\varphi = \lambda\varphi$


form a complete basis in H. But the solutions of (3) are analytic if the coefficients
$g_{ij}$ are so. Hence every function f(x) on S is the limit in square mean
of analytic functions. For differentiable functions on S we shall prove more;
if t(x) belongs to C,,, then corresponding to any « > 0 there exists an analytic 
function $\psi^\varepsilon(x)$ such that the function $t(x) - \psi^\varepsilon(x)$ and its gradient are in absolute value less
than ε throughout S.² We now apply this approximation to the functions (1).
It is easy to see that, for ε sufficiently small, the approximating transformation


(4) $t_\nu = \psi_\nu^\varepsilon(x_1, \cdots, x_n)$
will have the same mapping properties as the original transformation (1), which
proves our assertion.
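
On the simplest compact analytic Riemann space, the circle with Δφ = φ″, the solutions of (3) are the trigonometric functions, and the approximation step can be tried numerically. The sketch assumes the hypothetical function t(x) = exp(sin x) and a truncation at frequency 8; it forms a finite combination ψ of eigenfunctions, which is analytic, and measures how far ψ and its derivative are from t and its derivative. It illustrates the statement only in this one-dimensional case and is not the method of proof used in the paper.

```python
import math

M = 256          # quadrature points on the circle
K = 8            # highest eigenfunction frequency retained

def t(x):
    # Hypothetical smooth function on the circle (an assumption).
    return math.exp(math.sin(x))

def dt(x):
    # Its derivative, used only to measure the error of the gradient.
    return math.cos(x) * math.exp(math.sin(x))

nodes = [2.0 * math.pi * j / M for j in range(M)]

def coefficient(func, k):
    # Fourier coefficients (a_k, b_k) by the trapezoidal rule.
    a = sum(func(x) * math.cos(k * x) for x in nodes) * 2.0 / M
    b = sum(func(x) * math.sin(k * x) for x in nodes) * 2.0 / M
    return a, b

COEFFS = [coefficient(t, k) for k in range(K + 1)]

def psi(x):
    # Finite combination of the eigenfunctions 1, cos kx, sin kx.
    total = COEFFS[0][0] / 2.0
    for k in range(1, K + 1):
        a, b = COEFFS[k]
        total += a * math.cos(k * x) + b * math.sin(k * x)
    return total

def dpsi(x):
    # Derivative of psi, computed term by term.
    total = 0.0
    for k in range(1, K + 1):
        a, b = COEFFS[k]
        total += k * (-a * math.sin(k * x) + b * math.cos(k * x))
    return total

check = [2.0 * math.pi * j / 1000.0 for j in range(1000)]
err_value = max(abs(t(x) - psi(x)) for x in check)
err_gradient = max(abs(dt(x) - dpsi(x)) for x in check)
print(err_value, err_gradient)
```

Raising K in this sketch makes both errors smaller, which is the sense in which ε can be taken arbitrarily small.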


Received February 16, 1937.

1 Differentiable Manifolds, Annals of Math., vol. 37 (1936), pp. 645-680; p. 654, Theorem 1.

2 In order to prove this conclusion it would be sufficient to assume that t(x) belongs to
$C_1$. But the proof would become more elaborate.







The Laplacian Δφ on a compact space S was treated by Hilbert (for n = 2)³
and by G. Giraud.⁴ In order to avoid too many references to these and other
papers, which would have to be supplemented in minor points in any case, we
shall reproduce a number of known details in the first half of this paper.
Finally, in the last part we shall have occasion to generalize the properties 
of the Laplacian from scalar functions to tensors of arbitrary rank. 


Part I. Definitions 

1. We shall consider a space S which is a compact orientable coördinate space
of an arbitrary dimension n with an analytic positive-definite fundamental
tensor $g_{ij}(x)$. In order to avoid a known trivial complication in writing, we
shall exclude n = 2, and thus assume n ≥ 3.

The family of analytic functions and tensors on S, or on a specified open set
in S, will be denoted by $C_\omega$, and $C_q$, 0 ≤ q ≤ ∞, will denote the family of functions
or tensors of class q.


2. A point on S will be denoted by 2, y, z, #, &, 7, --- , its coérdinates by a 
corresponding subscript. The invariant volume element on S gives rise to a 
theory of Lebesgue measure and Lebesgue integration with the customary 
properties, except for the fact that the total measure of S is finite. The measure 
element will be denoted by dv, or more specifically, by dv, , dv, , dvz, etc. Thus 


[sa or [1a 


will denote the integral of the function f over the set A. In case A = S, we 


shall simply write 


[sa or [ 12am. 


3. The class of functions $f(x)$ which are bounded and measurable on $S$, or
on a specified subset of $S$, will be denoted by $B$. Furthermore we shall consider,
on the total space $S$, the Hilbert space $H$ of all (real) measurable functions
of integrable square, with the inner product

$(f, g) = \int f(x)\, g(x)\, dv_x.$

We shall also use the symbol $\|f\|^2$ for $(f, f)$. It follows easily, as in the case
of a Euclidean region $S$, that every subspace $C_q$ of $H$ is dense in $H$. It is our
main objective to prove that $C_\omega$ is likewise dense in $H$.


3 D. Hilbert, Grundzüge einer allgemeinen Theorie der linearen Integralgleichungen, 1912,
chapter 18.

4 Problèmes mixtes et problèmes sur des variétés closes, etc., Annales de la Société Polonaise
de Mathématique, vol. 12 (1933), pp. 35-54; p. 43.














MAPPING OF RIEMANN SPACES INTO EUCLIDEAN SPACE 341 


4. Functions of two variables, $f(x, y)$, $f(x, \xi)$, etc., will be defined either on
the product space $S^2 = S \times S$, or on a specified part of it. It is clear what
will be meant by the symbols $C_\omega^2$, $C_q^2$, $B^2$.


5. Corresponding to any point $x^0$ there exists a neighborhood $U = U(x^0)$
such that any two points $x, y \in U$ have a unique geodesic distance $R = R(x, y)$.
The square of the geodesic distance will be denoted by $\Omega = \Omega(x, y)$. If $U$ is
sufficiently small, then $\Omega(x, y) \in C_\omega^2$. Owing to the bicompactness of $S$ and $S^2$
we can find a number $\rho_0 > 0$ such that $\Omega(x, y) \in C_\omega^2$ if $R(x, y) < \rho_0$.

The local analyticity of $\Omega(x, y)$ becomes obvious if $\Omega(x, y)$ is expressed in
normal coördinates. The normal coördinates $z_i$ originating at the point $x_i$
are analytic functions

$z_i = z_i(x, y) = y_i - x_i + (\text{higher terms in } x \text{ and } y)$

defined for $x, y \in U(x^0)$. They are characterized by the relations

(5)  $\bar g_{ij}(z)\, z_j = g_{ij}(x)\, z_j,$

in which $\bar g_{ij}(z)$ is the fundamental tensor expressed in the $z$-coördinates originating
at the variable point $x$, with the initial values $\bar g_{ij}(0) = g_{ij}(x)$; the tensor
$g_{ij}(x)$ being given in a fixed coördinate system of which $x$ and $y$ are arbitrary
points. Important properties of the functions $z_i(x, y)$ are


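(6)  $\dfrac{\partial z_i}{\partial y_j} = \delta_{ij}, \qquad \dfrac{\partial z_i}{\partial x_j} = -\delta_{ij}, \qquad \text{if } z = 0,$

(7)  $\dfrac{\partial \bar g(z)}{\partial z_i} = 0, \qquad \text{if } z = 0,$

(8)  $\Omega(x, y) = \bar g_{ij}(z)\, z_i z_j = g_{ij}(x)\, z_i z_j.$⁵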


6. The family $C^2_{q,r}$, $0 < r < n$, consists of functions $Q(x, y)$ with the following
properties. $Q(x, y)$ is not defined for $x = y$. Everywhere else on $S^2$ it belongs
to class $C_q^2$. Inside some belt $R(x, y) < \rho$, $Q(x, y)$ and each of its derivatives
is representable as a quotient


(9)  $\dfrac{A(x, y)}{R(x, y)^{\nu}};$


also, within this belt, $Q(x, y)$ has the order of magnitude $O[R(x, y)^{-r}]$, and each
derivative of order $q_0$ ($\le q$) has the order of magnitude $O[R(x, y)^{-r-q_0}]$.

We shall also consider a class of functions $B^r$, $0 < r < n$. The function
$Q(x, y)$ belongs to $B^r$ if it is measurable on $S^2$, bounded outside some belt $R(x, y)$


5 For the existence of normal coördinates with these properties it is not necessary to
assume that our space is analytic. They exist if $S$ is a space of class $p \ge 2$, and they are
themselves of class $p - 2$. Checking all steps of our argument, we could easily see that
our Theorem I holds if $p \ge 6$, but normal coördinates can be avoided altogether and the
class of the space reduced by several units. Compare G. Giraud, loc. cit.








$< \rho$, and $O[R(x, y)^{-r}]$ within this belt. For such functions we shall consider
the process of convolution. For instance

(10)  $Q' * Q'' = \int Q'(x, \xi)\, Q''(\xi, y)\, dv_\xi.$


7. The symbol $\Phi_\rho(t)$ will denote any function in $0 \le t < \infty$ with the following
properties. The function has derivatives of all orders, is non-negative throughout,
has the value 1 in $0 \le t \le \tfrac{1}{2}\rho$, values between 0 and 1 in $\tfrac{1}{2}\rho < t \le \rho$, and
the value 0 in $\rho \le t < \infty$.
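One concrete choice of such a function can be written down by the standard quotient construction. The following is only an illustrative sketch; the auxiliary function $h$ is an assumption introduced here and does not occur in the original text.

```latex
% Auxiliary function h (an assumption of this sketch): infinitely differentiable
% on the real line, zero for s <= 0 and positive for s > 0.
\[
  h(s) = \begin{cases} e^{-1/s}, & s > 0, \\ 0, & s \le 0, \end{cases}
  \qquad
  \Phi_\rho(t) = \frac{h(\rho - t)}{h(\rho - t) + h\!\left(t - \tfrac{1}{2}\rho\right)} .
\]
% The denominator never vanishes, since rho - t and t - rho/2 cannot both be <= 0.
% For t <= rho/2 the second term of the denominator is 0, so Phi_rho(t) = 1.
% For t >= rho the numerator is 0, so Phi_rho(t) = 0.
% For rho/2 < t < rho both terms are positive, so 0 < Phi_rho(t) < 1.
```

Any other function with the listed properties would serve equally well; nothing in the sequel depends on the particular formula.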


8. The symbol $P(x, y)$ will denote any fixed function with the following
properties:

(11)  $P(x, y) = P(y, x),$

(12)  $P(x, y) \in C^2_{\infty, n-2},$

and there exists a $\rho > 0$ such that

(13)  $P(x, y) = \dfrac{\gamma}{R(x, y)^{n-2}}, \qquad \text{if } R(x, y) < \rho,$

for some prescribed constant $\gamma$. We shall prescribe

(14)  $\gamma = \dfrac{\Gamma(\tfrac{1}{2}n - 1)}{4\pi^{n/2}}.$

The existence of such a function is trivial. For example, the function

(15)  $P(x, y) = \gamma\, \dfrac{\Phi_\rho[R(x, y)]}{R(x, y)^{n-2}}$

will do.

Any function $P(x, y)$ with these properties has the further important property

(16)  $\Delta_x P(x, y) \in C^2_{\infty, n-2}.$


This property states essentially that the Laplacian, if applied to our function
$P(x, y)$, does not raise the exponent of its singularity on the diagonal $x = y$;
whereas, as a rule, if applied to a function from $C^2_{q,r}$, it increases the exponent
$r$ by 2. The proof of (16) follows easily if $R(x, y)$ is expressed in normal
coördinates issuing from $x$; in such coördinates $z_i$ we have⁶

$\Delta_x R^{2-n} = \left(1 - \dfrac{n}{2}\right) \dfrac{\partial \log \bar g(z)}{\partial z_i}\, z_i\, R^{-n},$

and, by (7), the factor of $R^{-n}$ on the right is $O(R^2)$.
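The formula for the Laplacian of $R^{2-n}$ can be checked from the radial form of the Laplacian in normal coördinates. The sketch below assumes that $\bar g$ denotes the determinant of the fundamental tensor in normal coördinates, normalized so that $R^2 = z_i z_i$, and writes $\partial/\partial R$ for differentiation along a geodesic ray; this notation is introduced here for illustration only.

```latex
% Radial part of the Laplacian in normal coordinates (standard formula;
% \bar g = det(\bar g_{ij}) is an assumption of this sketch):
\[
  \Delta f(R) = f''(R) + \frac{n-1}{R}\, f'(R)
              + \frac{\partial \log \sqrt{\bar g}}{\partial R}\, f'(R).
\]
% For f(R) = R^{2-n} the first two terms cancel:
\[
  f'' + \frac{n-1}{R}\, f' = (2-n)(1-n)\,R^{-n} + (n-1)(2-n)\,R^{-n} = 0 .
\]
% Along a geodesic ray, \partial/\partial R = (z_i / R)\, \partial/\partial z_i, hence
\[
  \Delta R^{2-n}
  = \frac{z_i}{2R}\, \frac{\partial \log \bar g}{\partial z_i}\,(2-n)\, R^{1-n}
  = \Bigl(1 - \frac{n}{2}\Bigr) \frac{\partial \log \bar g}{\partial z_i}\, z_i\, R^{-n}.
\]
```

Since the first derivatives of $\bar g$ vanish at $z = 0$, the factor $z_i\, \partial \log \bar g / \partial z_i$ is of order $R \cdot R$, which is the $O(R^2)$ used in the text.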





6 W. Feller, Lösungen der linearen partiellen Differentialgleichungen zweiter Ordnung
vom elliptischen Typus, Mathematische Annalen, vol. 102 (1930), pp. 633-649; p. 639,
footnote.






















It will be important for us to consider not the Laplacian $\Delta\varphi$ itself, but the
operator

(17)  $\Delta^c\varphi = \Delta\varphi - c\varphi$

for some positive constant $c$. The constant $c$ being fixed, we shall write

(18)  $K(x, y) = \Delta^c_x P(x, y), \qquad \bar K(x, y) = K(y, x) = \Delta^c_y P(y, x).$

These functions again have the property (16).


9. The distance $R(x, y)$ is comparable in size to the Euclidean distance

(19)  $E(x, y) = \bigl[\sum_i (y_i - x_i)^2\bigr]^{1/2}.$

Hence, the integral

(20)  $\psi(y) = \int \varphi(x)\, Q(x, y)\, dv_x$

exists for every $y$ if $\varphi(x) \in B$, $Q(x, y) \in C^2_{0,r}$, $r < n$. For fixed $Q(x, y)$ and
variable $\varphi(x)$ the integral operator (20) will be written

(21)  $\psi = Q\varphi.$

We shall be mainly interested in the operators $P\varphi$ and $K\varphi$ resulting from the
functions $P(x, y)$, $K(x, y)$ that were introduced before.


Part II. Preliminaries 


1. We shall state several properties of the expression (20), all of which are
trivially true if $Q(x, y)$ happens to vanish inside some belt $R(x, y) < \rho_0$.

Lemma 1. If $\varphi(x) \in B$, $Q(x, y) \in C^2_{0,r}$, $r < n$, then the function (20) belongs
to $C_0$; more precisely, the modulus of uniform continuity of $\psi(y)$ depends only on
$M = \text{l.u.b.}_{x \in S}\, |\varphi(x)|$.

Proof. With the function $\Phi_\sigma(t)$ of Part I, §7, we form the expression
$\psi_\sigma(y) = \int \varphi(x)\, Q(x, y)\,[1 - \Phi_\sigma\{R(x, y)\}]\, dv_x$. For $\sigma$ fixed, our lemma holds for $\psi_\sigma(y)$ in the
place of $\psi(y)$. But $\psi_\sigma(y)$ tends to $\psi(y)$ as $\sigma \to 0$, uniformly in $y$ and $M$.

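Lemma 2. If $\varphi(x) \in B$, $Q(x, y) \in C^2_{\nu,r}$, $r < n - \nu$, then the function (20)
belongs to $C_\nu$, again uniformly in $M$, and the partial derivatives of $\psi(y)$ may be
obtained by partial differentiation under the integral. For example,

(22)  $\dfrac{\partial\psi}{\partial y_1} = \int \varphi(x)\, \dfrac{\partial Q}{\partial y_1}\, dv_x.$

Proof. By using induction, it is sufficient to consider the case $\nu = 1$. Since
$\partial Q / \partial y_1 \in C^2_{0,r+1}$, $r + 1 < n$, the integral

(23)  $g(y) = \int \varphi(x)\, \dfrac{\partial Q}{\partial y_1}\, dv_x$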










represents a function in $C_0$. We restrict ourselves to a coördinate neighborhood
$(y_1, \dots, y_n)$ and integrate the function $g(y)$ with respect to the variable $y_1$
between the fixed limits $\alpha$ and $\beta$. Since the integrand on the right side is a
Lebesgue integrable function on $S^2$ we can exchange, on the right side, the
two processes of integration for almost all $(y_2, \dots, y_n)$, and hence

$\int_\alpha^\beta g(y_1, y_2, \dots, y_n)\, dy_1 = \psi(\beta, y_2, \dots, y_n) - \psi(\alpha, y_2, \dots, y_n)$

for almost all $(y_2, \dots, y_n)$. But both sides of the last relation are continuous
and, therefore, it holds throughout. Making now $\beta$ variable, we find that
$\psi(y) \in C_1$, and that $g(y) = \partial\psi / \partial y_1$.
Lemma 3. If $\varphi(x) \in C_1$ and $Q(x, y) \in C^2_{2,n-2}$, the function (20) belongs to $C_2$.

Proof. It will be sufficient to show that the function (23) belongs to $C_1$.
Relation (6) implies

$\dfrac{\partial z_i}{\partial x_j} + \dfrac{\partial z_i}{\partial y_j} = O[R(x, y)].$

Hence, remembering that $Q(x, y) = A(x, y)\, R(x, y)^{2-n}$ for $R(x, y) < \rho_0$, and
using expression (8) for $\Omega(x, y)$, we easily find that

(24)  $\dfrac{\partial Q}{\partial y_1} + \dfrac{\partial Q}{\partial x_1}$

belongs to $C^2_{1,n-2}$. Hence, on account of Lemma 2, the non-trivial part of
function (23) is the function

(25)  $h(y) = \int \varphi(x)\, \dfrac{\partial Q(x, y)}{\partial x_1}\, dv_x,$

and it is sufficient to show that the latter function belongs to $C_1$. We restrict
the point $y$ to a sufficiently small neighborhood $U(y^0)$ whose closure is contained
within a sphere $R(x, y^0) < \rho$. Forming with our previous function $\Phi_\rho(t)$ the
decomposition

$\varphi(x) = \{\varphi(x)\,\Phi_\rho[R(x, y^0)]\} + \{\varphi(x) - \varphi(x)\,\Phi_\rho[R(x, y^0)]\},$

we see immediately that the second component of $\varphi(x)$ contributes to the function
(25) a component of class $C_1$. Hence we may assume that $\varphi(x)$ and its
derivatives vanish outside some coördinate neighborhood and that $y$ is a point
of this neighborhood. Denoting the product of $\varphi(x)$ and the volume density
$g^{1/2}$ by $\vartheta(x)$, we can therefore write

$h(y) = \int_A \vartheta(x)\, \dfrac{\partial Q(x, y)}{\partial x_1}\, dx_1 \cdots dx_n,$

where $A$ is an $n$-dimensional Euclidean domain and $\vartheta(x)$ is a function of $C_1$
vanishing on the boundary of $A$. Let $y$ be a fixed point within $A$, $A_\rho$ the

























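difference of $A$ and a small sphere of radius $\rho$ around $y$, and $B_\rho$ the inner boundary
of $A_\rho$. By Green's formula,

$\int_{A_\rho} \vartheta(x)\, \dfrac{\partial Q}{\partial x_1}\, dx_1 \cdots dx_n = -\int_{A_\rho} \dfrac{\partial\vartheta}{\partial x_1}\, Q\, dx_1 \cdots dx_n + \int_{B_\rho} \cos(\xi_1, \nu)\, \vartheta(\xi)\, Q(\xi, y)\, d\sigma_\xi.$

Since $Q(\xi, y) = O(\rho^{-n+2})$, the surface integral tends to 0 as $\rho \to 0$, and therefore

$h(y) = -\int_A Q(x, y)\, \dfrac{\partial\vartheta(x)}{\partial x_1}\, dx_1 \cdots dx_n.$

By Lemma 2, $h(y)$ belongs to $C_1$.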
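The vanishing of the surface integral in the proof of Lemma 3 can be made explicit. In the following sketch, $M$ is an assumed bound for the bounded factors of the integrand, $C$ is the constant implied by the $O$-estimate of the kernel on the sphere, and $\omega_{n-1}$ is the area of the Euclidean unit sphere; these names are introduced for illustration only.

```latex
% B_rho is a Euclidean sphere of radius rho, so its area is omega_{n-1} rho^{n-1}.
\[
  \Bigl| \int_{B_\rho} (\text{bounded factors}) \cdot Q(\xi, y)\, d\sigma_\xi \Bigr|
  \;\le\; M \cdot C\,\rho^{\,2-n} \cdot \omega_{n-1}\,\rho^{\,n-1}
  \;=\; M\,C\,\omega_{n-1}\,\rho \;\longrightarrow\; 0 \qquad (\rho \to 0).
\]
```

The same count of exponents shows why the exponent $n - 2$ of the singularity is exactly the borderline at which this integration by parts still works without a boundary contribution.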

Lemma 4. If $Q(x, y) \in B^r$ and $L(x, y) \in B^s$, then $Q * L$ belongs to $B^{r+s-n}$
if $r + s - n > 0$; to $B^\epsilon$, for any $\epsilon > 0$, if $r + s - n = 0$; and to $B^0$ if $r + s - n < 0$.

Proof. The lemma is known in case $S$ is a bounded set in Euclidean space
and $R(x, y)$ has the value (19). For our present case it follows from the fact
that in every coördinate neighborhood the quantity $R(x, y)$ is majorized by
(19) from above and below.

Lemma 5. If $Q(x, y) \in C^2_{0,r}$, $r < n$, then, from some positive $\nu$ onward, the
$\nu$-th iterated kernel

(26)  $Q_\nu = Q * Q * \cdots * Q$  ($\nu$ factors)

belongs to $C^2_0$.

Proof. By Lemma 4 there exists a $Q_{\nu-1}(x, y)$ which is bounded. But

$Q_\nu(x, y) = \int Q_{\nu-1}(x, \xi)\, Q(\xi, y)\, dv_\xi = \int Q(x, \xi)\, Q_{\nu-1}(\xi, y)\, dv_\xi.$

Therefore, by Lemma 1, $Q_\nu(x, y)$ is continuous in $x$ uniformly in $y$, and
continuous in $y$ uniformly in $x$. Hence $Q_\nu(x, y) \in C^2_0$.
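The number of iterations needed in Lemma 5 can be read off from Lemma 4. In the short calculation below, $r_\nu$ is a label introduced here (not in the original) for the exponent of the class to which the $\nu$-th iterate belongs.

```latex
% r_1 = r, and by Lemma 4 each further convolution with Q replaces the exponent
% r_{\nu-1} by r_{\nu-1} + r - n, as long as this number is positive:
\[
  r_\nu = \nu r - (\nu - 1)\,n = n - \nu\,(n - r).
\]
% Hence the iterate is bounded as soon as \nu\,(n - r) > n.
% Example: r = n - 2 gives r_\nu = n - 2\nu, so any \nu > n/2 suffices.
```

The example with $r = n - 2$ is the case relevant to the kernels $P$ and $K$ of Part I.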


2. We shall now discuss the operator (21) for $\varphi \in H$. To start with, the
expression (20) is not defined for arbitrary elements $\varphi$ in $H$. But it is defined
for $\varphi$ in $B$, and the subset $B$ of $H$ is dense in $H$. This suggests the possibility
of extending the operator formally into a wider set of $H$. In fact, the operator
(21) can be extended uniquely over the whole of $H$.

Lemma 6. For any $Q(x, y) \in C^2_{0,r}$, $r < n$, the expression (20) defines a unique
operator (21) which is bounded and completely continuous. Also, for any two
such kernels $Q'(x, y)$, $Q''(x, y)$, the relation

(27)  $Q'[Q''(\varphi)] = (Q' * Q'')(\varphi)$

holds.

Proof. We first assume that $Q(x, y)$ is continuous throughout. In this case
it is known that $Q\varphi$ is bounded and completely continuous for $\varphi$ in $B$. $B$ being
dense in $H$, the operator $Q\varphi$ has a unique extension into the whole space $H$,













and this extension is again completely continuous.⁷ The value of $Q\varphi$ for an
arbitrary element $\varphi \in H$ can be computed in the following manner. Form any
sequence of elements $\{\varphi_m\}$ from $B$ of which $\varphi$ is the limit in the strong topology
of $H$: $\lim_{m \to \infty} \|\varphi - \varphi_m\| = 0$. The values $\psi_m = Q\varphi_m$ can be computed from (20)
and they are in their turn convergent to an element $\psi$ of $H$. This element is
the desired value (21). Finally, relation (27) is obvious for $\varphi \in B$ by Fubini's
theorem, and for general $\varphi$ it follows by the limiting process we have just
described.

In the case of a general kernel $Q(x, y)$ we shall apply an important criterion.⁸
An operator $Q\varphi$ is bounded and completely continuous in $H$, if corresponding
to any $\epsilon > 0$ there exists a decomposition

(28)  $Q\varphi = Q^\sigma\varphi + R^\sigma\varphi$

of the following nature. $R^\sigma\varphi$ is bounded and completely continuous, and $Q^\sigma\varphi$
is a bounded operator whose bound is $\le \epsilon$. We put

$Q^\sigma = Q(x, y)\,\Phi_\sigma[R(x, y)], \qquad R^\sigma = Q - Q^\sigma.$

$R^\sigma$ is continuous, and hence the operator $R^\sigma\varphi$ is bounded and completely
continuous. As for the operator $\psi^\sigma = Q^\sigma\varphi$, we have, for $\varphi \in B$,

(29)  $\|\psi^\sigma\|^2 = \int |\psi^\sigma(y)|^2\, dv_y = \iint \varphi(\xi)\,\varphi(\eta)\, M^\sigma(\xi, \eta)\, dv_\xi\, dv_\eta,$

where

(30)  $M^\sigma(\xi, \eta) = \int Q^\sigma(\xi, x)\, Q^\sigma(\eta, x)\, dv_x.$

The kernel $M^\sigma(\xi, \eta)$ has the following two properties:

(31)  $M^\sigma(\xi, \eta) = 0, \qquad \text{if } R(\xi, \eta) > 2\sigma;$

there exists an exponent $s$, $0 \le s < n$, and a constant $A$ such that

(32)  $|M^\sigma(\xi, \eta)| \le \dfrac{A}{R(\xi, \eta)^s}, \qquad \text{if } R(\xi, \eta) \le 2\sigma.$

Property (31) is an immediate consequence of the fact that $Q^\sigma(x, y)$ vanishes for
$R(x, y) \ge \sigma$; whereas property (32) is an easy consequence of Lemma 4. Putting

$|\varphi(\xi)\,\varphi(\eta)| \le \dfrac{|\varphi(\xi)|^2 + |\varphi(\eta)|^2}{2}$

in (29), we obtain

(33)  $2\,\|\psi^\sigma\|^2 \le \int |\varphi(\xi)|^2 \Bigl[\int |M^\sigma(\xi, \eta)|\, dv_\eta\Bigr] dv_\xi + \int |\varphi(\eta)|^2 \Bigl[\int |M^\sigma(\xi, \eta)|\, dv_\xi\Bigr] dv_\eta.$


7 S. Banach, Opérations linéaires, 1932, p. 99.
8 Banach, loc. cit., p. 96, Théorème 2.













But, owing to properties (31) and (32), there exists a positive quantity $\epsilon(\sigma)$,
tending to 0 as $\sigma$ tends to 0, such that

$\int |M^\sigma(\xi, \eta)|\, dv_\eta \le \epsilon(\sigma), \qquad \int |M^\sigma(\xi, \eta)|\, dv_\xi \le \epsilon(\sigma).$

Hence each of the integrals in (33) is $\le \epsilon(\sigma) \cdot \|\varphi\|^2$, and therefore $\psi^\sigma$ is a bounded
operator on $B$ whose bound is $\le \epsilon(\sigma)$. The extension of $\psi^\sigma$ from $B$ onto the
total space $H$ does not increase the bound,⁹ and, therefore, the assumptions of
the above mentioned criterion are fulfilled. This proves the first half of the
lemma, whereas relation (27) can be proved as in the case of a continuous kernel
$Q(x, y)$.
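A possible explicit form of the quantity $\epsilon(\sigma)$ follows from (31) and (32) by integrating in polar coördinates. In the sketch below, $M^\sigma$ stands for the kernel defined in (30), and the constant $C$, which absorbs the comparison between the invariant volume and the Euclidean one, is an assumption introduced for illustration.

```latex
\[
  \int |M^\sigma(\xi, \eta)|\, dv_\eta
  \;\le\; \int_{R(\xi,\eta) \le 2\sigma} \frac{A}{R(\xi, \eta)^{s}}\, dv_\eta
  \;\le\; C A \int_0^{2\sigma} t^{\,n-1-s}\, dt
  \;=\; \frac{C A\,(2\sigma)^{\,n-s}}{n - s} ,
\]
% which tends to 0 with sigma because s < n.
```

The estimate with the roles of $\xi$ and $\eta$ interchanged is obtained in the same way.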
Part III. The Laplacian 


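1. If in Green's formula

(34)  $\int \dfrac{1}{g^{1/2}}\, \dfrac{\partial (g^{1/2} A^i)}{\partial x_i}\, dv = 0$

[$A^i$ is a contravariant vector belonging to $C_1$], we put

$A^i = \varphi\, g^{ij}\, \dfrac{\partial\varphi}{\partial x_j}, \qquad A^i = g^{ij} \left( \varphi\, \dfrac{\partial\psi}{\partial x_j} - \psi\, \dfrac{\partial\varphi}{\partial x_j} \right),$

we obtain the relations

(35)  $-\int \varphi\, \Delta^c\varphi\, dv = \int \left[ g^{ij}\, \dfrac{\partial\varphi}{\partial x_i}\, \dfrac{\partial\varphi}{\partial x_j} + c\,\varphi^2 \right] dv,$

(36)  $\int \varphi\, \Delta^c\psi\, dv = \int \psi\, \Delta^c\varphi\, dv.$

From (35) we conclude

(37)  $(\varphi, \Delta^c\varphi) \le -c\,(\varphi, \varphi),$

and relation (36) reads

(38)  $(\varphi, \Delta^c\psi) = (\Delta^c\varphi, \psi).$

Hence, on the dense subspace $C_2$ of $H$, the operator $-\Delta^c\varphi$ is positive definite
with a lower bound $\ge c$ and symmetric. By a general theorem on operators
in Hilbert space there exists a closed extension of the operator $\Delta^c\varphi$ for which the
properties (37) and (38) remain in force.¹⁰ It will be this closure of the Laplacian
that will forthwith be denoted by $\Delta^c\varphi$; in fact, in order to simplify the writing,
we shall omit the upper index $c$, and thus write $\Delta\varphi$ instead of $\Delta^c\varphi$ until further notice.
The domain of $\Delta\varphi$ is a well defined set $D$ of $H$, $C_2 \subset D \subset H$.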
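The step from Green's formula to relation (35) can be written out as follows. This is a sketch under the assumption that the Laplacian is the invariant operator $\Delta\varphi = g^{-1/2}\, \partial (g^{1/2} g^{ij}\, \partial\varphi / \partial x_j) / \partial x_i$; the original does not display this definition in the present section.

```latex
% Product rule for the divergence of the vector A^i = \varphi g^{ij} \partial_j \varphi:
\[
  \frac{1}{g^{1/2}} \frac{\partial}{\partial x_i}
  \Bigl( g^{1/2}\, \varphi\, g^{ij} \frac{\partial \varphi}{\partial x_j} \Bigr)
  = g^{ij} \frac{\partial \varphi}{\partial x_i} \frac{\partial \varphi}{\partial x_j}
  + \varphi\, \Delta \varphi .
\]
% Integrating over S, the left side vanishes by Green's formula, so
\[
  -\int \varphi\, \Delta \varphi \, dv
  = \int g^{ij} \frac{\partial \varphi}{\partial x_i} \frac{\partial \varphi}{\partial x_j}\, dv \;\ge\; 0 .
\]
% Adding c \int \varphi^2 dv to both sides gives, with \Delta^c = \Delta - c,
\[
  -\int \varphi\, \Delta^c \varphi \, dv
  = \int \Bigl[ g^{ij} \frac{\partial \varphi}{\partial x_i} \frac{\partial \varphi}{\partial x_j}
  + c\, \varphi^2 \Bigr] dv \;\ge\; c\,(\varphi, \varphi).
\]
```

The last inequality is the lower bound expressed by (37); it uses only that the fundamental tensor is positive definite.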


9 Banach, loc. cit., p. 58.
10 M. H. Stone, Linear Transformations in Hilbert Space, 1932, p. 49, Theorem 2.12.










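Lemma 7. The operator $\Delta\varphi$ satisfies the relations

(39)  $K\varphi - \varphi = P\Delta\varphi, \qquad \text{if } \varphi \in D,$

(40)  $\bar K\varphi - \varphi = \Delta P\varphi, \qquad \text{if } P\varphi \in D.$

Proof. By the definitions given in Part I, §§8 and 9, relation (39) is equivalent
to

(41)  $\int [\varphi(x)\,\Delta_x P(x, y) - \Delta_x\varphi(x)\,P(x, y)]\, dv_x = \varphi(y).$

It is sufficient to prove it for $\varphi \in C_2$. For other $\varphi$ it follows from the continuity
of the operators $K\varphi$ and $P\varphi$. The left side of (41) is the limit, as $\epsilon \to 0$, of

(42)  $\int_{S_\epsilon} \dfrac{1}{g^{1/2}}\, \dfrac{\partial (g^{1/2}\lambda^i)}{\partial x_i}\, dv_x,$

where $S_\epsilon$ is the total space $S$ minus the geodesic sphere of radius $\epsilon$ around the
fixed point $y$, and

$\lambda^i(x) = \varphi(x)\, g^{ij}\, \dfrac{\partial P(x, y)}{\partial x_j} - g^{ij}\, \dfrac{\partial\varphi}{\partial x_j}\, P(x, y).$

By Green's formula, (42) has the value

(43)  $-\int_{B_\epsilon} \lambda^i(\xi)\, \dfrac{\partial R(\xi, y)}{\partial \xi_i}\, d\sigma_\xi,$

where $B_\epsilon$ is the boundary of $S_\epsilon$ and $d\sigma_\xi$ is the invariant surface element on $B_\epsilon$.¹¹
In order to estimate (43) we introduce normal coördinates $z_i = z_i(\xi, y)$ originating
at the fixed point $y$. We also make a linear affine transformation by which
$\bar g_{ij}(y) = \delta_{ij}$. This gives for the quantity $R(\xi, y)$ the value $(z_i z_i)^{1/2}$. Hence
we obtain, by a trivial calculation,

$\lambda^i(\xi)\, \dfrac{\partial R(\xi, y)}{\partial \xi_i} = (2 - n)\,\gamma \cdot \varphi(\xi)\, \dfrac{1}{R^{n-1}} + O\!\left(\dfrac{1}{R^{n-2}}\right) = (2 - n)\,\gamma \cdot \varphi(y)\, \dfrac{1}{\epsilon^{n-1}} + O\!\left(\dfrac{1}{\epsilon^{n-2}}\right).$

Therefore, (43) has the value

(44)  $(n - 2)\,\gamma\,\omega_{n-1}\,\varphi(y) + O(\epsilon),$

where $\omega_{n-1}$ is the $(n - 1)$-dimensional Euclidean volume of the unit-sphere
$z_i z_i = 1$. Substituting from (14), we see that (44) tends to $\varphi(y)$ as $\epsilon \to 0$,
and this completes the proof of (39).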
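The final substitution can be verified directly. The check below assumes that the constant prescribed in (14) is $\gamma = \Gamma(\tfrac{1}{2}n - 1) / (4\pi^{n/2})$ and uses the standard value of the surface area of the unit sphere; it is offered as a consistency check rather than as part of the original argument.

```latex
% Surface area of the unit sphere in n dimensions (standard value):
\[
  \omega_{n-1} = \frac{2\pi^{n/2}}{\Gamma(\tfrac{1}{2}n)} ,
  \qquad
  \Gamma(\tfrac{1}{2}n) = (\tfrac{1}{2}n - 1)\,\Gamma(\tfrac{1}{2}n - 1).
\]
% Inserting the assumed value of gamma:
\[
  (n-2)\,\gamma\,\omega_{n-1}
  = (n-2)\cdot\frac{\Gamma(\tfrac12 n - 1)}{4\pi^{n/2}}\cdot
    \frac{2\pi^{n/2}}{(\tfrac12 n - 1)\,\Gamma(\tfrac12 n - 1)}
  = \frac{n-2}{2\,(\tfrac12 n - 1)} = 1 .
\]
```

The factor $n - 2$ in the denominator is the reason the case $n = 2$ was excluded at the outset: there the singular function is logarithmic instead of a power of $R$.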


11 W. Feller, loc. cit., §2; S. Bochner, Remark on the theorem of Green, this Journal, pp.
334-338.













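Relation (40) will be proved if we show that for every element $u$ of $C_2$,

(45)  $(u, \bar K\varphi - \varphi - \Delta P\varphi) = 0.$

Obviously

$(u, \bar K\varphi) = \iint u(x)\,\varphi(y)\, K(x, y)\, dv_x\, dv_y = (\varphi, Ku)$

and

$(u, \Delta P\varphi) = (\Delta u, P\varphi) = \iint \Delta_x u(x) \cdot \varphi(y)\, P(x, y)\, dv_x\, dv_y = (\varphi, P\Delta u).$

Therefore the left side of (45) has the value

$(\varphi, Ku - u - P\Delta u),$

and this vanishes by (39).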


2. Lemma 8. If $Q(x, y) \in C^2_{2,n-2}$, $f \in C_2$, $\varphi \in H$, the relation

(46)  $\varphi = Q\varphi + f$

implies that $\varphi \in C_2$.

Proof. Iterating (46), we obtain, using relation (27),

(47)  $\varphi = Q_\nu\varphi + Q_{\nu-1}f + Q_{\nu-2}f + \cdots + f.$

Obviously, the functions $Qf$, $Q_2 f = Q(Qf)$, $\dots$, $Q_{\nu-1}f = Q(Q_{\nu-2}f)$ are all bounded.
From a sufficiently high $\nu$ onward, $Q_\nu(x, y) \in B^0$ by Lemma 5, and hence by

(48)  $\Bigl|\int Q_\nu(x, y)\,\varphi(x)\, dv_x\Bigr|^2 \le \int Q_\nu(x, y)^2\, dv_x \cdot \int \varphi(x)^2\, dv_x,$

$Q_\nu\varphi$ is bounded. Thus, by (47), $\varphi$ is bounded. Consequently by Lemma 2,
$Q\varphi \in C_1$, and hence by (46), $\varphi \in C_1$. But then, by Lemma 3, $Q\varphi \in C_2$, and
therefore, again by (46), $\varphi \in C_2$.
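The iteration that leads from (46) to (47) consists of substituting the relation into itself repeatedly. Written out, with the iterated kernels $Q_2, Q_3, \dots$ understood in the sense of (26) and (27):

```latex
\[
  \varphi = Q\varphi + f
          = Q(Q\varphi + f) + f
          = Q_2\varphi + Qf + f
          = Q_3\varphi + Q_2 f + Qf + f
          = \cdots
          = Q_\nu\varphi + Q_{\nu-1}f + \cdots + Qf + f .
\]
```

Only the first term on the right still contains the unknown function, and that term is controlled by the Schwarz inequality once the iterated kernel is bounded.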







3. Lemma 9. The equation

(49)  $(\Delta^c\varphi \equiv)\ \Delta\varphi = f$

has a solution for any $f \in H$.

Proof. A solution, if any, is unique, by (37). It is sufficient to show that
(49) has a solution for $f \in C_1$; in fact, if we show this, we prove that the operator
$f = \Delta\varphi$ has an inverse

$\varphi = Jf$

on a dense set in $H$. By (37) this inverse is a bounded operator and has therefore
an extension into the whole of $H$. If we re-invert the fully extended operator
$Jf$ we obviously obtain $\Delta\varphi$; in other words, the counter-domain of $\Delta\varphi$ is the
whole space $H$.

Substituting (49) in (39), we are led to investigate the integral equation

(50)  $K\varphi - \varphi = Pf.$

The operator $K\varphi$ being completely continuous by Lemma 6, we can apply
the generalized Fredholm theory.¹² According to this theory, the equations

(51)  $K\varphi - \varphi = 0,$

(52)  $\bar K\psi - \psi = 0$

have the same finite number of independent solutions. If we denote such basic
solutions by

$\varphi_1, \dots, \varphi_m; \quad \psi_1, \dots, \psi_m,$

for a given element $f$ the equation (50) has a solution, if, and only if,

(53)  $(\psi_\mu, Pf) = 0 \qquad (\mu = 1, \dots, m).$

We remark once for all a fact which we shall tacitly use several times, namely,
that any solution $\varphi$ or $\psi$ of (50), (51), (52) is automatically a function of $C_2$,
and therefore a function of $D$. This follows from Lemma 8 [in conjunction
with Lemma 3 for equation (50)]. Let $\varphi_\mu$ be any solution of (51). Combining
(51) and (39), we obtain

(54)  $P\Delta\varphi_\mu = 0.$

Hence for $\psi_\mu = \Delta\varphi_\mu$, we obtain from (40), $\bar K\psi_\mu - \psi_\mu = 0$. Also, if $\varphi_1, \dots, \varphi_m$
are linearly independent, then $\Delta\varphi_1, \dots, \Delta\varphi_m$ are likewise linearly independent.
Hence, the basic solutions of (52) can be assumed to have the form $\psi_\mu = \Delta\varphi_\mu$.
Now, $(\psi_\mu, Pf) = (f, P\psi_\mu)$, and vanishes by (54). Thus equation (50) has a
solution $\varphi^*$ for an arbitrary element $f \in C_1$. Comparing now relations (50)
and (39), we obtain for the function $\psi^* = f - \Delta\varphi^*$ the equation $P\psi^* = 0$. By
(40), $\psi^*$ is a solution of (52) and may be written in the form $c_1\psi_1 + \cdots + c_m\psi_m$.
In other words, the function $\varphi = \varphi^* + c_1\varphi_1 + \cdots + c_m\varphi_m$ is a solution of (49).

Theorem I. The operator $f = \Delta\varphi - c\varphi$ is for every fixed $c > 0$ the inverse
of a completely continuous operator $\varphi = Jf$ in $H$. The solutions $\varphi \in H$ of the
equation

(55)  $\Delta\varphi = \lambda\varphi$

($\lambda$ being any constant for which a solution exists) form a complete set of functions
in $H$. They are all, automatically, contained in $C_2$.

Proof. From (50) we obtain

$\varphi = K(Jf) - Pf = Jf.$


12 Banach, loc. cit., pp. 159-161.
















By Lemma 6, $Pf$ is completely continuous, and so is $K(Jf)$, since $Jf$ is at any
rate bounded. Therefore, $Jf$ is completely continuous too. Hence, $Jf$ has a
pure point spectrum and so has its inverse $\Delta^c\varphi$. Therefore, the solutions of the
equation $\Delta\varphi - c\varphi = \lambda\varphi$, or, what amounts to the same, those of equation (55)
form a complete basis in $H$. Substituting (55) in (39), we obtain $(K - \lambda P)(\varphi) = \varphi$,
and therefore $\varphi \in C_2$, by Lemma 8.


Part IV. The mapping theorem 


1. The following theorem makes full use of our assumption that the underlying
space $S$ is analytic.

Theorem II. The solutions $\varphi$ of (55) are all analytic. Thus there exists in
the function space $H$ a complete basis $\{\varphi_m\}$ whose elements $\varphi_m$ are analytic
throughout $S$.

Proof. The analytic character of the functions $\varphi_m$ is a purely local property
and follows from the following theorem which for $n \ge 3$ was first proved by
J. Hadamard.¹³

Lemma 10. If the coefficients $g_{ij}(x)$ (the tensor $g_{ij}$ being symmetric and
positive-definite), $b_i(x)$, $c(x)$, $f(x)$ are all analytic in the neighborhood of a point
$x^0 = (x_1^0, \dots, x_n^0)$, then every solution $\varphi(x)$ of the equation

(56)  $g_{ij}\, \dfrac{\partial^2\varphi}{\partial x_i\, \partial x_j} + b_i\, \dfrac{\partial\varphi}{\partial x_i} + c\,\varphi = f$

which belongs to $C_2$ is likewise analytic.

Proof. Since Hadamard does not carry out the last steps of his argument
we shall give a brief summary of it. By a fundamental theorem of S. Kowalewsky
equation (56) has always analytic solutions (locally). Hence it is sufficient
to prove our theorem for $f(x) = 0$. We denote for the moment the operator
(56) by $F\varphi$ and its formal adjoint by $F^*\varphi$, and we consider Hadamard's elementary
solution $G(x, y)$ belonging to $F^*\varphi$. $G(x, y)$ is a function of the form

(57)  $\dfrac{U(x, y)}{R(x, y)^{n-2}} + \log R(x, y) \cdot V(x, y); \qquad U(x, x) = 1,$

the coefficients $U(x, y)$, $V(x, y)$ being analytic in all $2n$ variables $x_1, \dots, x_n$,
$y_1, \dots, y_n$; and $F^*_x G(x, y) = 0$. For a sufficiently small spherical surface $B$
our solution $\varphi(x)$ of $F_x\varphi = 0$ satisfies a relation

(58)  $\varphi(y) = \int_B \Bigl\{ \rho(\xi)\, \dfrac{\partial G(\xi, y)}{\partial \nu} + \sigma(\xi)\, G(\xi, y) \Bigr\}\, d\sigma_\xi$

for every point $y$ interior to $B$; the coefficients $\rho(\xi)$ and $\sigma(\xi)$ are continuous,
and we shall not need their precise value. This is a consequence of Green's
formula, in which the volume integrals vanish because $\varphi(x)$ and $G(x, y)$ are
solutions of $F = 0$ and $F^* = 0$, respectively. The term $\varphi(y)$ emanates from the


13 Lectures on Cauchy’s Problem, 1923, p. 102. 










singularity (57) in just the same way as in the case of relation (39). In fact,
the symmetry of the function $P(x, y)$ has not been used at all for the derivation
of (39), and a logarithmic term of the form $\log R(x, y) \cdot V(x, y)$ would not alter
the estimate (44) for $n \ge 3$.

Now let $A$ be any closed point set interior to $B$. For $y \in A$, $\xi \in B$, $R(\xi, y)$
has a positive bound from below; hence we can find a region $A$ in the space
of the complex variables $y_1, \dots, y_n$ over which $R(\xi, y)$ is still analytic and has
a real part which has a positive bound from below. Since $U(\xi, y)$ and $V(\xi, y)$
can also be continued into a complex $y$-neighborhood of the origin, it follows
immediately that the function (58) is analytic in some neighborhood of the
origin.


2. We again choose a fixed constant $c > 0$, and denote by $\varphi = Jf$ the inverse
to $f = \Delta^c\varphi$ and by $J_\nu f$ the $\nu$-th iterate of $Jf$.

Lemma 11. There exists an integer $\nu \ge 1$ and a constant $A$ such that for every
$f \in H$ the relations

(59)  $|J_\nu f(x)| \le A \cdot \|f\|,$

(60)  $|\text{grad}\, J_\nu f(x)| \le A \cdot \|f\|$

hold.

Proof. We write $f_0 = f$, $f_{\mu+1} = Jf_\mu$, $\mu \ge 0$. By (50) we have the relations

$f_1 = Kf_1 - Pf_0, \qquad f_2 = Kf_2 - Pf_1, \qquad f_3 = Kf_3 - Pf_2,$

etc., from which we deduce the relations

$f_1 = Kf_1 - Pf_0, \qquad f_2 = KKf_2 - (KP + PK)f_1 + PPf_0,$

$f_3 = KKKf_3 - (KKP + KPK + PKK)f_2 + (KPP + PKP + PPK)f_1 - PPPf_0,$

etc. In general,

(61)  $f_\nu = Q_{\nu 0}f_\nu + Q_{\nu 1}f_{\nu-1} + \cdots + Q_{\nu\nu}f_0,$

the functions $Q_{\nu\mu}(x, y)$ being "polynomials" of $\nu$-th degree in the functions
$P(x, y)$, $K(x, y)$. Since $P$ and $K$ both belong to $B^{n-2}$, there exists by Lemma 4
a $\nu \ge 1$ such that the functions $Q_{\nu\mu}$ belong to $B^0$. We choose a $\nu$ with this
property and hold it fast. Since $Jf$ is a bounded operator, there exists an $a > 0$
such that for every $f \in H$

$\|f_\mu(x)\| \le a\,\|f\| \qquad (\mu = 0, 1, \dots, \nu).$

Therefore, by relations (61) and (48), there exists an $A > 0$ such that

$|f_\nu(x)| \le A \cdot \|f\|.$

This proves (59). Raising the exponent $\nu$ by 1, we can also assume (for some
other $A > 0$)

$|f_{\nu-1}(x)| \le A \cdot \|f\|.$







Relation (60) follows now from

$f_\nu = Kf_\nu - Pf_{\nu-1}$

by Lemma 2.
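The passage from the first-order relations to the "polynomial" relations in the proof of Lemma 11 is a direct substitution. As an illustration, the second-degree relation arises as follows, where products such as $KP$ denote successive application of the operators in the order written:

```latex
% Starting relations (from (50)): f_1 = K f_1 - P f_0 and f_2 = K f_2 - P f_1.
% Substitute them into the right side of the second relation:
\[
  f_2 = K(Kf_2 - Pf_1) - P(Kf_1 - Pf_0)
      = KKf_2 - (KP + PK)\,f_1 + PPf_0 .
\]
```

Each further substitution raises the degree by one, so that after $\nu$ steps every kernel is a $\nu$-fold convolution of kernels with singularity exponent $n - 2$.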
Theorem III. If $t(x)$ is any function on $S$ belonging to $C_\infty$, corresponding
to any $\epsilon > 0$ there exists an analytic function $\psi^\epsilon(x)$ on $S$ such that

(62)  $|t(x) - \psi^\epsilon(x)| \le \epsilon, \qquad |\text{grad}\, [t(x) - \psi^\epsilon(x)]| \le \epsilon.$

Proof. Applying the operator $\Delta^c$ $\nu$ times to the function $t(x)$, we obtain a
function $\varphi(x)$ for which $t(x) = J_\nu\varphi(x)$. By Theorem II there exists an analytic
function $\varphi^\epsilon(x)$ such that $\|\varphi - \varphi^\epsilon\| \le \epsilon / A$. The function $\psi^\epsilon(x) = J_\nu\varphi^\epsilon(x)$
is again analytic by Lemma 10, and relations (62) hold by Lemma 11, if applied
to the function $f(x) = \varphi(x) - \varphi^\epsilon(x)$.
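The chain of inequalities behind (62) is short enough to display. It uses only the linearity of the iterated inverse operator and the bounds of Lemma 11 with the constant $A$:

```latex
\[
  |t(x) - \psi^{\epsilon}(x)|
  = |J_\nu(\varphi - \varphi^{\epsilon})(x)|
  \;\le\; A\,\|\varphi - \varphi^{\epsilon}\|
  \;\le\; A \cdot \frac{\epsilon}{A} = \epsilon .
\]
% The same chain with grad J_\nu in place of J_\nu, using (60) instead of (59),
% gives the second inequality of (62).
```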

Theorem III completes the proof of the

Mapping Theorem. Any compact analytic Riemann space $S_n$ can be mapped
topologically-analytically into the Euclidean $E_{2n+1}$.


Part V. Tensors 


If we use the mapping theorem it can be easily shown that not only the Hilbert
space of scalars on $S$ has an analytic basis but that the same is true for the
Hilbert space $H$ of tensors of any given rank. As an illustration we shall
consider tensors of rank 3, $\varphi = \varphi_{\alpha\beta\gamma}$. The inner product is, of course, defined by

(63)  $(\varphi, \psi) = \int \varphi_{\alpha\beta\gamma}\, \psi^{\alpha\beta\gamma}\, dv = \int \varphi^{\alpha\beta\gamma}\, \psi_{\alpha\beta\gamma}\, dv.$

Given a tensor $\varphi_{\alpha\beta\gamma}$, we form with the functions (1) the scalars

$A(\rho, \sigma, \tau; x) = \varphi^{\alpha\beta\gamma}(x)\, \dfrac{\partial t_\rho}{\partial x_\alpha}\, \dfrac{\partial t_\sigma}{\partial x_\beta}\, \dfrac{\partial t_\tau}{\partial x_\gamma}$

($\rho, \sigma, \tau = 1, 2, \dots, 2n + 1$). Conversely, in every coördinate neighborhood
on $S$ the tensor components $\varphi^{\alpha\beta\gamma}$ can be expressed as linear combinations of the
functions $A(\rho, \sigma, \tau; x)$, the coefficients being rational functions of the partial
derivatives $\partial t_\rho / \partial x_\alpha$. Since the functions $A(\rho, \sigma, \tau; x)$ are limits in square mean of
analytic functions, the tensor $\varphi_{\alpha\beta\gamma}$ is likewise a limit, in square mean, of analytic
tensors.

Another more illuminating way of treating tensors is to set up a Laplacian
for them and to investigate it as in the case of scalars. The definition of the
Laplacian is

$\Delta\varphi = g^{\rho\sigma}\, \varphi_{\alpha\beta\gamma,\rho,\sigma}.$













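Putting

$\psi\Delta\varphi = g^{\rho\sigma}\, \psi^{\alpha\beta\gamma}\, \varphi_{\alpha\beta\gamma,\rho,\sigma},$

$\psi\ \text{grad}\ \varphi = g^{\rho\sigma}\, \psi^{\alpha\beta\gamma}\, \varphi_{\alpha\beta\gamma,\rho},$

$\nabla(\psi, \varphi) = g^{\rho\sigma}\, \psi^{\alpha\beta\gamma}{}_{,\rho}\, \varphi_{\alpha\beta\gamma,\sigma},$

we have the formal identity

$\psi\Delta\varphi = \text{div}\,(\psi\ \text{grad}\ \varphi) - \nabla(\psi, \varphi)$

which leads again to (36), (37), (38). In order to have an analogue to (39)
and (40), the quantity $P(x, y)$ must be a tensor of rank 3 in both the variables
$x$ and $y$ separately. Everything goes through strictly analogously if we replace
definition (15) by

$P(x, y) \equiv P_{\alpha\beta\gamma;\lambda\mu\nu}(x, y) = \dfrac{\gamma\,\Phi_\rho[R(x, y)]}{R(x, y)^{n-2}}\, \dfrac{\partial^2\Omega}{\partial x_\alpha\, \partial y_\lambda}\, \dfrac{\partial^2\Omega}{\partial x_\beta\, \partial y_\mu}\, \dfrac{\partial^2\Omega}{\partial x_\gamma\, \partial y_\nu}.$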


The analogy is immediate up to Lemma 10. This lemma is to be replaced by
the following generalization.

Lemma 12. If in a system of equations

(64)  $g_{ij}\, \dfrac{\partial^2\varphi_\mu}{\partial x_i\, \partial x_j} + b_{\mu\lambda i}\, \dfrac{\partial\varphi_\lambda}{\partial x_i} + c_{\mu\lambda}\,\varphi_\lambda = f_\mu$

($i, j = 1, \dots, n$; $\lambda, \mu = 1, \dots, m$; $n$ and $m$ are arbitrary integers) all given functions
are analytic, then every solution $\varphi_1, \dots, \varphi_m$ which belongs to $C_2$ is also
analytic.

Proof. We introduce auxiliary variables $t_1, \dots, t_m$. Putting $\Phi(x, t) = t_\mu\varphi_\mu(x)$
and multiplying (64) by $t_\mu$, and summing with respect to $\mu$, we obtain

(65)  $g_{ij}\, \dfrac{\partial^2\Phi}{\partial x_i\, \partial x_j} + b_{\mu\lambda i}\, t_\mu\, \dfrac{\partial^2\Phi}{\partial x_i\, \partial t_\lambda} + c_{\mu\lambda}\, t_\mu\, \dfrac{\partial\Phi}{\partial t_\lambda} = t_\mu f_\mu.$

Since $\dfrac{\partial^2\Phi}{\partial t_\lambda\, \partial t_\mu} = 0$, we can add to the left side of (65) the term $\delta_{\lambda\mu}\, \dfrac{\partial^2\Phi}{\partial t_\lambda\, \partial t_\mu}$. If we
consider $\Phi(x, t)$ as a function in the $n + m$ variables $x_1, \dots, x_n; t_1, \dots, t_m$,
relation (65) has the form (56), and Lemma 10 applies for sufficiently small
values of $t$, since for small values of $t$ the matrix of the coefficients of the second
derivatives is again positive definite. Therefore $\Phi(x, t)$ is analytic, and so are
the functions $\varphi_1(x), \dots, \varphi_m(x)$.


PRINCETON UNIVERSITY. 



















POLAR CORRESPONDENCE WITH RESPECT TO A CONVEX REGION 
By Fritz John


We denote as polar correspondence (abbreviated P.C.) with respect to a
convex region $R$ in projective $n$-dimensional space $\pi_n$ any one-to-one
correspondence of the points of $R$ and the hyperplanes outside $R$. The study of
such a correspondence is essentially the study of a contragredient vector with
special consideration of the convexity of the domain of definition. In §1 of
this paper the representation of a P.C. in homogeneous coördinates is discussed.
A P.C. is called positive if a point and its polar plane are not separated by any
other point and its polar plane. In §2 it is proved that every positive P.C. is
continuous. §§3-4 deal with symmetric P.C.'s; a P.C. is called symmetric if
in the neighbourhood of every point $P$ it is approximated by an ordinary polar
correspondence with respect to a quadric, denoted as the tangential quadric in $P$.
A positive symmetric P.C. may be generated by a convex hypersurface in
$(n + 1)$-dimensional space $\pi_{n+1}$ in such a way that the line joining any point
$Q$ of the surface to a fixed point of $\pi_{n+1}$ and the tangential plane in $Q$ intersect
$\pi_n$ in a point and its polar respectively.

In the remainder of the paper a general class of P.C.'s is discussed, which are
generated by continuous positive mass distributions on $R$. Given any hyperplane
$p$ outside $R$, the pole of $p$ shall be that point which becomes the center
of mass of $R$ with the given mass distribution, in case $p$ is chosen as plane at
infinity. In §4 it is proved that a P.C. generated in this way is always positive
and symmetric. In §§5-6 it is shown that the tangential quadric at a point
$P$ is identical with Legendre's ellipsoid of inertia of $R$ if the polar of $P$ is plane
at infinity. Moreover, some inequalities involving $R$ and its tangential quadrics
are given.¹


1. Let $R$ be an open convex region in projective $n$-dimensional space $\pi_n$;
i.e., an open set with the following properties: 

(1) If P and Q are points of R, one of the two straight line segments bounded 
by P and Q belongs to R; 

(2) If S denotes the set of points which are neither points of R nor boundary 
points of R, there is at least one hyperplane in S. 

DEFINITION. A one-to-one correspondence between the points of R and 


Received October 23, 1936. 

1 I am indebted to the referee for pointing out that the methods used in §1 are closely
related to those used by Steinitz in his paper Bedingt konvergente Reihen und konvexe
Systeme in Crelle's Journal; cf. in particular vol. 146, p. 32 et seq., where Steinitz deals with
convex regions in projective space. Our sets $\rho$ and $\bar\rho$ appear there as number sets $A$ and
$-A$, and theorems corresponding to our Theorems 1.7 and 1.8 are given.


355 








356 FRITZ JOHN 


the hyperplanes of $S$ is called a polar correspondence (abbreviated P.C.) with
respect to $R$. If a point $P$ and a hyperplane $p$ are corresponding elements,
then $P$ will be called the pole of $p$ and $p$ the polar of $P$.

Let $\pi_n$ be referred to homogeneous coördinates $x_1, \dots, x_{n+1}$. Let $\rho$ denote
the set of ordered sets $x = (x_1, x_2, \dots, x_{n+1})$ corresponding to points of $R$,
and let $\sigma$ denote the set of ordered sets $u = (u_1, u_2, \dots, u_{n+1})$ corresponding to
hyperplanes $u_\alpha x_\alpha = 0$ contained in $S$.²

For every $x \in \rho$ our P.C. uniquely determines, but for a common factor,
a $u \in \sigma$; we assume in what follows that this factor is chosen in such a way that
for a point $x$ and its polar $u$ the relation

(A)  $u_\alpha x_\alpha = 1$

holds. This is possible, as obviously $u_\alpha x_\alpha \ne 0$. The $u_\alpha$ then become uniquely
determined functions $u_\alpha = u_\alpha(x)$ of $x = (x_1, \dots, x_{n+1})$ for $x \in \rho$ in the given
P.C.

The following properties of the $u_\alpha$ follow immediately from our assumptions:

1.1. $u_\alpha(x_1, \dots, x_{n+1})$ is homogeneous of degree $-1$.

1.2. For every $u \in \sigma$ the equations $u_\alpha(x) = u_\alpha$ ($\alpha = 1, \dots, n + 1$) have a
unique solution $x \in \rho$.

1.3. If a new coördinate system is introduced by $x_\alpha = a_{\alpha\beta}X_\beta$ the $u_\alpha$ undergo the
contragredient transformation $U_\beta = a_{\alpha\beta}u_\alpha$.

1.4. If $x \in \rho$ and $y \in \rho$, then $u_\alpha(x)\,y_\alpha \ne 0$.

The convexity of $R$ finds its expression in the following theorems.

1.5. If $x \in \rho$, $y \in \rho$, the equation

$u_\alpha(z)(\lambda x_\alpha + \mu y_\alpha) = 0$

has either for no $z$ of $\rho$ or for every $z$ of $\rho$ a solution with $\lambda > 0$, $\mu > 0$.

Proof. $\lambda x + \mu y$ gives for $\lambda > 0$, $\mu > 0$ one of the two segments bounded by
$x$ and $y$; according as this segment is contained in $R$ or not it will be intersected
by no or by every hyperplane contained in $S$.

1.6. For $x, y, z \in \rho$,

(1)  $u_\alpha(z)x_\alpha \cdot u_\beta(x)y_\beta \cdot u_\gamma(y)z_\gamma > 0;$

in particular for $y = z$,

$u_\alpha(y)x_\alpha \cdot u_\beta(x)y_\beta > 0.$

Proof. Let $x$ and $y$ be such that

$u_\alpha(z)(\lambda x_\alpha + \mu y_\alpha) \ne 0$

for all $z \in \rho$ and $\lambda > 0$, $\mu > 0$. As $u_\alpha(z)(\lambda x_\alpha + \mu y_\alpha)$ represents a continuous
function of $\lambda$ and $\mu$, it will have the same sign for $\lambda = 0$, $\mu = 1$ as for $\lambda = 1$,
$\mu = 0$. Thus

(2)  $u_\alpha(z)x_\alpha \cdot u_\beta(z)y_\beta > 0$


² Throughout this paper the summation convention is used, the letters $\alpha, \beta, \gamma, \dots$ ranging
over $1, \dots, n + 1$, the letters $i, k, l, \dots$ ranging over $1, \dots, n$.

















POLAR CORRESPONDENCE 357


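for all $z \subset \rho$. In particular, for $z = x$ or $z = y$,

(3) $u_\alpha(x) y_\alpha > 0$, $u_\beta(y) x_\beta > 0$;
hence
(4) $u_\alpha(x) y_\alpha \cdot u_\beta(y) x_\beta > 0$.

If, on the other hand, $x$ and $y$ are such that
$u_\alpha(z)(\lambda x_\alpha + \mu y_\alpha) = 0$
for every $z \subset \rho$ and some $\lambda > 0$, $\mu > 0$, we may conclude similarly that for every
$z \subset \rho$
(2′) $u_\alpha(z) x_\alpha \cdot u_\beta(z) y_\beta < 0$,
(3′) $u_\alpha(x) y_\alpha < 0$, $u_\beta(y) x_\beta < 0$,
so that (4) holds. As (4) is proved for every two elements of $\rho$, we also have
$u_\beta(z) y_\beta \cdot u_\gamma(y) z_\gamma > 0$.
Thus from (2), (3), (4) or from (2′), (3′), (4) follows (1).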


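DEFINITION. Let $x^0$ be a fixed element of $\rho$. By $\mathfrak{p}$ we denote the subset of
all elements $y$ of $\rho$ for which

$u_\alpha(x^0) y_\alpha > 0$.

Obviously for every $y \subset \rho$ either $y \subset \mathfrak{p}$ or $-y \subset \mathfrak{p}$. It follows immediately
from 1.6 that
1.7. If $x \subset \mathfrak{p}$ and $y \subset \mathfrak{p}$, then
$u_\alpha(x) y_\alpha > 0$.

Thus $\mathfrak{p}$ may just as well be generated by any other one of its elements as by $x^0$.
1.8. If $x \subset \mathfrak{p}$ and $y \subset \mathfrak{p}$, then $\lambda x + \mu y \subset \mathfrak{p}$ for $\lambda > 0$, $\mu > 0$.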
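1.9. Let $x^\nu \subset \rho$ for $\nu = 1, 2, \dots$ and $\lim_{\nu \to \infty} x^\nu = x \subset \rho$.
Then there exists a subsequence of the $u(x^\nu)$ which converges to some $u \neq 0$.
Proof. There is certainly a subsequence of the $u(x^\nu)$ for which

$v^\nu = u(x^\nu) \left[\sum_\alpha u_\alpha^2(x^\nu)\right]^{-1/2}$

converges toward some $v \neq 0$. As $u_\alpha(x^\nu) x^\nu_\alpha = 1$ ($\nu$ not summed), we have for
this subsequence

$v^\nu_\alpha x^\nu_\alpha = \left[\sum_\alpha u_\alpha^2(x^\nu)\right]^{-1/2}$, $v_\alpha x_\alpha = \lim_{\nu \to \infty} \left[\sum_\alpha u_\alpha^2(x^\nu)\right]^{-1/2}$.

The limit on the right side must be $\neq 0$, for the plane $v_\alpha y_\alpha = 0$ cannot contain
an interior point of $R$, as it is the limit of exterior planes of $R$. Thus $\left[\sum_\alpha u_\alpha^2(x^\nu)\right]^{1/2}$
converges for that subsequence towards a limit $\neq 0$, and therefore the
corresponding $u(x^\nu)$ converge toward some $u \neq 0$.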

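1.10. $\mathfrak{p}$ is an open set.

Proof. If $x \subset \mathfrak{p}$ and $x = \lim_{\nu \to \infty} x^\nu$, then $\lim_{\nu \to \infty} u_\alpha(x) x^\nu_\alpha = u_\alpha(x) x_\alpha = 1$. Thus
$u_\alpha(x) x^\nu_\alpha > 0$ for sufficiently large $\nu$; i.e., $x^\nu \subset \mathfrak{p}$ for sufficiently large $\nu$.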


2. DEFINITION. The P.C. is called positive if for any $x \subset \rho$, $y \subset \rho$ and
$x \neq y$
(6) $u_\alpha(x) y_\alpha \cdot u_\beta(y) x_\beta > 1$.

(Compare with 1.6.)

Let $P$ and $Q$ be the points of $R$ corresponding to $x$ and $y$; let $P'$ and $Q'$ be
the points of intersection of $PQ$ with the polars of $P$ and $Q$, respectively. Then
the cross ratio $(PP'/QQ')$ is given by

$1 - \dfrac{1}{u_\alpha(y) x_\alpha \cdot u_\beta(x) y_\beta}$.

Thus (6) means that the pair of points $PP'$ is not separated by the pair of points
$QQ'$.
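The cross-ratio formula can be checked on the simplest case, the ordinary polarity of a conic. The following is a minimal sketch under stated assumptions, not part of the original paper: $R$ is taken as the interior of the unit circle, written homogeneously as $x_3^2 - x_1^2 - x_2^2 > 0$; both points lie on the $x_1$-axis with nonzero abscissas; and the cross ratio is taken in the convention $(PP'/QQ') = \frac{(P-Q)(P'-Q')}{(P-Q')(P'-Q)}$. The function names are illustrative only.

```python
from fractions import Fraction

def polar(x):
    """Normalized polar u(x) for the conic x3^2 - x1^2 - x2^2 = 0, so that u(x).x = 1."""
    q = x[2] ** 2 - x[0] ** 2 - x[1] ** 2
    return (-x[0] / q, -x[1] / q, x[2] / q)

def dot(u, x):
    return sum(a * b for a, b in zip(u, x))

def cross_ratio_formula(p, q):
    """1 - 1/(u(y).x * u(x).y) for the points x = (p, 0, 1) and y = (q, 0, 1)."""
    x = (p, Fraction(0), Fraction(1))
    y = (q, Fraction(0), Fraction(1))
    return 1 - 1 / (dot(polar(y), x) * dot(polar(x), y))

def cross_ratio_direct(p, q):
    """(PP'/QQ') from abscissas; the polar of the point p meets the axis at 1/p."""
    p1, q1 = 1 / p, 1 / q
    return ((p - q) * (p1 - q1)) / ((p - q1) * (p1 - q))

p, q = Fraction(1, 3), Fraction(1, 2)
print(cross_ratio_formula(p, q), cross_ratio_direct(p, q))
```

Both computations agree, and the value lies between 0 and 1, which is the non-separation expressed by (6).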
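2.1. In a positive P.C. the functions $u_\alpha(x_1, \dots, x_{n+1})$ are continuous.
Proof. Let $x^\nu \subset \rho$ for $\nu = 1, 2, \dots$ and let $\lim_{\nu \to \infty} x^\nu = x \subset \rho$. We have to
prove that $\lim_{\nu \to \infty} u(x^\nu) = u(x)$. According to 1.9 a subsequence of the $u(x^\nu)$ will
converge towards some $v \neq 0$. We restrict ourselves to the consideration of
that subsequence, which may again be denoted by $u(x^\nu)$. Our statement will
be proved if $v = u(x)$. Let us assume that $\lim_{\nu \to \infty} u(x^\nu) = v \neq u(x)$. There are
two possible cases: (a) $v \subset \sigma$, (b) $v$ is on the boundary of $\sigma$; i.e., corresponds to
a plane of support of $R$.

We deal first with the case (a), where $\lim_{\nu \to \infty} u(x^\nu) = v \subset \sigma$. There is a $y \subset \rho$
such that $v = u(y)$, and $y \neq x$, since $u(y) \neq u(x)$. Without restriction of
generality we may assume that $x \subset \mathfrak{p}$. Then according to 1.10 all but a finite
number of the $x^\nu$, which may be neglected, are contained in $\mathfrak{p}$. From $\lim_{\nu \to \infty} x^\nu = x$,
$\lim_{\nu \to \infty} u(x^\nu) = u(y)$, and $u_\alpha(x^\nu) x^\nu_\alpha = 1$, it follows that

$u_\alpha(y) x_\alpha = 1$;

therefore $y \subset \mathfrak{p}$. Moreover, for any $z \subset \mathfrak{p}$,

$u_\alpha(z) x^\nu_\alpha \cdot u_\beta(x^\nu) z_\beta \geq 1$;

thus for $\nu \to \infty$,

$u_\alpha(z) x_\alpha \cdot u_\beta(y) z_\beta \geq 1$.

Let $z = \mu x + \lambda y$, where $\lambda > 0$, $\mu > 0$. Then $z \subset \mathfrak{p}$ and

(7) $1 \leq [u_\alpha(\mu x + \lambda y) x_\alpha][u_\beta(y)(\lambda y_\beta + \mu x_\beta)] = [u_\alpha(\mu x + \lambda y) x_\alpha](\mu + \lambda)$.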











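On the other hand,

$u_\alpha(\mu x + \lambda y) x_\alpha = \dfrac{1}{\mu} u_\alpha(\mu x + \lambda y)(\mu x_\alpha + \lambda y_\alpha) - \dfrac{\lambda}{\mu} u_\alpha(\mu x + \lambda y) y_\alpha$

$= \dfrac{1}{\mu} - \dfrac{\lambda}{\mu} u_\alpha(\mu x + \lambda y) y_\alpha$

$< \dfrac{1}{\mu} - \dfrac{\lambda}{\mu} \cdot \dfrac{1}{u_\alpha(y)(\mu x_\alpha + \lambda y_\alpha)}$,

by use of (6) and the fact that $u_\alpha(y)(\mu x_\alpha + \lambda y_\alpha) > 0$, since $z$ and $y \subset \mathfrak{p}$. As
$u_\alpha(y) x_\alpha = 1$ and $u_\alpha(y) y_\alpha = 1$, the right member is

$\dfrac{1}{\mu} - \dfrac{\lambda}{\mu} \cdot \dfrac{1}{\mu + \lambda} = \dfrac{1}{\mu + \lambda}$.

But the last result contradicts (7).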
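In case (b), we have $\lim_{\nu \to \infty} u(x^\nu) = v$, where $v_\alpha y_\alpha = 0$ is the equation of a plane
of support of $R$ (or of $\rho$). Then there exists a $y$ on the boundary of $\rho$ such that
$v_\alpha y_\alpha = 0$; $y$ may be chosen in such a way that $u_\alpha(x) y_\alpha > 0$. Then for $\lambda > 0$,
$\mu > 0$,

$u_\alpha(x)(\mu x_\alpha + \lambda y_\alpha) = \mu + \lambda u_\alpha(x) y_\alpha > 0$,

i.e., $\mu x + \lambda y \subset \mathfrak{p}$ for all positive $\mu$, $\lambda$. Then

$u_\alpha(\mu x + \lambda y)(\mu_1 x_\alpha + \lambda_1 y_\alpha) > 0$,

if $\mu_1$ and $\lambda_1$ are positive. Thus for $\mu_1 \to 0$ we have

(8) $u_\alpha(\mu x + \lambda y) y_\alpha > 0$,

since $u_\alpha(\mu x + \lambda y) y_\alpha = 0$ is not possible. On the other hand, it follows from

$u_\alpha(\mu x + \lambda y) x^\nu_\alpha \cdot u_\beta(x^\nu)(\mu x_\beta + \lambda y_\beta) \geq 1$

for $\nu \to \infty$ that

$u_\alpha(\mu x + \lambda y) x_\alpha \cdot (\mu x_\beta + \lambda y_\beta) v_\beta \geq 1$.

As $v_\alpha x_\alpha = 1$, $v_\alpha y_\alpha = 0$, we have

$u_\alpha(\mu x + \lambda y) \mu x_\alpha \geq 1$;

subtracting this from $u_\alpha(\mu x + \lambda y)(\mu x_\alpha + \lambda y_\alpha) = 1$, we obtain

$u_\alpha(\mu x + \lambda y) y_\alpha \leq 0$,

in contradiction to (8).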
3. Let the P.C. be positive. Let

(9) $U(x, y) = \displaystyle\int_0^1 u_\alpha\big((1 - t)x + ty\big)\,(y_\alpha - x_\alpha)\,dt$.












$U(x, y)$ is defined for $x \subset \mathfrak{p}$ and $y \subset \mathfrak{p}$, as then also $(1 - t)x + ty \subset \mathfrak{p}$ for
$0 < t < 1$, according to 1.8.

3.1. $U(x, y) = -U(y, x)$, $U(\lambda x, \lambda y) = U(x, y)$.
DEFINITION. A positive P.C. is called symmetric, if for all $x, y, z \subset \mathfrak{p}$

(10) $U(x, y) + U(y, z) + U(z, x) = 0$.

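3.2. In a positive symmetric P.C.

$\dfrac{\partial U(x, y)}{\partial y_\alpha} = u_\alpha(y)$.

Proof. From (10) we obtain for $z_\beta = y_\beta + \delta_{\alpha\beta} h$

$U(x, z) - U(x, y) = U(y, z) = \displaystyle\int_0^1 u_\alpha\big(y + (z - y)t\big)\,h\,dt$

or

$\displaystyle\lim_{h \to 0} \frac{U(x, z) - U(x, y)}{h} = \lim_{h \to 0} \int_0^1 u_\alpha\big(y + (z - y)t\big)\,dt = u_\alpha(y)$.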
3.3. A positive P.C. is symmetric if there exists a function $u(x_1, \dots, x_{n+1}) > 0$
defined in $\mathfrak{p}$ such that

(11) $\dfrac{\partial \log u}{\partial x_\alpha} = u_\alpha$.


Proof. If there is a function $u$ satisfying (11), then

(12) $U(x, y) = \log \dfrac{u(y)}{u(x)}$,

and (10) will be obviously satisfied. If on the other hand (10) holds, the function
$u(y)$ defined by

(13) $u(y) = e^{U(x, y)} u(x)$ ($u(x) > 0$),

where $x$ is some point of $\mathfrak{p}$, will satisfy (11) according to 3.2, and will be positive.
It follows from (12) that all solutions $u$ of (11) can only differ by a constant
factor and are given by (13).
3.4. The function $u(x)$ is homogeneous of degree one.
Proof. Let $u$ be given by (13). According to (10) for $\lambda > 0$,

$u(\lambda y) = e^{U(x, \lambda y)} u(x) = e^{U(x, y) + U(y, \lambda y)} u(x) = u(y)\, e^{U(y, \lambda y)}$.

Now from 1.1 and (9) it is easily deduced that $U(y, \lambda y) = \log \lambda$.
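Relations (9), (10), (12) and the identity $U(y, \lambda y) = \log \lambda$ can be tried numerically on the ordinary polarity of a conic. The sketch below is an illustration under assumptions not made in the paper: $R$ is the interior of the unit circle, so that $u_\alpha(x) = A_{\alpha\alpha} x_\alpha / q(x)$ with $q(x) = x_3^2 - x_1^2 - x_2^2$, for which $u(x) = \sqrt{q(x)}$ satisfies (11); the integral in (9) is approximated by Simpson's rule; all names are illustrative.

```python
import math

A = (-1.0, -1.0, 1.0)  # diagonal of the form q(x) = x3^2 - x1^2 - x2^2

def q(x):
    return sum(a * xi * xi for a, xi in zip(A, x))

def u(x):
    """Normalized polar of x: u(x).x = 1, homogeneous of degree -1."""
    qx = q(x)
    return tuple(a * xi / qx for a, xi in zip(A, x))

def U(x, y, steps=400):
    """Composite Simpson approximation of the line integral (9)."""
    d = tuple(yi - xi for xi, yi in zip(x, y))

    def f(t):
        z = tuple(xi + t * di for xi, di in zip(x, d))
        return sum(ua * da for ua, da in zip(u(z), d))

    h = 1.0 / steps
    total = f(0.0) + f(1.0)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(k * h)
    return total * h / 3.0

def potential(x):
    """u(x) = sqrt(q(x)), a solution of (11) for this example."""
    return math.sqrt(q(x))

x, y, z = (0.0, 0.0, 1.0), (0.5, 0.2, 1.0), (-0.3, 0.4, 2.0)
print(U(x, y), math.log(potential(y) / potential(x)))  # the two sides of (12)
print(U(x, y) + U(y, z) + U(z, x))                     # left side of (10), close to 0
print(U(y, tuple(2 * c for c in y)), math.log(2.0))    # U(y, 2y) against log 2
```

In this example the potential is homogeneous of degree one, as 3.4 asserts for every symmetric P.C.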

Let $\pi_{n+1}$ denote the projective $x_1 \cdots x_{n+1}\, y$-space and let $Y$ be the point
$x_1 = x_2 = \cdots = x_{n+1} = 0$ and $\pi_n$ the plane $y = 0$. Then the equation $y =
u(x_1, \dots, x_{n+1})$ for $x \subset \mathfrak{p}$ defines, according to 3.4, a certain surface $\Sigma$ in $\pi_{n+1}$.
As $u$ has continuous derivatives, $\Sigma$ has a continuously changing tangent plane
at every point. Moreover, for every point $Q$ of $\Sigma$ the line $YQ$ intersects $y = 0$
in a point of $R$.
3.5. If $Q$ is a point of $\Sigma$, then the point of intersection of the line $YQ$ with $\pi_n$ and
the $(n - 1)$-flat of intersection of the tangent plane in $Q$ with $\pi_n$ are respectively
pole and polar in our P.C.

Proof. The intersection of the tangent plane in $Q$ with $y = 0$ is given by

$\dfrac{\partial u}{\partial x_\alpha} y_\alpha = 0$,

or according to (11) by $u_\alpha(x) y_\alpha = 0$.
3.6. $\Sigma$ is a convex surface with only regular points and planes of support.³

Proof. Let $P_1$ and $P_2 \subset R$. The 2-flat through $P_1 P_2 Y$ intersects $\Sigma$ along
some curve $L$. From the geometrical interpretation of the positiveness of the
P.C. (cf. the definition, §2) it follows that $L$ is a convex curve; for if $P_1'$, $P_2'$ are
the points of intersection of the line $P_1 P_2$ with the tangents of $L$ in the points of
intersection of $P_1 Y$ and $P_2 Y$ with $L$, then $P_1 P_1'$ and $P_2 P_2'$ do not separate each
other. $\Sigma$ is a convex surface, as it is intersected in a convex curve by every
2-flat through $Y$. The regularity of the points and planes of support of $\Sigma$
follows immediately from the one-to-one character of the P.C.

The following inverse statement is easily proved: 

3.7. Any surface $\Sigma$ in $\pi_{n+1}$ will generate a positive, symmetric P.C. with respect
to the convex region $R$ in $\pi_n$, in the manner of 3.5, if

(a) for every point $Q$ of $\Sigma$ the line $YQ$ intersects $\pi_n$ in a point of $R$;

(b) $\Sigma$ together with $R$ and the boundary of $R$ forms the boundary of a convex
region in $\pi_{n+1}$;

(c) $\Sigma$ has no points in common with $\pi_n$;

(d) $\Sigma$ has only regular points and planes of support.


4. Throughout this section we assume that the P.C. is positive and that the
functions $u_\alpha(x)$ have continuous first derivatives. We put

$\dfrac{\partial u_\alpha}{\partial x_\beta} = u_{\alpha\beta}(x)$.

DEFINITION. The correlation in which a point $\xi$ corresponds to the hyperplane
given by

(14) $[u_{\alpha\beta}(x) + 2u_\alpha(x) u_\beta(x)]\, \xi_\alpha \eta_\beta = 0$

will be denoted as the tangential correlation at $x$. If we write

(15) $v_\alpha(\xi) = \dfrac{[u_{\beta\alpha}(x) + 2u_\alpha(x) u_\beta(x)]\, \xi_\beta}{[u_{\gamma\delta}(x) + 2u_\gamma(x) u_\delta(x)]\, \xi_\gamma \xi_\delta}$,


³ Regular points are points through which only one plane of support passes; regular
planes of support are planes containing only one point of contact. Cf. the definition in
Bonnesen and Fenchel, Konvexe Körper, pp. 13-15.








the plane $v_\alpha(\xi) \eta_\alpha = 0$ corresponds to the point $\xi$ in the tangential correlation
and $v_\alpha(\xi) \xi_\alpha = 1$.
4.1. (a) $v_\alpha(x) = u_\alpha(x)$, i.e., the polar of $x$ is the same for the P.C. and the
tangential correlation at $x$;
(b) $v_\alpha(\xi) x_\alpha = 0$ implies $0 = v_\alpha(x) \xi_\alpha$, i.e., in the tangential correlation the
poles of planes through $x$ lie on the polar of $x$;
(c) $v_{\alpha\beta}(x) = \left(\dfrac{\partial v_\alpha}{\partial \xi_\beta}\right)_{\xi = x} = u_{\alpha\beta}(x)$, i.e., the tangential correlation gives the best
approximation to the P.C. in the neighbourhood of $x$.


Proof. From 1.1 and (A) it follows that
$u_{\alpha\beta}(x) x_\alpha = u_{\beta\alpha}(x) x_\alpha = -u_\beta(x)$, $u_{\alpha\beta}(x) x_\alpha x_\beta = -u_\beta(x) x_\beta = -1$.


4.2. When $u_\alpha$ has continuous derivatives, the P.C. is symmetrical, if and only if
$u_{\alpha\beta}(x) = u_{\beta\alpha}(x)$.

Proof. $U(x, y)$ is the line integral of the function $u$ over the straight line
segment joining the points $x$ and $y$ of $\mathfrak{p}$.

Since for $u_{\alpha\beta} = u_{\beta\alpha}$ the tangential correlation (14) is the polar correspondence
with respect to a quadric, we see that a P.C. is symmetric if in the neighborhood
of a point it can be approximated by an ordinary polar correspondence with
respect to a quadric. This quadric, which is given by

$[u_{\alpha\beta}(x) + 2u_\alpha(x) u_\beta(x)]\, \xi_\alpha \xi_\beta = 0$,

will be called the tangential quadric at $x$. It follows from 4.1 that
4.3. If $\Sigma$ is the surface in $\pi_{n+1}$ generating a symmetric P.C., the tangential quadric
at a point $P$ of $R$ is the intersection of $\pi_n$ with that $(n + 1)$-dimensional quadric
which (a) has a contact of the second order with $\Sigma$ at the point of intersection of $PY$
with $\Sigma$ and (b) for which $Y$ is the pole of the plane $\pi_n$.
4.4. If the tangential quadric at $x$ in a positive symmetric P.C. is non-degenerate,
it has no point in common with the polar of $x$. (This implies that the tangential
quadric is projectively equivalent to an ellipsoid.)

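Proof. According to (6),

$u_\alpha(\xi) x_\alpha \cdot u_\beta(x) \xi_\beta \geq 1$;

in particular for $\xi = x + dx$, if we neglect terms of higher than the second order
in $dx$,

$1 \leq [u_\alpha(x + dx) x_\alpha][u_\beta(x)(x_\beta + dx_\beta)]$

$= [1 - u_\alpha(x + dx)\, dx_\alpha][1 + u_\beta(x)\, dx_\beta]$

$= [1 - u_\alpha(x)\, dx_\alpha - u_{\alpha\gamma}(x)\, dx_\alpha\, dx_\gamma + \cdots][1 + u_\beta(x)\, dx_\beta]$

$= 1 - [u_\alpha(x)\, dx_\alpha]^2 - u_{\alpha\gamma}(x)\, dx_\alpha\, dx_\gamma + \cdots$.

Consequently for any $dx = \eta$,

$[u_{\alpha\beta}(x) + 2u_\alpha(x) u_\beta(x)]\, \eta_\alpha \eta_\beta \leq [u_\alpha(x) \eta_\alpha]^2$.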



















From this inequality it follows that, provided the quadric

$[u_{\alpha\beta}(x) + 2u_\alpha(x) u_\beta(x)]\, \eta_\alpha \eta_\beta = 0$

is non-degenerate, it can have no points in common with the plane $u_\alpha(x) \eta_\alpha = 0$.


5. Let the convex region $R$ be covered with mass of density $\mu$. We assume
that $\mu$ is a continuous positive function in $R$ and on its boundary, and that $\mu$ is
invariant under collineations.

DEFINITION. Let $p$ be any hyperplane of $S$. If $p$ is taken as plane at
infinity in a non-homogeneous coördinate system, $R$ will have a certain center
of mass $P$ under the given mass distribution. $P$ will be called the pole of mass
of $p$ and $p$ the polar of mass of $P$.

If an ellipsoid or a simplex is covered with homogeneous mass, the pole of 
mass of a plane coincides with the pole in the ordinary definition. 

As a consequence of the well-known theorem that the center of mass of a con- 
vex region with a positive mass distribution lies in the interior of the region, 
and is invariant under affine transformations, we have: 

5.1. Every plane of S has a uniquely determined pole of mass, which is contained 
in R. 

In this and the following paragraphs we shall always refer $R$ to a non-homogeneous
coördinate system $x_1, \dots, x_n$, the plane at infinity being a plane of $S$
and the origin a point of $R$. Let $q_i x_i = 1$ be the equation of a plane $p$ of $S$.
In order to calculate the pole of mass $P$ of $p$, we introduce new non-homogeneous
coördinates

$x_i' = \dfrac{x_i}{1 - q_k x_k}$ ($i = 1, \dots, n$)

such that $p$ becomes plane at infinity in $x'$-space. If $\xi_1', \dots, \xi_n'$ denote the
coördinates of the center of mass of $R$ in $x'$-space referred to $x'$ coördinates and
$\xi_1, \dots, \xi_n$ the coördinates of the same point referred to the $x$-system, we have

$\xi_i' = \dfrac{\int_{R'} \mu\, x_i'\, dx_1' \cdots dx_n'}{\int_{R'} \mu\, dx_1' \cdots dx_n'}$,

or in the $x$-system,

$\dfrac{\xi_i}{1 - q_k \xi_k} = \dfrac{\int_R \mu\, x_i (1 - q_k x_k)^{-1} J\, dx_1 \cdots dx_n}{\int_R \mu\, J\, dx_1 \cdots dx_n}$,

where

$J = \dfrac{\partial(x_1', \dots, x_n')}{\partial(x_1, \dots, x_n)} = \left| \dfrac{\delta_{ik}(1 - q_m x_m) + x_i q_k}{(1 - q_m x_m)^2} \right| = (1 - q_k x_k)^{-n-1}$.


Thus

(16) $\dfrac{\xi_i}{1 - q_k \xi_k} = \dfrac{\int_R \mu\, x_i (1 - q_k x_k)^{-n-2}\, dx_1 \cdots dx_n}{\int_R \mu\, (1 - q_k x_k)^{-n-1}\, dx_1 \cdots dx_n}$.
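The value $J = (1 - q_k x_k)^{-n-1}$ of the functional determinant, on which (16) rests, can be checked by finite differences. The following is a rough numerical sketch for $n = 2$ only; the sample point, the plane coefficients and the step size are arbitrary choices made for illustration, and the function names are hypothetical.

```python
def transform(x, q):
    """The collineation x_i' = x_i / (1 - q_k x_k)."""
    s = 1.0 - sum(qi * xi for qi, xi in zip(q, x))
    return tuple(xi / s for xi in x)

def jacobian_det_2d(x, q, h=1e-6):
    """Central-difference approximation of the 2x2 functional determinant."""
    rows = []
    for k in range(2):
        xp, xm = list(x), list(x)
        xp[k] += h
        xm[k] -= h
        fp, fm = transform(xp, q), transform(xm, q)
        # rows[k][i] approximates the derivative of x_i' with respect to x_k
        rows.append([(fp[i] - fm[i]) / (2.0 * h) for i in range(2)])
    return rows[0][0] * rows[1][1] - rows[0][1] * rows[1][0]

def jacobian_closed_form(x, q):
    """(1 - q_k x_k)^(-n-1) with n = len(x)."""
    s = 1.0 - sum(qi * xi for qi, xi in zip(q, x))
    return s ** (-(len(x) + 1))

x, q = (0.2, -0.1), (0.5, 0.3)
print(jacobian_det_2d(x, q), jacobian_closed_form(x, q))
```

The determinant of the transposed matrix is the same, so the ordering of rows and columns in the sketch does not matter.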


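Let us write

(17) $F(q_1, \dots, q_n) = \left( \displaystyle\int_R \mu\, (1 - q_i x_i)^{-n-1}\, dx_1 \cdots dx_n \right)^{-\frac{1}{n+1}}$.

If $F_i$ stands for $\partial F / \partial q_i$, (16) may be written

(18) $\dfrac{\xi_i}{1 - q_k \xi_k} = -\dfrac{F_i}{F}$ ($i = 1, \dots, n$).

Solving for $\xi_i$, we finally obtain

(19) $\xi_i = -\dfrac{F_i}{F - q_k F_k}$.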


$F(q_1, \dots, q_n)$ is defined for all $(q_1, \dots, q_n)$ for which $q_i x_i < 1$ for all
$(x_1, \dots, x_n) \subset R$; i.e., for all $q$ contained in the polar region $R'$ of $R$ with respect
to the unit sphere. $F$ gives essentially the mass of $R$ in any coördinate system.
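Formulas (17) and (19) can be tried on the simplest case. The sketch below assumes $n = 1$, $R = (-1, 1)$ and constant density $\mu = 1$ (choices made here for illustration only); it evaluates (19) with a numerically integrated $F$ and a central-difference derivative, and compares the result with the center of mass computed directly in the $x'$-system. For this example the pole of mass of the "plane" $qx = 1$ comes out as the point $\xi = q$, which is also its ordinary pole with respect to the point pair $\pm 1$.

```python
def simpson(f, a, b, steps=2000):
    """Composite Simpson rule with an even number of steps."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def F(q, n=1):
    """Formula (17) for R = (-1, 1), n = 1, constant density mu = 1."""
    mass = simpson(lambda x: (1.0 - q * x) ** (-(n + 1)), -1.0, 1.0)
    return mass ** (-1.0 / (n + 1))

def pole_from_F(q, h=1e-5):
    """Formula (19): xi = -F'(q) / (F(q) - q F'(q)), F' by central difference."""
    dF = (F(q + h) - F(q - h)) / (2.0 * h)
    return -dF / (F(q) - q * dF)

def pole_direct(q):
    """Midpoint of the image interval under x' = x / (1 - q x), mapped back to x."""
    lo, hi = -1.0 / (1.0 + q), 1.0 / (1.0 - q)
    c = 0.5 * (lo + hi)
    return c / (1.0 + q * c)

for q in (0.3, -0.6):
    print(q, pole_from_F(q), pole_direct(q))
```

The parameter `q` must satisfy $|q| < 1$, which is the condition $qx < 1$ on $R$ stated above.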
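5.2. $F(q) = F(q_1, \dots, q_n)$ is strictly concave for $q \subset R'$.

Proof. We have to prove that for $0 < \vartheta < 1$ and $q' \neq q''$

$F[\vartheta q' + (1 - \vartheta) q''] > \vartheta F(q') + (1 - \vartheta) F(q'')$.

Let

$a(x) = \dfrac{1 - q_i' x_i}{\mu^{1/(n+1)}}$, $b(x) = \dfrac{1 - q_i'' x_i}{\mu^{1/(n+1)}}$, $c(x) = \dfrac{1 - [\vartheta q_i' + (1 - \vartheta) q_i''] x_i}{\mu^{1/(n+1)}}$.

Then $c(x) = \vartheta a(x) + (1 - \vartheta) b(x)$ and

$F[\vartheta q' + (1 - \vartheta) q''] = \left( \displaystyle\int_R c^{-(n+1)}(x)\, dx_1 \cdots dx_n \right)^{-\frac{1}{n+1}}$

$= \left( \displaystyle\int_R [\vartheta a + (1 - \vartheta) b]^{-(n+1)}\, dx_1 \cdots dx_n \right)^{-\frac{1}{n+1}}$

$> \left( \displaystyle\int_R [\vartheta a]^{-(n+1)}\, dx_1 \cdots dx_n \right)^{-\frac{1}{n+1}} + \left( \displaystyle\int_R [(1 - \vartheta) b]^{-(n+1)}\, dx_1 \cdots dx_n \right)^{-\frac{1}{n+1}}$

$= \vartheta F(q') + (1 - \vartheta) F(q'')$

according to Minkowski's inequality.⁴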
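The concavity asserted in 5.2 can be observed numerically. The sketch below is an illustration under assumptions not taken from the paper: $n = 1$, $R = (-1, 1)$, and a sample non-constant density $\mu(x) = 1 + x^2$ (its invariance under collineations is not modelled); the integral in (17) is approximated by Simpson's rule, and the names are hypothetical.

```python
def simpson(f, a, b, steps=2000):
    """Composite Simpson rule with an even number of steps."""
    h = (b - a) / steps
    total = f(a) + f(b)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * f(a + k * h)
    return total * h / 3.0

def F(q, mu):
    """Formula (17) for n = 1 on R = (-1, 1) with density mu."""
    return simpson(lambda x: mu(x) * (1.0 - q * x) ** -2, -1.0, 1.0) ** -0.5

def concavity_gap(q1, q2, mu, theta=0.5):
    """F(theta q1 + (1 - theta) q2) minus the corresponding chord value."""
    mixed = F(theta * q1 + (1.0 - theta) * q2, mu)
    return mixed - (theta * F(q1, mu) + (1.0 - theta) * F(q2, mu))

def density(x):
    return 1.0 + x * x  # sample density, an assumption of this sketch

print(concavity_gap(-0.5, 0.6, density))        # positive
print(concavity_gap(-0.5, 0.6, density, 0.25))  # positive
```

A positive gap for every pair $q' \neq q''$ and every $0 < \vartheta < 1$ is exactly the inequality proved above.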


⁴ Cf. Hardy, Littlewood, Pólya, Inequalities, p. 146, Theorem 198.






















Obviously the equation $F = F(q_1, \dots, q_n)$ will be the equation of an open,
concave surface $\Sigma'$ in $q_1 \cdots q_n F$-space lying above the region $R'$. Every
parallel to the $F$-axis through a point of $R'$ will intersect $\Sigma'$ in a single point.
As $F$ is strictly concave, only one point of $\Sigma'$ will be on every plane of support
of $\Sigma'$. As $F$ has continuous derivatives, there will pass only one plane of support
through every point of $\Sigma'$. As $\mu > 0$ on the boundary of $R$, $F$ will tend towards
0, if $(q_1, \dots, q_n)$ approaches the boundary of $R'$. Thus $\Sigma'$ together with $R'$
and its boundary will form the boundary of a convex region in $q_1 \cdots q_n F$-space.
Hence through every $(n - 1)$-flat of $F = 0$ outside $R'$ will pass one plane of
support of $\Sigma'$ having a point of contact on $\Sigma'$ (not on its boundary).

Let $\Sigma$ be the polar reciprocal of $\Sigma'$, i.e., the surface consisting of the poles of
the planes of support of $\Sigma'$ with respect to the unit sphere.⁵ $\Sigma$ will be given in
tangential coördinates by the equation

(20) $q_i \xi_i + \eta F(q_1, \dots, q_n) = 1$ ($q \subset R'$),

$\xi_1, \dots, \xi_n, \eta$ denoting the coördinates of a point of a tangential plane of $\Sigma$.
By dualizing the previous statements about $\Sigma'$, we see that $\Sigma$ will be a concave
surface with only regular points and planes of support; through every $(n - 1)$-flat
outside $R$ will pass exactly one plane of support of $\Sigma$ having one point in
common with $\Sigma$. Every parallel to the $\eta$-axis through a point of $R$ will intersect
$\Sigma$, i.e., $\Sigma$ lies above $R$. If $q_i \xi_i = 1$ is any $(n - 1)$-flat $p$ in $\eta = 0$ outside $R$,
the coördinates of the point of contact of the plane of support of $\Sigma$ through $p$
are given by (20) and the equations

(21) $\xi_k + \eta F_k = 0$ ($k = 1, \dots, n$).

From (20) and (21) we find (19).

Accordingly, the projection of the point of contact is the pole of mass of $p$.
Hence the correspondence between poles of mass and polars of mass is generated
by the surface $\Sigma$ in the manner of 3.5, if the point at infinity of the $\eta$-axis is
taken as $Y$. Thus according to 3.7:

5.3. The correspondence between poles of mass and planes of mass is a positive 
symmetric P.C. 

Incidentally we have proved that every point of R is pole of one hyperplane 
in S; in other words, that for every point P of R there is a collineation such that 
the image of P becomes center of mass of the transformed region. This might 
have been more directly concluded from (19) and 5.2 in the following way. 


It is sufficient to prove that the origin $\xi_1 = \xi_2 = \cdots = \xi_n = 0$, which was an
arbitrary point of $R$, is pole of mass of some plane $q_i x_i = 1$. Thus according
to (19) we have to prove that the equations $F_i = 0$ ($i = 1, \dots, n$) have a solution,
i.e., that $F$ is stationary for some $q \subset R'$. Now, as $F$ is a concave positive
function in $R'$ and $F = 0$ on the boundary of $R'$, $F$ will have a maximum in $R'$;
this maximum will be attained at only one point of $R'$, as $F$ is strictly concave.
Thus the origin will be pole of mass of exactly one plane.


5 Cf. Bonnesen and Fenchel, loc. cit., p. 28. 








Let us calculate the tangential quadric of the P.C. (19) at a point $P$ of $R$ with
coördinates $(x_1^0, \dots, x_n^0)$. Let $q_i^0 y_i = 1$ be the equation of the polar $p$ of $P$.
The tangential quadric $Q$ at $P$ is according to §4 characterized by the conditions
that the coördinates $x_i$ of a point and their first derivatives $\partial x_i / \partial q_k$ with respect
to the coördinates $q_k$ of its polar plane have for $q = q^0$ the same values under the
polar correspondence (19) and under the polar correspondence with respect to $Q$.
In order to simplify the calculation we assume that $P$ is origin and $p$ plane
at infinity of our non-homogeneous coördinate system. Then $x_i^0 = 0$, $q_i^0 = 0$.
$Q$ is given in tangential coördinates by

(22) $x_{ik} q_i q_k = 1$.


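For the pole $(x_1, \dots, x_n)$ of a plane $q_i y_i = 1$ with respect to the quadric (22)
is given by

$x_i = x_{ik} q_k$;

hence for $q_1 = q_2 = \cdots = q_n = 0$

$x_i = 0$, $\dfrac{\partial x_i}{\partial q_k} = x_{ik}$.

According to (19) we have for $q_1 = q_2 = \cdots = q_n = 0$ and $x_1 = \cdots = x_n = 0$

$F_i = 0$, $\dfrac{\partial x_i}{\partial q_k} = -\dfrac{1}{F} \dfrac{\partial F_i}{\partial q_k}$.

Thus the tangential quadric is given by

$\dfrac{\partial F_i}{\partial q_k} q_i q_k + F = 0$.

Substituting for $F$ its expression (17), we obtain for $q_i = 0$

$F = \left( \displaystyle\int_R \mu\, dx_1 \cdots dx_n \right)^{-\frac{1}{n+1}}$, $\dfrac{\partial^2 F}{\partial q_i\, \partial q_k} = -(n + 2) F^{n+2} \displaystyle\int_R \mu\, x_i x_k\, dx_1 \cdots dx_n$.

Thus the equation of $Q$ in tangential coördinates becomes

(23) $\displaystyle\int_R \mu\, dx_1 \cdots dx_n = (n + 2) \int_R \mu\, (x_i q_i)^2\, dx_1 \cdots dx_n$.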


$Q$ is by definition covariant under affine transformations. Thus we may
assume, without restriction of generality, that $Q$ is the unit sphere $q_1^2 + q_2^2 + \cdots
+ q_n^2 = 1$. In that case we must have

(24) $(n + 2) \displaystyle\int_R \mu\, x_i x_k\, dx_1 \cdots dx_n = \delta_{ik} \int_R \mu\, dx_1 \cdots dx_n$.

These equations may be interpreted as stating that the moment of inertia of $R$
with respect to any axis is the same as that of the homogeneous unit sphere
of same mass as $R$. Since the property that two regions have the same moment
of inertia with respect to every axis is invariant under affine transformations,
we see that irrespective of the equations (24) $R$ has the same moment of inertia
as $Q$ with respect to every axis, if $Q$ is covered with homogeneous mass of same
total mass as $R$, i.e., we have proved
5.4. The tangential quadric at a point $P$ of $R$ is identical with Legendre's ellipsoid
of inertia of $R$, if the polar of mass of $P$ is taken as plane at infinity, i.e., if $P$ is
the center of mass of $R$.⁶

If we denote as the generalized ellipsoid of inertia of $R$ in $P$ a quadric which
is covariant under collineations and coincides with Legendre's ellipsoid of
inertia in case $P$ is the center of mass of $R$, the tangential quadric will be identical
with the generalized ellipsoid of inertia, i.e., the P.C. with respect to $R$ is in the
neighborhood of $P$ approximately given by the P.C. with respect to the generalized
ellipsoid of inertia in $P$.

Let us again make use of the special non-homogeneous coördinate system,
in which $Q$ is the unit sphere about $P$, i.e., in which (24) holds. It then follows
that

(25) $(n + 2) \displaystyle\int_R \mu\, (x_1^2 + x_2^2 + \cdots + x_n^2)\, dx_1 \cdots dx_n = n \int_R \mu\, dx_1 \cdots dx_n$.

From this equation it follows that the inequality

$x_1^2 + x_2^2 + \cdots + x_n^2 < \dfrac{n}{n + 2}$

cannot hold for all points of $R$. Hence:
5.5.1. If $R$ is enlarged in the ratio $(n + 2)^{1/2} : n^{1/2}$, it is not contained in Legendre's
ellipsoid of inertia of $R$.

5.5.1 may be expressed in the projectively invariant form: 
5.5.2. Let Q be the tangential quadric at a point P with polar of mass p. Then 
there is at least one point S of R, such that the cross ratio (P, p/S, s) = n/(n + 2), 
where s is the polar of S with respect to Q. 

5.5.2 is remarkable, because it gives an inequality for the tangential quadric 
in a P.C. generated by a mass distribution, which seems not to be satisfied in 
every positive, symmetric P.C. 


6. DEFINITION. The P.C. corresponding to a homogeneous mass-distribution
($\mu$ = const.) may be denoted as the principal P.C. with respect to $R$.
6.1.1. Let $P$ be a point of $R$ and $p$ its polar of mass in the principal P.C. Let


6 Legendre’s ellipsoid of inertia of a body is defined as the homogeneous ellipsoid having 
the same moment of inertia with respect to every axis as the body. (Cf. Blaschke, Ber. 
Verh. sachs. Akad. d. Wiss., vol. 70 (1918), pp. 72-75.) Legendre’s ellipsoid is essentially 
the polar reciprocal of Binet’s fundamental ellipsoid of inertia, which is characterized by 
the property that the moment of inertia with respect to any plane through its center is 
inversely proportional to the square of a radius vector perpendicular to it. (Cf., e.g., 
A. G. Webster, Dynamics, p. 231.) 













$f$ be any $(n - 2)$-flat contained in $p$. If then $q_1$ and $q_2$ denote the planes of support
of $R$ passing through $f$ and $q_3$ the plane through $P$ and $f$, then

$-n \leq (p q_3 / q_1 q_2) \leq -\dfrac{1}{n}$.

(If $R$ is a quadric, the cross ratio always has the value $-1$.)

Proof. 6.1.1 is the projective formulation of the following affine theorem
given by Minkowski: the distance of the center of mass of a homogeneous convex
body from any plane of support $s$ lies between $B/(n + 1)$ and $nB/(n + 1)$, where
$B$ is the distance of $s$ from the plane of support parallel to $s$.⁷

The following dual theorem is proved similarly. 

6.1.2. Given a point $P$ of $R$ and its polar $p$ under the principal P.C. Let $F$
denote any line through $P$ and let $Q_F$, $Q_F'$, $Q_F''$ be respectively its points of
intersection with $p$ and with the boundary of $R$. Then

$-n \leq (P Q_F / Q_F' Q_F'') \leq -\dfrac{1}{n}$.


If $\mu$ is constant and if a non-homogeneous coördinate system is chosen, in
which Legendre's ellipsoid of inertia is the unit sphere about $P$, according to (25)

(26) $(n + 2) \displaystyle\int_R (x_1^2 + \cdots + x_n^2)\, dx_1 \cdots dx_n = n \int_R dx_1 \cdots dx_n = nV$,

$V$ being the volume of $R$. Let us introduce polar coördinates by $x_i = r \xi_i$,
where $(\xi_1, \dots, \xi_n)$ is a point of the unit sphere $\Omega$. If $d\omega$ is the element of
surface of $\Omega$, (26) may be written

(27) $\displaystyle\int_\Omega \rho^{n+2}\, d\omega = \int_\Omega \rho^n\, d\omega$,

where $r = \rho(\xi_1, \dots, \xi_n)$ is the equation of the boundary of $R$. From (27) it
follows that $\rho$ can neither be $> 1$ nor $< 1$ for all points of the boundary of $R$.
Thus the boundary of R has certainly points in common with the boundary of 
the unit sphere. This proves the following projectively invariant property 
of the principal P.C. 
6.2. The boundary of R is intersected by the boundary of every tangential quadric 
in the principal P.C. 

Compare with 5.5.2 and 4.4. 

Some other consequence of (26) may be pointed out. Let again the unit
sphere about $P$ be Legendre's ellipsoid of inertia of $R$. Let $\Sigma$ be the sphere of
volume $V$ about $P$. Then

(28) $\displaystyle\int_R r^2\, dx_1 \cdots dx_n \geq \int_\Sigma r^2\, dx_1 \cdots dx_n$,


⁷ Cf. Bonnesen and Fenchel, Konvexe Körper, p. 52.














as $r^2$ is less in any point of $\Sigma$ not in $R$ than in any point of $R$ not in $\Sigma$. If $\rho_\Sigma$
is the radius of $\Sigma$, then

$\displaystyle\int_\Sigma r^2\, dx_1 \cdots dx_n = \frac{n}{n + 2}\, \rho_\Sigma^2\, V$.

Thus according to (26) and (28)

$V \geq \rho_\Sigma^2\, V$,

i.e., $\rho_\Sigma \leq 1$; this means that the sphere of volume $V$ has no greater radius than
the unit sphere, and consequently the volume of the unit sphere is not less than
that of $R$. As Legendre's ellipsoid is covariant under affine transformations,
we have thus proved the following theorem of Blaschke:

6.3. The volume of Legendre's ellipsoid of inertia of $R$ is not less than that of $R$.⁸
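A worked instance of (24), 6.2 and 6.3 may be helpful. The sketch below takes, as an assumption made only for illustration, $n = 2$ and $R$ the homogeneous square $[-a, a]^2$; its inertia tensor is isotropic, so Legendre's ellipsoid is a disc, whose radius follows from $(n + 2) \int_R x_1^2 = \rho^2 V$. The function names are hypothetical.

```python
from fractions import Fraction
import math

def square_volume_and_moment(a2):
    """Area V and the integral of x1^2 over the square [-a, a]^2, given a2 = a^2."""
    volume = 4 * a2
    moment = Fraction(4, 3) * a2 * a2  # (2a) * (2a^3 / 3)
    return volume, moment

def legendre_radius_squared(a2, n=2):
    """rho^2 of the homogeneous disc with the same mass and moments as the square."""
    volume, moment = square_volume_and_moment(a2)
    return (n + 2) * moment / volume

a2 = Fraction(3, 4)                     # normalization making the disc the unit disc
rho2 = legendre_radius_squared(a2)
volume, _ = square_volume_and_moment(a2)
print(rho2, volume, math.pi * rho2)     # radius^2, area of R, area of the disc
print(a2 < rho2 < 2 * a2)               # edge midpoints inside, corners outside
```

With $a^2 = 3/4$ the disc is the unit disc, its area $\pi$ exceeds the area 3 of the square as 6.3 requires, and the boundary of the square crosses the unit circle as 6.2 requires.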

Every theorem on moments of inertia of homogeneous convex regions may be 
interpreted as a statement on the tangential quadrics in a principal P.C. Cf., 
e.g., in this connection the author’s paper Moments of inertia of convex regions, 
this Journal, vol. 2, pp. 447-452. 

Remark. Given an algebraic surface of degree $2m$ consisting of $m$ closed surfaces
containing one another. Let $\Sigma$ be the (convex) most interior of these
surfaces. Then there is a 1-1 correspondence of the points of $\Sigma$ and the planes
outside $\Sigma$, if the plane corresponding to a point is its ordinary plane polar with
respect to the algebraic surface. This correspondence is a positive, symmetric
P.C.


UNIVERSITY OF KENTUCKY. 


⁸ Cf. Blaschke, loc. cit. footnote 6, where this theorem appears as a generalization of
Sylvester's four-point-problem. The two volumes in 6.3 can only be equal, if $R$ is an ellipsoid,
as the equality sign in (28) holds only if $R$ is a sphere. The convexity of $R$ is not
actually used in the proof.



















INTERIOR TRANSFORMATIONS ON COMPACT SETS 
By G. T. WHYBURN


1. As originally defined by Stoilow,¹ a single-valued continuous transformation
$T(A) = B$ is called an interior transformation provided (1) the image of
every open set in $A$ is open in $B$ and (2) the inverse set $T^{-1}(b)$ of each point $b$
in $B$ is totally disconnected. Most later writers have omitted condition (2)
and have spoken of an interior transformation as one satisfying (1) alone.
While this latter point of view will be adhered to in this paper, it turns out that 
in most cases the hypotheses in the theorems are such as to make both (1) and 
(2) satisfied. In order to save words we shall call a continuous transformation 
satisfying (2) a light transformation. Hence a light interior transformation in our 
terminology is the same as an interior transformation as defined by Stoilow. 

Our object will be to develop fundamental properties of interior transforma- 
tions as applied to compact metric sets with a minimum of restriction on the 
image set. All our sets are assumed to lie in a metric space. We begin with 
some basic lemmas and theorems. 

(1.1) LEMMA. Let $T(R) = S$ be single-valued and let $Q$ be a subset of $R$ such
that

(i) $Q = T^{-1}T(Q)$.
Then for any subset $X$ of $R$ we have
(ii) $T(Q \cdot X) = T(Q) \cdot T(X)$.

Proof. Obviously $T(Q \cdot X) \subset T(Q) \cdot T(X)$. To prove the reverse inclusion,
let $x \in T(Q) \cdot T(X)$. Then $T^{-1}(x) \subset Q$, $T^{-1}(x) \cdot X \neq 0$. Thus if $y \in T^{-1}(x) \cdot X$,
we have $y \in Q \cdot X \cdot T^{-1}(x)$, a relation which gives $T(y) = x \in T(Q \cdot X)$.
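Lemma (1.1) is purely set-theoretic and can be illustrated on finite sets. In the sketch below the transformation is a small dictionary chosen for illustration (not taken from the paper), the product $Q \cdot X$ is read as intersection, and the helper names are hypothetical; the second example shows that (ii) can fail when condition (i) is dropped.

```python
def image(T, S):
    """T(S) for a transformation given as a dict."""
    return {T[x] for x in S}

def preimage(T, S):
    """The inverse set of S under T."""
    return {x for x in T if T[x] in S}

def satisfies_i(T, Q):
    """Condition (i): Q equals the inverse image of T(Q)."""
    return preimage(T, image(T, Q)) == set(Q)

def satisfies_ii(T, Q, X):
    """Conclusion (ii): T(Q.X) = T(Q).T(X), products read as intersections."""
    return image(T, set(Q) & set(X)) == image(T, Q) & image(T, X)

T = {1: "a", 2: "a", 3: "b", 4: "c"}
print(satisfies_i(T, {1, 2, 3}), satisfies_ii(T, {1, 2, 3}, {2, 4}))  # True True
print(satisfies_i(T, {1, 3}), satisfies_ii(T, {1, 3}, {2, 3}))        # False False
```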

(1.2) LEMMA. If $T(R) = S$ is an interior transformation on $R$, and $Q$ is any
subset of $R$ satisfying (i), then $T$ is an interior transformation on $Q$.

Proof. Let $X_1$ be any open subset of $Q$. There exists an open subset $X$ of
$R$ such that $X_1 = X \cdot Q$. By hypothesis $T(X) = U$ is open in $S$, and by (1.1)
$T(X_1) = T(Q \cdot X) = T(Q) \cdot T(X) = T(Q) \cdot U$.

Thus $T(X_1)$ is open in $T(Q)$ and the lemma follows.

(1.3) LEMMA. If $A$ is compact and $T(A) = B$ is continuous, then for any set
$R \subset A$
(i) $T(\overline{R}) = \overline{T(R)}$,

(ii) $\overline{T(R)} - T(R) \subset T(\overline{R} - R)$.
Received March 19, 1937. 
¹ See Stoilow, Annales Scientifiques de l'École Normale Supérieure, vol. 63 (1928), pp.
347-382 and Annales de l'Institut Henri Poincaré, vol. 2 (1932), pp. 233-266.
















Thus if $T$ is an interior transformation and $R$ is open in $A$,
(iii) $F[T(R)] \subset T[F(R)]$,

where in general $F(X)$ denotes the boundary of $X$.

(1.31) COROLLARY. The Menger-Urysohn order of a point in $A$ is never increased
when $A$ undergoes an interior transformation. The properties of being
(i) a curve of order $\leq n$, (ii) a regular curve, (iii) a rational curve, are invariant
under interior transformations.

(1.4) Theorem. Let A and B be compact and let T(A) = B be interior. Then
if R is any quasi-connected open set in B, every quasi-component Q of $T^{-1}(R)$
maps onto all of R under T.

Proof. Suppose, on the contrary, that for some $p \in R$, $T^{-1}(p) \cdot Q = 0$. Then
since $T^{-1}(p)$ is compact and is contained in $T^{-1}(R)$, it follows by an application of the
Borel theorem that there exists an open set U such that

(i) $Q \subset U \subset T^{-1}(R)$, $F(U) \cdot T^{-1}(R) = 0$, $U \cdot T^{-1}(p) = 0$.

Now T(U) = V is open, and we have

(ii) $T(Q) \subset V \subset R - p$,

whence, since R is quasi-connected, $F(V) \cdot R \ne 0$. But by Lemma (1.3),
$F(V) \subset T[F(U)]$ and by (i), $T[F(U)] \cdot R = 0$. Thus our theorem is proved.

(1.41) Corollary. If A is locally connected, if $B_0$ is any closed set in B and
R is any component of $B - B_0$, then $T^{-1}(R)$ has just a finite number of components
and each one of these components maps onto all of R under T.

(1.42) Corollary. If A is locally connected and B is connected and cyclic
(i.e., without cut points), then for each $x \in B$, $A - T^{-1}(x)$ has just a finite number
of components and each of these components maps onto B - x under T.

(1.5) Theorem. Let T(A) = B be an interior transformation, where A is
compact, and let C be any continuum in B. Then for each component K of
$T^{-1}(C) = Q$, we have T(K) = C.

For by (1.2), the transformation T(Q) = C is interior. Thus, since C is
connected and open in C, it follows by (1.4) that each component of Q maps onto
all of C under T.

(1.6) Let $T(A) = T_2T_1(A) = B$, where $T_1(A) = A'$ and $T_2(A') = B$ are
continuous. If T is interior, so also is $T_2$.

For let R be any open set in A' and let $Q = T_1^{-1}(R)$. Then since $T_1$ is
continuous, Q is open in A; and since T is interior, T(Q) is open in B. But
$T(Q) = T_2T_1(Q) = T_2(R)$. Accordingly, $T_2(R)$ is open in B.

(1.61) Corollary. If A is compact, any interior transformation T(A) = B
can be factored into the form $T(A) = T_2T_1(A)$, where $T_1$ is monotone² and $T_2$ is
interior and light.


2 A transformation T(A) = B is monotone provided that for each $b \in B$, $T^{-1}(b)$ is
connected.











372 G. T. WHYBURN 


For T can be factored into the form $T_2T_1$, where $T_1$ is monotone and $T_2$ is
light by a known theorem³; and by (1.6), $T_2$ is necessarily interior.


2. Mapping of the Betti groups. Eilenberg⁴ has raised the question whether
under an interior transformation T(X) = Y, where X is compact, the Betti
groups of X map homomorphically onto the corresponding groups of Y. With
respect to the integral coefficient domain G, this question is easily answered
in the negative by the transformation $w = z^2$ of the circle |z| = 1 into the
circle |w| = 1, since in this case clearly⁵ $B^1_G(X)$ maps onto $2B^1_G(Y)$.

However, if we consider the Betti groups relative to the rational coefficient
field R, this question is not nearly so easy to answer. In the above case, clearly
$B^1_R(X)$ maps homomorphically onto $B^1_R(Y)$, and indeed it will be proved in the
next section that this is always true for any linear graph X undergoing an interior
transformation.⁶
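
The contrast between integral and rational coefficients in the circle example can be illustrated numerically. The sketch below is an assumption-laden illustration, not part of the argument: it reads the example as the squaring map on |z| = 1, and the function names are hypothetical. It computes the winding number of the image of the generating cycle, which is the integer by which the one-dimensional integral group is multiplied.

```python
import cmath
import math
from fractions import Fraction


def winding_number(curve, samples=2000):
    """Winding number about 0 of a closed curve t -> curve(t), 0 <= t <= 1."""
    total = 0.0
    prev = curve(0.0)
    for i in range(1, samples + 1):
        cur = curve(i / samples)
        total += cmath.phase(cur / prev)  # small signed angle increment
        prev = cur
    return round(total / (2 * math.pi))


def generator(t):
    """The unit circle traversed once."""
    return cmath.exp(2j * math.pi * t)


def image_of_generator(t):
    """Image of the generating cycle under w = z^2."""
    return generator(t) ** 2


degree = winding_number(image_of_generator)

# Over the integers the generator of the image circle is not attained;
# over the rationals it is the image of (1/degree) times the generator.
rational_preimage_coefficient = Fraction(1, degree)
```

With integer coefficients only multiples of `degree` are reached, whereas the rational coefficient `1/degree` recovers the generator.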

The following example is of interest in this connection as well as in connec- 
tion with a theorem to be proved below in §4. 

(2.1) Example. There exists a compact continuum K with a vanishing
one-dimensional integral Betti group and an interior transformation f(K) = J of
K onto a circle J.

To exhibit this, we employ a continuum K which has been constructed by
Vietoris⁷ for a different purpose. This continuum is constructed by first taking
a Cantor ternary set on both the upper and lower bases of the unit square with 
vertices (0, 0), (1, 0), (1, 1), (0, 1) and joining corresponding points in these 
two sets (i.e., points with the same abscissa) by the altitude of the square they 
determine. This gives a set C which may be considered a “Cantor set of unit 
intervals”. K is then constructed from C as follows. We identify the lower 
endpoints of the left half of these intervals with the upper endpoints of the 
right half in the same sense from left to right; then divide the remaining lower 
endpoints in half and identify the left half with the upper right half of the re- 
maining upper endpoints in the same sense; next divide the remaining lower 
endpoints in half and identify the lower left half with the upper right half of the 
remaining upper endpoints, and so on indefinitely. Finally identify the points 
(1, 0) and (0, 1). 

Vietoris (loc. cit.) has shown that the one-dimensional integral Betti group 
of this continuum K reduces to the null element. 

We now proceed to define an interior transformation f which will map K 


3 See Eilenberg, Fundamenta Mathematicae, vol. 22 (1934), p. 292 and G. T. Whyburn, 
American Journal of Mathematics, vol. 56 (1934), p. 297. 

4 See Fundamenta Mathematicae, vol. 24 (1935), p. 175.

5 We employ here the notation (see Alexandroff-Hopf, Topologie) $B^r_J(K)$ for the r-dimensional
Betti group of a complex (or set) K relative to a coefficient domain J.

6 Results obtained recently indicate that this also holds for any compact set X with 
p(X) finite. This is contrary to a statement made by the author in an abstract in the 
Bulletin of the American Mathematical Society, vol. 43 (1937), p. 183. 

7 See Mathematische Annalen, vol. 97 (1927), p. 459. 



















onto the unit circle J. Let $f_1(C) = K$ be the (continuous) transformation
representing the above "identifying" construction of K from the set C. Let
$p \in K$ and let $(x, y) \in f_1^{-1}(p)$, where x and y are Cartesian coördinates. We define

$f(p) = (x', y')$,

where

$x' = \cos \pi y$, $y' = \sin \pi y$ for $x < \tfrac{1}{2}$, and
$x' = \cos \pi(1 + y)$, $y' = \sin \pi(1 + y)$ for $x > \tfrac{1}{2}$.

Clearly this transformation is interior and light and it maps K onto the circle
$x'^2 + y'^2 = 1$.


3. Linear graphs. 

(3.1) Theorem. Let A be a linear graph and let f(A) = B be interior. Then
B is a linear graph and there exist subdivisions of A and B respectively into finite
complexes $K_a$ and $K_b$ such that f maps each edge of $K_a$ topologically into an edge
in $K_b$. Thus $f(K_a) = K_b$ is a simplicial transformation. Furthermore, if J
is any simple closed curve in B, there exists a simple closed curve C in A such that
f(C) = J and, on C, f is topologically equivalent⁸ to the transformation $w = z^k$
on |z| = 1 (k an integer). Consequently $B^1_R(K_a)$ maps homomorphically onto
$B^1_R(K_b)$ under f.

Proof. Since A has only a finite number of points of order > 2 and has no
point of increasing order, and since, by (1.31), the order of no point can be
increased under f, it follows that B has these same properties. Accordingly,
B is a linear graph.

Furthermore, for each $b \in B$, $f^{-1}(b)$ must be a finite set of points. For otherwise
there would exist infinitely many components of $A - f^{-1}(b)$, and clearly
this is impossible, since by (1.41) the inverse of each one of the finite number
of components of B - b is a finite set of components of $A - f^{-1}(b)$.

Now let $E_a$ and $E_b$ respectively denote the sets of points in A and B of order
$\ne 2$. Let $D_b = E_b + f(E_a)$ + one point from each component of $B - [E_b + f(E_a)]$
which has at most one limit point in $E_b + f(E_a)$; let $D_a = f^{-1}(D_b)$. The
sets $D_a$ and $D_b$ subdivide A and B into complexes $K_a$ and $K_b$, respectively.
Let xy be any edge of $K_a$ and let f(x) = x', f(y) = y', f(xy) = x'y'. Then
$x' \ne y'$, since otherwise f(xy - x - y) would be a free arc in B having just one
endpoint, which is impossible by the choice of $D_b$. Furthermore, for any
$p \in x'y' - x' - y'$, $f^{-1}(p) \cdot xy$ reduces to just one point. For if not, there would
exist an open arc $p_1p_2$ in $xy - xy \cdot f^{-1}(p)$ with $p_1 + p_2 \subset f^{-1}(p)$; this would give
$f(p_1p_2) \supset x'$ or $f(p_1p_2) \supset y'$; and both are impossible, since $f^{-1}(x') \cdot xy = x$,


8 Two transformations T(A) = B and W(A') = B' are said to be topologically equivalent
provided there exist topological transformations $H_1(A) = A'$ and $H_2(B') = B$ such that
$T(x) = H_2WH_1(x)$ for every $x \in A$. See my paper Completely alternating transformations,
Fundamenta Mathematicae, vol. 27 (1936), pp. 140-146.










$f^{-1}(y') \cdot xy = y$. Accordingly, f(xy) = x'y' is topological and $f(K_a) = K_b$ is
simplicial.
Now let J be any simple closed curve in B, let the vertices of $K_b$ on J be
ordered cyclically $p_0, p_1, \ldots, p_n, p_0$ and let $q_0^1 \in f^{-1}(p_0)$. There exists an edge
$q_0^1q_1^1$ of $K_a$ which is contained in $f^{-1}(p_0p_1)$ and such that $q_1^1 \in f^{-1}(p_1)$. Similarly
there is an edge $q_1^1q_2^1$ with $q_1^1q_2^1 \subset f^{-1}(p_1p_2)$ and $q_2^1 \in f^{-1}(p_2)$, and so on to $p_n$. There
is an edge $q_n^1q_0^2$ with $q_n^1q_0^2 \subset f^{-1}(p_np_0)$ and $q_0^2 \in f^{-1}(p_0)$. If $q_0^2 = q_0^1$, we can bring
our selection to a close. If not, we choose an edge $q_0^2q_1^2$ in $f^{-1}(p_0p_1)$. Again if
$q_1^2 = q_1^1$, we may stop. Otherwise we choose an edge $q_1^2q_2^2$ in $f^{-1}(p_1p_2)$, and so on.
Continuing this process, since $f^{-1}(b)$ is a finite set for each $b \in B$, after a finite
number of steps we must eventually reach, for the first time, a point $q_i^j$ $(i \le n)$
such that $q_i^j = q_i^m$ (m < j). Then clearly the edges

$$q_i^m q_{i+1}^m,\ q_{i+1}^m q_{i+2}^m,\ \ldots,\ q_n^m q_0^{m+1},\ q_0^{m+1} q_1^{m+1},\ \ldots,\ q_{n-1}^{m+1} q_n^{m+1},\ q_n^{m+1} q_0^{m+2},\ \ldots,\ q_{i-1}^{j} q_i^{j}$$

fit together to form a simple closed curve C which maps onto J exactly j - m
times; and since each edge $q_s^r q_{s+1}^r$ maps topologically onto $p_s p_{s+1}$, it follows at once that
if we set j - m = k, the transformation f(C) = J is topologically equivalent
to the transformation $w = z^k$ on the circle |z| = 1 of the complex z-plane.

Since, as shown above, f can be considered as a simplicial transformation of
$K_a$ onto $K_b$, it follows that f generates a homomorphism, which we shall also
call f, of $B^1_R(K_a)$ into $B^1_R(K_b)$. To see that actually $f[B^1_R(K_a)] = B^1_R(K_b)$, let
us take any rational 1-cycle z in $K_b$. As is well known, z can be expressed as a
linear form

$z = a_1z_1 + a_2z_2 + \cdots + a_nz_n$ ($a_i$ rational),

where $z_i$ is a simply oriented simple closed polygon. By the above, we can
find a simple closed curve $C_i$ in A such that $f(c_i) = k_iz_i$ ($i \le n$, $k_i$ an integer),
where $c_i$ is an oriented fundamental cycle on $C_i$. Accordingly if we set

$y = \dfrac{a_1}{k_1}c_1 + \dfrac{a_2}{k_2}c_2 + \cdots + \dfrac{a_n}{k_n}c_n$,

then y is a rational 1-cycle in $K_a$ and f(y) = z. Thus the homology class of
y in $B^1_R(K_a)$ maps onto the homology class of z in $B^1_R(K_b)$ and our result follows.
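
The division by the integers $k_i$ in the last step is exactly where rational coefficients are used. A minimal sketch with exact rational arithmetic follows; the coefficient lists and the degrees `k` are hypothetical sample data, and cycles are represented only by their coefficient vectors on the $c_i$ and $z_i$.

```python
from fractions import Fraction


def lift_cycle(a, k):
    """Coefficients of y = sum (a_i / k_i) c_i for z = sum a_i z_i."""
    return [Fraction(ai) / ki for ai, ki in zip(a, k)]


def push_forward(y, k):
    """Coefficients of f(y) on the z_i, using f(c_i) = k_i z_i."""
    return [yi * ki for yi, ki in zip(y, k)]


# Hypothetical data: a rational 1-cycle z and covering degrees k_i.
a = [Fraction(3), Fraction(-1, 2), Fraction(5)]
k = [2, 3, 1]

y = lift_cycle(a, k)
image_of_y = push_forward(y, k)
```

Here `y` has the non-integral coefficient 3/2 on the first cycle, which is why the same construction is not available over the integers.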

(3.11) Corollary. $p^1(K_b) \le p^1(K_a)$ (where $p^r(X)$ denotes the r-dimensional
Betti number of X).

(3.2) If A is a simple closed curve and T(A) = B is interior, then B is either
a simple closed curve or a simple arc. If B is a simple closed curve, there exists
an integer k such that T is topologically equivalent to the transformation $w = z^k$
on the circle |z| = 1. If B is an arc, there exists an integer k such that T is equivalent
to the transformation $f(1, \theta) = \sin(k\theta/2)$ of the circle $\rho = 1$ into the interval
(-1, 1).

Proof. It follows from (1.31) that every point of B is of order $\le 2$. Accordingly,
B is either a simple closed curve or a simple arc.













If B is a simple closed curve, it follows from (3.1) by taking J = B that T is
equivalent to $w = z^k$ on |z| = 1 for some integer k.

If B is a simple arc xy, referring back to the proof of (3.1), we see that $D_b = x + y$
and hence $D_a = T^{-1}(x) + T^{-1}(y)$. Thus if we order the points of $D_a$
cyclically on A: $p_1, p_2, \ldots, p_k, p_1$, where $p_1 \in T^{-1}(x)$, then by (3.1), $p_1p_2$ maps
topologically onto xy, $p_2p_3$ maps topologically onto yx, $p_3p_4$ onto xy and so on to
$p_kp_1$, which maps onto yx. (Note that k must be even.) Hence T is equivalent
to the transformation $f(1, \theta) = \sin \tfrac{1}{2}k\theta$ of $\rho = 1$ into (-1, 1).

Remark. In each case the integer k is exactly the maximum power among
the sets $T^{-1}(b)$ for all $b \in B$. If k is odd, B is necessarily a simple closed curve,
T is completely alternating⁹ and all the sets $T^{-1}(b)$ are of power k. If k is even,
B may be either a simple closed curve or an arc xy; and in the latter case, $T^{-1}(x)$
and $T^{-1}(y)$ are each of power $\tfrac{1}{2}k$, while for every other point b of B, $T^{-1}(b)$ is of
power k.

(3.3) If A is a simple arc pq and T(A) = B is interior, B is a simple arc
and there exists an integer k such that T is topologically equivalent to the transformation
$f(x) = \cos k\pi x$ of the interval (0, 1) into the interval (-1, 1).

Proof. Since, by (1.1), B must have at least two points of order 1 but no
point of order > 2, it follows that B is a simple arc uv.

Referring again to the proof of (3.1), we see that in this case $D_b = u + v \supset T(p) + T(q)$
and $D_a = T^{-1}(u) + T^{-1}(v)$. Hence if we order the points of $D_a$
on A from p to q: $p = p_0, p_1, \ldots, p_k = q$ and suppose the notation chosen so
that T(p) = u, then by (3.1), $p_0p_1$ maps topologically onto uv, $p_1p_2$ maps topologically
onto vu, $p_2p_3$ onto uv, and so on to $p_{k-1}p_k$, which may map either onto
uv or vu. Hence it is seen at once that T is equivalent to the transformation
$f(x) = \cos k\pi x$ of (0, 1) into (-1, 1).
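
The folding behaviour of the model transformation can be seen numerically: each value strictly between -1 and 1 should be taken exactly k times, once on each of the k subarcs. The sketch below assumes the model map is read as cos(k·pi·x) on the unit interval; the function name and the sampling scheme are choices made here for illustration.

```python
import math


def count_preimages(k, c, samples=20001):
    """Count solutions of cos(k*pi*x) = c on (0, 1) for -1 < c < 1.

    Solutions are counted as sign changes of cos(k*pi*x) - c along a grid;
    samples landing exactly on a solution are skipped so that the crossing
    is still detected at the next sample.
    """
    count = 0
    prev = 1.0 - c  # value at x = 0, positive since c < 1
    for i in range(1, samples + 1):
        cur = math.cos(k * math.pi * i / samples) - c
        if cur == 0.0:
            continue
        if prev * cur < 0:
            count += 1
        prev = cur
    return count


# Each interior value is attained once on every one of the k subarcs.
counts = {(k, c): count_preimages(k, c)
          for k in range(1, 7)
          for c in (-0.7, 0.0, 0.3)}
```

The grid is fine enough for small k; for large k the number of samples would have to grow accordingly.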

By Corollary (3.11), the Betti numbers of a graph cannot be increased under
an interior transformation. This conclusion can be strengthened somewhat
for graphs without endpoints, namely, in that we can obtain the same conclusion
for any continuous transformation T(A) = B which increases the (Menger-Urysohn)
order of no point of A. Such a transformation will be called
non-order-increasing.

(3.4) Lemma. If A is a connected linear graph without endpoints and T(A) =
B is a non-order-increasing transformation, then B is a linear graph and
$p^1(B) \le p^1(A)$.

Proof. Since A has only a finite number of points of order $\ne 2$ and has no
point of increasing order, it follows that B must have these same properties
and hence must be a graph.

Let X be a finite set of points in A such that X contains all points of A of
order $\ne 2$ and T(X) = Y contains all points of B of order $\ne 2$ and such that
each component of A - X or B - Y has two distinct limit points in X or Y
respectively. Let $\alpha^0$ and $\beta^0$ be the number of points in X and Y respectively.
Let $\alpha^1$ and $\beta^1$ be the number of components in A - X and B - Y respectively.


9 Loc. cit.










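Then by the Euler-Poincaré formula we have

(i) $p^1(A) = \alpha^1 - \alpha^0 + 1$, $p^1(B) = \beta^1 - \beta^0 + 1$.

Also, if we set

$X = \sum_{1}^{\alpha^0} x_i$, $Y = \sum_{1}^{\beta^0} y_i$,

we have

(ii) $\sum_{1}^{\alpha^0} o(x_i) = 2\alpha^1$, $\sum_{1}^{\beta^0} o(y_i) = 2\beta^1$,

where in general o(x) denotes the Menger-Urysohn order of the point x. Now
for each $i \le \beta^0$, let

$X \cdot T^{-1}(y_i) = x_i^1 + x_i^2 + \cdots + x_i^{k_i}$.

Then

(iii) $\sum_{i=1}^{\beta^0} k_i = \alpha^0$ and $\sum_{i=1}^{\beta^0} \sum_{j=1}^{k_i} x_i^j = X = \sum_{1}^{\alpha^0} x_i$.

Since by hypothesis $o(x_i^j) \ge o(y_i)$ and $o(x_i^j) \ge 2$, we have

(iv) $\sum_{j=1}^{k_i} o(x_i^j) \ge o(y_i) + \sum_{j=2}^{k_i} o(x_i^j) \ge o(y_i) + 2(k_i - 1)$,

whence

(v) $\sum_{i=1}^{\beta^0} \sum_{j=1}^{k_i} o(x_i^j) \ge \sum_{i=1}^{\beta^0} o(y_i) + 2\sum_{i=1}^{\beta^0} k_i - 2\beta^0$.

Thus from (ii), (iii) and (v) we get

$2\alpha^1 \ge 2\beta^1 + 2\alpha^0 - 2\beta^0$,

$(\alpha^1 - \beta^1) - (\alpha^0 - \beta^0) \ge 0$,

whence, from (i),

$p^1(A) - p^1(B) = (\alpha^1 - \beta^1) - (\alpha^0 - \beta^0) \ge 0$

or

$p^1(A) \ge p^1(B)$.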
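
The two counting identities used above, the Euler-Poincaré formula (i) and the order sum (ii), are easy to verify on small graphs. The following sketch is illustrative only: graphs are given by a vertex count and an edge list (the finite set X and the components of A - X), and the three sample graphs are hypothetical.

```python
def betti_1(num_vertices, edges):
    """First Betti number: edges - vertices + connected components.

    For a connected graph this is the expression alpha^1 - alpha^0 + 1 of (i).
    """
    parent = list(range(num_vertices))

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    components = num_vertices
    for u, v in edges:
        ru, rv = find(u), find(v)
        if ru != rv:
            parent[ru] = rv
            components -= 1
    return len(edges) - num_vertices + components


def order_sum(num_vertices, edges):
    """Sum of the orders of the vertices, the left-hand side of (ii)."""
    order = [0] * num_vertices
    for u, v in edges:
        order[u] += 1
        order[v] += 1
    return sum(order)


# Hypothetical examples: a hexagon wrapping twice around a triangle
# (as w = z^2 does), and a theta curve with two vertices of order 3.
hexagon = [(i, (i + 1) % 6) for i in range(6)]
triangle = [(i, (i + 1) % 3) for i in range(3)]
theta = [(0, 1), (0, 1), (0, 1)]
```

For the hexagon mapped onto the triangle by reducing vertex labels mod 3, both Betti numbers equal 1, in agreement with the inequality just proved.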


(3.5) Let T(A) = B be non-order-increasing. Let A be a connected graph
having a set X of q endpoints and let T(X) contain r points. Then B is a graph
and $p^1(B) \le p^1(A) + \tfrac{1}{2}(q - r)$.

For we can replace (iv) by

(iv)' $\sum_{j=1}^{k_i} o(x_i^j) \ge o(y_i) + \sum_{j=2}^{k_i} o(x_i^j) \ge o(y_i) + 2(k_i - 1) - (q_i - 1)$ $(i \le r)$,













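where $q_i$ is the number of points of order 1 of A in $T^{-1}(y_i)$ and the notation is
chosen so that $o(x_i^1) = \min_j\,[o(x_i^j)]$ and $q_i > 0$ for $i \le r$. Summing, we get

(v)' $\sum_{i=1}^{\beta^0} \sum_{j=1}^{k_i} o(x_i^j) \ge \sum_{1}^{\beta^0} o(y_i) + 2\sum_{1}^{\beta^0} k_i - 2\beta^0 - \sum_{1}^{r} q_i + r$.

Whence, as before,

$2\alpha^1 \ge 2\beta^1 + 2\alpha^0 - 2\beta^0 - (q - r)$,

$(\alpha^1 - \beta^1) - (\alpha^0 - \beta^0) \ge -\tfrac{1}{2}(q - r)$,

$p^1(A) - p^1(B) \ge -\tfrac{1}{2}(q - r)$,

$p^1(B) \le p^1(A) + \tfrac{1}{2}(q - r)$.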

(3.51) Corollary. If

(i) q = 0, i.e., there are no endpoints,

(ii) q = r, i.e., T is (1-1) on the set X of endpoints of A,

(iii) $q - r \le 1$, i.e., at most one pair of endpoints in A map into one point in B, or

(iv) $q \le 2$, i.e., there are at most two endpoints in A,

then $p^1(B) \le p^1(A)$.

(3.6) Theorem. If A is a one-dimensional compact locally connected continuum
and T(A) = B is an interior transformation, then $p^1(B) \le p^1(A)$.

Proof. Clearly we may suppose $p^1(A)$ finite. If, contrary to our theorem,
$p^1(B) > p^1(A)$, we can choose an A-set H in B without endpoints and such that
$p^1(H) > p^1(A)$, for we need only take a finite number of true cyclic elements of
B so that their sum S has Betti number $p^1(S) > p^1(A)$ and let H be the smallest
A-set in B containing S.

Let K be a component of $T^{-1}(H)$. Then since T(K) = H is an interior transformation
[by (1.5) above, T(K) = H; and T is interior on K, since there are
only a finite number of components of $T^{-1}(H)$], and since H has no endpoints,
it follows that K has no endpoints because T is non-order-increasing on K.
Whence, by (3.4), $p^1(H) \le p^1(K)$. But $p^1(K) \le p^1(A)$, and this gives
$p^1(H) \le p^1(K) \le p^1(A)$, contrary to our supposition. Accordingly $p^1(B) \le p^1(A)$.

(3.61) Corollary. If A is a dendrite, so also is B.


4. Light interior transformations. It will be recalled from the introduction
that a transformation T(A) = B is light provided that for each $b \in B$, $T^{-1}(b)$ is
totally disconnected. We begin this section with a theorem for arbitrary compact
sets analogous to a result of Stoilow's¹⁰ for the case of a transformation of
one plane region into another.

(4.1) Theorem. Let T(A) = B be a light interior transformation, where A is
compact. Then if pq is any simple arc in B and $p_0$ is any point in $T^{-1}(p)$, there
exists a simple arc $p_0q_0$ in A such that $T(p_0q_0) = pq$ and T is topological on $p_0q_0$.


10 See Stoilow, loc. cit. The author learned recently that a theorem essentially the same 
as (4.1) has been discovered independently by Zippin and Montgomery. See an article by 
Montgomery in a forthcoming issue of the Transactions of the American Mathematical 
Society. 













Our proof for this theorem will make use of the following definition and lemma.

Definition. If $\sigma$: $p = x_0, x_1, \ldots, x_n = q$ ($x_i$ precedes $x_{i+1}$ in the order
p, q) is a subdivision of pq, a continuum K in $T^{-1}(pq)$ will be said to proceed
directly from $T^{-1}(p)$ to $T^{-1}(q)$ relative to the subdivision $\sigma$ provided T(K) = pq
and $K = \sum_{1}^{n} K_i$, where $K_i$ is a continuum in $T^{-1}(x_{i-1}x_i)$. By the norm of
such a continuum K will be meant $\max \delta(K_i)$.

Lemma. Given any subdivision $\sigma'$ of pq and any $\epsilon > 0$, there exists a subdivision
$\sigma$ of pq containing $\sigma'$ and a continuum K of norm $< \epsilon$ in $T^{-1}(pq)$ which contains
$p_0$ and which proceeds directly from $T^{-1}(p)$ to $T^{-1}(q)$ relative to $\sigma$.

Proof of Lemma. Let S denote the set of all points q' on pq for which such
a subdivision exists on pq' containing all points of $\sigma'$ on pq'. Let x be any
point of pq.

If $x \in S$, let us take the subdivision $\sigma = \sigma_x$ on px and the corresponding continuum
$K = K_x$. Let $z \in K \cdot T^{-1}(x)$. There exists an $\epsilon$-neighborhood U of z
such that $F(U) \cdot T^{-1}(x) = 0$. There exists a point v on xq such that if w is any
point whatever of xv, $F(U) \cdot T^{-1}(w) = 0$ and w does not belong to $\sigma'$ unless
w = x. It follows that $Q = T^{-1}(xv) \cdot U$ is both open and closed in $T^{-1}(xv)$.
Accordingly T(Q) = xv is an interior transformation. Let $K_{n+1}$ be the component
of Q containing z, let $\sigma_v$ be the subdivision $p = x_0, x_1, \ldots, x_n, x_{n+1} = v$
of pv and let $K_v = \sum_{1}^{n+1} K_i$. Then clearly $K_v$ is a continuum of norm $< \epsilon$ which
proceeds directly from $T^{-1}(p)$ to $T^{-1}(v)$ relative to $\sigma_v$, so that v belongs to S.
Thus S is open in pq.

Now if $x \in pq - S$, we can choose x to be the first point of pq - S on pq.
Hence $px - x \subset S$. If we take a sequence $y_1, y_2, \ldots$ on px so that $y_i \to x$,
and choose points $z_i \in K_{y_i} \cdot T^{-1}(y_i)$, where $K_{y_i}$ is a continuum of norm $< \epsilon$ in
$T^{-1}(py_i)$ proceeding directly from $T^{-1}(p)$ to $T^{-1}(y_i)$ relative to a subdivision
$\sigma_{y_i}$ of $py_i$ which contains all points of $\sigma'$ on $py_i$, the sequence $z_1, z_2, \ldots$ will
have a limit point z on $T^{-1}(x)$, and we may suppose $z_i \to z$. Let us choose the
$\epsilon$-neighborhood U of z exactly as before. There exists an i such that for any
$w \in y_ix$, $T^{-1}(w) \cdot F(U) = 0$ and such that no point of $y_ix$, except possibly x, belongs
to $\sigma'$. Hence if $Q = T^{-1}(y_ix) \cdot U$, Q is both open and closed in $T^{-1}(y_ix)$ and
$T(Q) = y_ix$ is interior. Now if $\sigma_{y_i}$ is denoted by $p = x_0, x_1, \ldots, x_n = y_i$ and
$K_{y_i} = \sum_{1}^{n} K_m$, let $\sigma_x$ be the subdivision $p = x_0, x_1, \ldots, x_n, x_{n+1} = x$ of px, let
$K_{n+1}$ be the component of Q containing $z_i$ and $K_x = \sum_{1}^{n+1} K_m$. Then clearly $K_x$ is
a continuum of norm $< \epsilon$ which proceeds directly from $T^{-1}(p)$ to $T^{-1}(x)$ relative
to $\sigma_x$, and $\sigma_x$ contains all points of $\sigma'$ on px. But this gives $x \in S$, contrary to
our supposition. Accordingly S = pq, and the lemma is proved.

Proof of the Theorem. There exists a subdivision $\sigma_1$: $p = x_0^1, \ldots, x_{n_1}^1 = q$
of pq and a continuum $K^1 = \sum_{1}^{n_1} K_i^1$ of norm < 1 which contains $p_0$ and proceeds
directly from $T^{-1}(p)$ to $T^{-1}(q)$ relative to $\sigma_1$.








In general, for each k there exists a subdivision

$\sigma_k$: $p = x_0^k, \ldots, x_{n_k}^k = q$,

which contains $\sigma_{k-1}$, and a continuum $K^k = \sum_{1}^{n_k} K_i^k$ of norm < 1/k which contains
$p_0$ and proceeds directly from $T^{-1}(p)$ to $T^{-1}(q)$ relative to $\sigma_k$.

Since the sequence $K^1, K^2, \ldots$ contains a convergent subsequence, we may
suppose, without loss of generality, that the whole sequence converges to a
limit which we shall call E. Since each $K^i$ is a continuum with $T(K^i) = pq$,
it follows that E is a continuum and T(E) = pq. To prove that E is an arc
and T(E) = pq is topological, it suffices to prove that for each $y \in pq$, $T^{-1}(y) \cdot E$
reduces to a single point.

Now since for each k, i, j ($i < j$; $i, j \le n_k$) there exists a continuum
$K_{ij}^k = \sum_{m=i+1}^{j} K_m^k$ in $K^k$ which contains $K^k \cdot T^{-1}(x_i^kx_j^k)$, it follows that for each k, i, j there
exists a continuum $E_{ij}^k = \lim_{(m)} K_{i_mj_m}^{k+m}$ in E which contains $E \cdot T^{-1}(x_i^kx_j^k)$. (Note:
$i_m$ and $j_m$ are integers such that $x_{i_m}^{k+m} = x_i^k$, $x_{j_m}^{k+m} = x_j^k$.)

Now for any $y \in pq$ and any k let us choose i and j so that (i) y is interior
to the arc $x_i^kx_j^k$ if $p \ne y \ne q$, (ii) $y = x_i^k = x_0^k = p$ if y = p, $y = x_j^k = x_{n_k}^k = q$ if y = q,
and (iii) $0 < j - i \le 2$. Then since in all cases $E \cdot T^{-1}(y)$ is interior to $E_{ij}^k$
(rel. E), it follows that $E \cdot T^{-1}(y) = \lim_{(k)} E_{ij}^k$. Accordingly, since each $E_{ij}^k$ is a
continuum, $E \cdot T^{-1}(y)$ is a continuum. Therefore $E \cdot T^{-1}(y)$ reduces to a single
point, since $T^{-1}(y)$ is totally disconnected, and our theorem is proved.

Remarks. It is of interest to note that the hypothesis of the theorem just
proved is satisfied even in cases such as that exhibited in the example given in §2.
Also it results at once from this theorem that the property of containing no arc
is invariant for compact sets under light interior transformations. It may be
noted further that, whereas the initial point $p_0$ of the arc $p_0q_0$ can be chosen
arbitrarily in $T^{-1}(p)$, the terminal point $q_0$ is not subject to choice but may
easily depend on the choice of $p_0$.

The methods of proof used in (3.1) together with the theorem just proved 
yield at once 

(4.2) Theorem. Under the conditions of (4.1), if J is any simple closed curve
in B, then for any integer n > 0 either there exists a simple closed curve C in A and
an integer $k \le n$ such that T(C) = J and, on C, T is topologically equivalent to the
transformation $w = z^k$ on |z| = 1, or there exists a simple arc X in A such that
T(X) = J and, on X, T is equivalent to the transformation $w = e^{iy}$ on the interval
$-(n+1)\pi \le y \le (n+1)\pi$. If there exists no such simple closed curve C for
any k, then there exists an open curve L in A such that T(L) = J and, on L, T is
equivalent to the transformation $w = e^{iy}$ on $-\infty < y < \infty$.

Note. Since on a linear graph A any interior transformation is necessarily 
light, it is clear that in so far as it concerns simple closed curves in B, (3.1) is a 
consequence of (4.2). Also since the open curve L (when it exists) necessarily 













contains infinitely many disjoint intervals all of diameter greater than some 
d > 0, we have 

(4.21) Corollary. If T(A) = B is interior and light, where A is hereditarily
locally connected, then for any simple closed curve J in B there exists a simple closed
curve C in A and an integer k such that T(C) = J and, on C, T is topologically
equivalent to the transformation $w = z^k$ on |z| = 1.

(4.3) Theorem. Let T(A) = B be interior, where A is compact. For any
simple arc pq in B there exists a continuum E in A such that T(E) = pq and, on E,
T is monotone.

Proof. Let us factor T into the form $T_2T_1$, where $T_1$ is monotone and $T_2$ is
light. Let $A' = T_1(A)$. Then, by (1.6), $T_2(A') = B$ is interior and light.
Hence by (4.1) there exists an arc $p_0q_0$ in A' such that $T_2(p_0q_0) = pq$ is topological.
Let $E = T_1^{-1}(p_0q_0)$. Then E is a continuum since $T_1$ is monotone, and
since

$T(E) = T_2T_1(E) = T_2T_1T_1^{-1}(p_0q_0) = T_2(p_0q_0) = pq$

and $T_2$ is topological on $T_1(E) = p_0q_0$, it follows that T(E) = pq is monotone.

Note. Theorem (4.1) also follows directly from this theorem. 

(4.4) Theorem. Let T(A) = B be interior and light, where A is compact.
If K is any locally connected continuum in B and C is any component of $T^{-1}(K)$,
the transformation T(C) = K is interior.

Proof. Let E be any open subset of C. There exists an open subset D of
$T^{-1}(K)$ such that $D \cdot C = E$. Let y be any point of T(E) and let $x \in E \cdot T^{-1}(y)$.
Since $\dim T^{-1}(y) = 0$, there exists a set U open in D which contains x and is
contained in D and such that $F(U) \cdot T^{-1}(y) = 0$. Let $T[F(U)] = K_0$, let R
be the component of $K - K_0$ containing y and let Q be the quasi-component
of $T^{-1}(R)$ containing x. By (1.4), T(Q) = R; and since $T^{-1}(R) \subset T^{-1}(K) - T^{-1}(K_0)
\subset T^{-1}(K) - F(U)$ and $Q \ni x \in U$, we have $Q \subset U \subset D$. Whence,
$Q \subset D \cdot T^{-1}(K)$; and since Q is quasi-connected, this gives $Q \subset C$, so that
$Q \subset D \cdot C = E$. Accordingly $T(E) \supset T(Q) \supset R$, so that T(E) is open in K.


5. Transformations on plane sets. For light interior transformations applied 
to plane locally connected continua we prove the following result concerning 
the invariance of local connectivity under the inverse transformation. 

(5.1) Theorem. Let A be a plane, compact, locally connected continuum
and let T(A) = B be a light interior transformation. Then if K is a locally connected
subcontinuum of B, $T^{-1}(K)$ is locally connected.

Proof. If this is not so, there exists¹¹ a point $p \in T^{-1}(K) = H$, a simple closed
curve J about p with interior E and an infinite sequence $M_1, M_2, \ldots$ of quasi-components
of $E \cdot H$ converging to a continuum M in J + E containing p and
such that $M_1, M_2, \ldots$ lie in distinct components $N_1, N_2, \ldots$ of $H \cdot (E + J)$


11 For details of how to obtain these sets the reader is referred to papers by R. L. Moore
in the Bulletin of the American Mathematical Society, vol. 25 (1919), pp. 174-176, and
vol. 29 (1923), pp. 289-302.













and such that for each i, k > 0, $M_i$ separates $M_{i-k}$ and $M_{i+k}$ in J + E. Since
by hypothesis $T^{-1}T(p)$ is totally disconnected, we may suppose J chosen so that
$J \cdot T^{-1}T(p) = 0$. Let $B_0 = T(A \cdot J)$ and let R be the component of $K - K \cdot B_0$
containing T(p). [Note that T(p) cannot belong to $B_0$, since $J \cdot T^{-1}T(p) = 0$.]

Now by the local connectedness of K, R is open in K. Hence for almost all i,
$T(M_i) \cdot R \ne 0$, and thus we may suppose this holds for all i. Since T(H) = K
is interior [by (1.2)], it follows by (1.4) that each quasi-component of $T^{-1}(R)$
maps onto all of R under T. Since $T^{-1}(R) \cdot J = 0$, it follows at once that there
exists a sequence $L_1, L_2, \ldots$ of distinct quasi-components of $T^{-1}(R)$ converging
to a continuum L where $p \in L \subset M$ and for each i, $L_i \subset M_i$.

Since A is locally connected at p but H is not locally connected at p, it follows
that there exists a component Q of $B - (K + B_0)$ which has a limit point q
in R. Since $T(L_i) = R$ for each i, it follows that for each i there exists a point
$q_i \in T^{-1}(q) \cdot L_i$. Now since, by (1.41), $T^{-1}(Q)$ consists of a finite number of
components of $A - H - T^{-1}(B_0)$ and each $q_i$ is a limit point of $T^{-1}(Q)$, it follows
that there exists some component W of $T^{-1}(Q)$ such that at least three, say
$q_{i_1}, q_{i_2}, q_{i_3}$ ($i_1 < i_2 < i_3$) of the points $q_i$ are limit points of W. Since $T^{-1}(B_0) \supset A \cdot J$
and $q_i \in E$, we have $W \subset E$, and since $H \supset M_i$ for each i, we have $W \cdot M_i = 0$
for each i. But now $M_{i_2}$ separates $M_{i_1}$ and $M_{i_3}$ in J + E, and since $W \subset E$,
$\overline{W} \supset q_{i_1} + q_{i_3}$, this gives $W \cdot M_{i_2} \ne 0$, which is impossible. Thus the supposition
that our theorem is false leads to a contradiction.

(5.11) Corollary. Under the conditions of (5.1) there are only a finite
number of components $C_1, C_2, \ldots, C_n$ of $T^{-1}(K)$ and, for each i, $T(C_i) = K$ is
interior.

It may be shown by simple examples that (5.1) does not remain valid if we
remove either the condition that A lie in a plane or that T be a light transformation.


UNIVERSITY OF VIRGINIA. 











BETTI NUMBERS OF 3-FOLD SYMMETRIC PRODUCTS; A 
CORRECTION 


By M. RICHARDSON


In a previous paper¹ the writer gave a general method for computing the
Betti numbers of a complex k obtained by identifying the points of a given 
complex which are congruent under a finite group G of transformations subject 
to certain combinatorial conditions. The Betti numbers of k appeared as the 
ranks* of certain matrices (27;). As one of the applications of this general 
method, the Betti numbers of the 3-fold symmetric product $k_3$ of a given
complex K were computed and the results published without proof.³ In
calculating the rank of the matrix (z7;) for this case, an error was made, which 
invalidates the formulas of Theorem 5. The correct formulas are given here. 

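Theorem. If k is the 3-fold symmetric product of K, then

$$R_m(k) = \begin{cases} \tfrac{1}{6}\left[R_m(K_3) + 2R_s + 3\sum_i (-1)^i R_i R_{m-2i}\right], & m = 3s, \\ \tfrac{1}{6}\left[R_m(K_3) + 3\sum_i (-1)^i R_i R_{m-2i}\right], & m \ne 3s, \end{cases}$$

where $R_a = R_a(K)$, and $K_3 = K \times K \times K$.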

Remark. If in the mechanical application of these formulas, a term happens
to contain $R_a(K)$, where a > n, or a < 0, this term is to be ignored.
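
The mechanical application mentioned in the Remark may be sketched as follows. This is a hedged illustration, not the author's computation: it assumes the formulas are read as one sixth of $R_m(K_3) + 3\sum_i (-1)^i R_i R_{m-2i}$, with the extra term $2R_s$ when m = 3s, and it assumes that the Betti numbers of $K_3 = K \times K \times K$ are obtained from those of K by the product (Künneth) rule. The function names are hypothetical.

```python
def convolve(a, b):
    """Coefficients of the product of two Poincare polynomials."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        for j, bj in enumerate(b):
            out[i + j] += ai * bj
    return out


def sym3_betti(R):
    """Betti numbers of the 3-fold symmetric product from R = [R_0, ..., R_n].

    Terms R_a with a > n or a < 0 are ignored, as in the Remark.
    """
    n = len(R) - 1

    def r(a):
        return R[a] if 0 <= a <= n else 0

    cube = convolve(convolve(R, R), R)  # assumed Betti numbers of K x K x K
    result = []
    for m in range(3 * n + 1):
        total = cube[m] + 3 * sum((-1) ** i * r(i) * r(m - 2 * i)
                                  for i in range(m // 2 + 1))
        if m % 3 == 0:
            total += 2 * r(m // 3)
        if total % 6 != 0:
            raise ValueError("formula did not produce an integer")
        result.append(total // 6)
    return result


circle = sym3_betti([1, 1])        # K a circle
sphere = sym3_betti([1, 0, 1])     # K a 2-sphere
```

Under this reading the circle gives the Betti numbers of a circle again, and the 2-sphere gives those of complex projective 3-space, both of which agree with the known symmetric products of these spaces.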

We note also the following correction to the cited paper. Theorem 1 as stated 
is valid for vu = 1. For u > 1, a slightly modified argument proves that 


R,.(k, x“) = the rank mod + of (x;;) with elements reduced mod z, (i = 1, --- , 8; 
j =1,---,7), p not divisible by x. This change does not affect later develop- 
ments. 


INSTITUTE FOR ADVANCED STUDY.


Received October 14, 1936. The existence of an error in the formulas was called to 
the author's attention by R. J. Walker. S. H. Kimball had previously and independently
noted the error and obtained the correct formulas. The author had the advantage of 
seeing his results while preparing this note. 

1 On the homology characters of symmetric products, this Journal, vol. 1 (1935), pp. 50-69.
For definitions, etc., see that paper.

2 Loc. cit., Theorem 1, pp. 52-53.

3 Loc. cit., Theorem 5, p. 61. 














THE EXPANSION THEORY OF ORDINARY DIFFERENTIAL SYSTEMS 
OF THE FIRST ORDER 


By RUDOLPH E. LANGER


1. Introduction. It is a sufficiently curious fact that in the extensive litera- 
ture of the expansion of arbitrary functions in series of characteristic solutions 
of ordinary linear differential systems almost no works dealing with the case 
of the differential systems of the first order are to be found.¹ This is the more
remarkable since in this case the integration of the differential system is possible, 
and much of the analysis which invariably encumbers the discussions of systems 
of higher order is thereby obviated. To be sure, the Fourier’s series, which 
stands as a prototype in this field, is usually associated with a differential system 
of the second order, and so the systems of higher order would naturally have 
suggested themselves as generalizations. Even for purposes of generalization, 
however, the system of the first order also merits attention, for because of 
the relative simplicity of its analysis a material generalization becomes possible 
in the way of a relaxation of restrictive hypotheses. This is found to be far 
from trivial. The theory of its expansions, as it is to be found in the literature, 
involves in fact a number of striking peculiarities, in which it contrasts sharply 
even with the expansion theories of the most closely analogous differential 
systems of the second order. Thus, by way of instance, one and the same 
expansion may be generated by an infinity of essentially distinct (i.e., non- 
equivalent) functions; and again the expansion of a given function, though it 
may converge, only rarely converges to the function immediately concerned. 

It is the purpose of the present paper to present here a new expansion theory 
for the differential systems of the first order, one which differs from that in the 
literature and is believed to have advantages over it. It will be found to 
permit, on the one hand, of a material further relaxation of the restrictions 
upon the system, and to lead, on the other hand, to results which, on the whole, 
are much more nearly in consonance with those which obtain in the existing 
theories for systems of the second or higher orders. In particular, it will be 
found that the formal association of a function with an expansion is in a suitable 
sense unique, and that under quite customary conditions an expansion converges 
to the function with which it is formally associated. Even so, to be sure, some 
distinctive peculiarities persist. These will generally be recognizable, however, 
as inherent in the nature of the case. 


Received April 30, 1937.
1 A notable exception is M. H. Stone, An unusual type of expansion problem, Transactions
of the American Mathematical Society, vol. 26 (1924), p. 335.




2. The differential system. The differential system to be considered, the 
general real differential system of the first order, is of the form 


\[
\begin{aligned}
y'(s) - [\rho q(s) + r(s)]\,y(s) &= 0,\\
\alpha y(a) &= \beta y(b).
\end{aligned}
\tag{1}
\]


The parameter, denoted by ρ, is to be free to range over all complex values.
On the other hand, the variable s, the functions q(s) and r(s), and the constants
α and β are to be real. Beyond this q(s) and r(s) are to be single-valued and
summable in the sense of Lebesgue over the interval (a, b), and q(s) is to fulfill
a hypothesis of which the explicit statement is deferred to §3 below.

As applied to the differential equation of the system (1) the term “solution”
will be understood to designate a function which is an indefinite Lebesgue
integral and which satisfies the differential equation almost everywhere on the
interval (a, b). In this sense the equation is solved by the formula

\[
y(s) = c\, e^{\int_a^s [\rho q(s) + r(s)]\,ds}, \tag{2}
\]

with c an arbitrary constant. Inasmuch as any two solutions must be linearly
dependent, the differential equation determining their Wronskian to be zero, it
follows that the formula (2) includes all solutions.

A solution of the differential equation which is not identically zero but which 
satisfies the boundary condition is to be called a characteristic solution of the 
differential system. The substitution of the form (2) into the boundary condi- 
tion yields for the existence of such a characteristic solution the condition 


\[
\alpha = \beta\, e^{\rho \int_a^b q(s)\,ds + \int_a^b r(s)\,ds}.
\]

In general this condition restricts ρ to an isolated set of “characteristic values”.
The exceptional cases are: first, that in which both α and β are zero, the
condition then being obviously vacuous; second, that in which one but not both of
the constants α, β is zero, the condition then being clearly impossible; and,
third, that in which ∫_a^b q(s) ds is zero, the condition then being independent of ρ.

To exclude these exceptional cases it will be assumed that

\[
\alpha\beta \int_a^b q(s)\,ds \neq 0. \tag{3}
\]


When this hypothesis is fulfilled the differential system may be normalized as 
follows. 
The relation

\[
|\alpha| = |\beta|\, e^{\nu \int_a^b q(s)\,ds + \int_a^b r(s)\,ds}
\]

determines the real constant ν. With it the change of variable

\[
y(s) = w(s)\, e^{\int_a^s [\nu q(s) + r(s)]\,ds}
\]

transforms the differential system (1) into the form

\[
\begin{aligned}
w'(s) - [\rho - \nu]\,q(s)\,w(s) &= 0,\\
w(a) &= \left|\frac{\alpha}{\beta}\right| \frac{\beta}{\alpha}\, w(b).
\end{aligned}
\]

The substitutions

\[
x = \frac{s - a}{b - a}, \qquad
\lambda = (\rho - \nu) \int_a^b q(s)\,ds, \qquad
p(x) = \frac{(b - a)\, q(s)}{\int_a^b q(s)\,ds}, \qquad
u(x) = w(s)
\]

complete the normalization, reducing the system (1) to the form

\[
u'(x) - \lambda p(x) u(x) = 0, \qquad u(0) = u(1), \tag{4.1}
\]

if αβ > 0, and to the form

\[
u'(x) - \lambda p(x) u(x) = 0, \qquad u(0) = -u(1), \tag{4.2}
\]

if αβ < 0. The function p(x) is summable over the interval (0, 1) and

\[
\int_0^1 p(x)\,dx = 1. \tag{5}
\]
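
The normalization can be checked numerically. The following is a minimal sketch, not part of the paper: the functions q and r, the interval, and the constants are hypothetical, and the helper names are invented for illustration. It recovers the constant ν from the relation above, maps the characteristic values λ_n = 2nπi (for αβ > 0) or λ_n = (2n - 1)πi (for αβ < 0) of the normalized system back to values of ρ, and evaluates the residual of the boundary condition of system (1).

```python
import cmath
import math


def integrate(f, a, b, steps=2000):
    """Composite midpoint rule for a real function on (a, b)."""
    h = (b - a) / steps
    return h * sum(f(a + (k + 0.5) * h) for k in range(steps))


def nu_constant(q, r, a, b, alpha, beta):
    """Solve |alpha| = |beta| * exp(nu * int(q) + int(r)) for the real constant nu."""
    Q = integrate(q, a, b)
    R = integrate(r, a, b)
    return (math.log(abs(alpha) / abs(beta)) - R) / Q


def characteristic_value(n, q, r, a, b, alpha, beta):
    """rho_n obtained from lambda_n through lambda = (rho - nu) * int(q)."""
    Q = integrate(q, a, b)
    nu = nu_constant(q, r, a, b, alpha, beta)
    if alpha * beta > 0:
        lam = 2j * math.pi * n              # normalized system (4.1)
    else:
        lam = 1j * math.pi * (2 * n - 1)    # normalized system (4.2)
    return nu + lam / Q


def boundary_residual(rho, q, r, a, b, alpha, beta):
    """alpha - beta * exp(rho * int(q) + int(r)); it vanishes at a characteristic value."""
    Q = integrate(q, a, b)
    R = integrate(r, a, b)
    return alpha - beta * cmath.exp(rho * Q + R)


# Hypothetical data, chosen only for illustration (int(q) is not zero here).
q = lambda s: math.cos(s) + 0.5
r = lambda s: s
a, b = 0.0, 2.0

for alpha, beta in [(2.0, 3.0), (2.0, -3.0)]:
    for n in (1, 2, 3):
        rho = characteristic_value(n, q, r, a, b, alpha, beta)
        print(alpha, beta, n, abs(boundary_residual(rho, q, r, a, b, alpha, beta)))
```

The printed residuals should be at the level of rounding error, for either sign of αβ.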


3. The hypothesis. If there exists on (0, 1) some sub-interval, say δ, upon
which the function p(x) is essentially of one sign,2 then on this sub-interval the
function

\[
P(x) = \int_0^x p(x)\,dx \tag{6}
\]

is monotone and, of course, continuous. The transformation

\[
t = P(x) \tag{7}
\]

accordingly maps the interval δ in a unique and continuous manner upon a
corresponding interval, say ω, of the t-axis. If δ is taken to be directed positively,
ω will also be directed, positively or negatively according as on the interval δ
the function p(x) is essentially positive or essentially negative. Finally, on ω
the transformation (7) has a unique inverse

\[
x = P^{-1}(t), \tag{8}
\]

in which P^{-1}(t) is the Lebesgue integral of a summable function which
essentially maintains its sign.3

2 The term “essentially” will be used throughout the discussion in the sense of “almost
everywhere”.

The hypothesis to which the given differential system when normalized, i.e., 
in the form (4.1) or (4.2), is to be subjected is the following: 
There shall exist on the fundamental interval (0, 1) at least one open point-set,
to be designated by Δ, upon which the coefficient function p(x) fulfills the requirements:
(i) that it be essentially of one sign upon each of the intervals comprising the set Δ;
(ii) that for almost every value of t on the range 0 < t < 1 the congruence

\[
P(x) \equiv t \pmod 1
\]

be fulfilled by some x on Δ, but that for each value of t it be fulfilled by at most
one x on Δ.

The point-set Δ, being open, consists of enumerably many non-overlapping
open intervals. These are to be designated by δ_j. The hypothesis (i) insures
that under the transformation (7) there corresponds to each interval δ_j an
interval ω_j on the t-axis. The hypothesis (ii) thereupon insures that these
intervals ω_j are non-overlapping, and that the set of them as a whole would be
obtainable by making a suitable sub-division of the interval 0 < t < 1, and
then at most relocating some or all of the individual sub-intervals by translating
them through appropriate integral multiples of the unit distance. To avoid
meaningless details it will be supposed that the designation of intervals δ_j is
so made that the function p(x) is essentially of opposite signs upon any two
such intervals which abut, or, stated differently, that any number of abutting
component intervals of the set Δ upon which the function p(x) is essentially
of the same sign will be regarded as comprising together one and the same
interval δ_j. The intervals ω_j will be collectively designated as the point-set Ω.
Since the transformation (8) exists on each individual interval ω_j, it evidently
exists over the entire set Ω, and the point-sets Ω and Δ are the transforms of
each other under the relation (7) or its inverse (8).
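
Condition (ii) of the hypothesis can be tested mechanically when p(x) is piecewise constant. The sketch below is an illustration only: the particular p(x), the candidate sets, and the function names are hypothetical. It assumes that every interval of a candidate set lies inside a single piece of p(x), so that condition (i) holds automatically, and it checks whether the images of the intervals under t = P(x), reduced modulo 1, fill the interval (0, 1) without gaps or overlaps.

```python
import math
from fractions import Fraction as Fr

# Hypothetical piecewise-constant p(x) on (0, 1): (left end, right end, value).
# Its integral over (0, 1) is 1, as the normalization (5) requires.
PIECES = [(Fr(0), Fr(1, 2), Fr(2)),
          (Fr(1, 2), Fr(3, 4), Fr(-2)),
          (Fr(3, 4), Fr(1), Fr(2))]


def P(x):
    """P(x) = integral of p from 0 to x, computed exactly."""
    total = Fr(0)
    for left, right, value in PIECES:
        if x <= left:
            break
        total += value * (min(x, right) - left)
    return total


def reduce_mod_one(lo, hi):
    """Cut the interval (lo, hi) at the integers and translate the pieces into (0, 1)."""
    pieces = []
    start = lo
    while start < hi:
        k = math.floor(start)
        end = min(hi, Fr(k + 1))
        pieces.append((start - k, end - k))
        start = end
    return pieces


def fulfills_condition_ii(delta):
    """True when the images of the intervals of delta tile (0, 1) modulo 1."""
    pieces = []
    for left, right in delta:
        lo, hi = sorted((P(left), P(right)))
        pieces.extend(reduce_mod_one(lo, hi))
    pieces.sort()
    position = Fr(0)
    for start, end in pieces:
        if start != position:      # a gap or an overlap of positive length
            return False
        position = end
    return position == 1


candidates = {
    "(0, 1/2)": [(Fr(0), Fr(1, 2))],
    "(0, 1/4) and (1/2, 3/4)": [(Fr(0), Fr(1, 4)), (Fr(1, 2), Fr(3, 4))],
    "(0, 1/2) and (3/4, 1)": [(Fr(0), Fr(1, 2)), (Fr(3, 4), Fr(1))],
}
for name, delta in candidates.items():
    print(name, fulfills_condition_ii(delta))
```

For this p(x) the first two candidates are admissible and the third is not, since the images of its two intervals overlap. More than one admissible set therefore exists for the same system.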

The simplest type of differential system fulfilling the hypothesis stated is that
in which p(x) is essentially positive throughout.4 The point-set Δ consists in
this case simply of the interval 0 < x < 1. Because of its simplicity this type
of system fails to display the peculiarities which are distinctive of the more
general cases. A case which is still simple for the present discussion, though
not amenable to earlier ones, is that in which the point-set Δ does not
completely cover the interval (0, 1), the function p(x) being essentially of one
sign on Δ, but not essentially of either sign on any interval which is contained
in the complement of Δ. In this case as in the preceding one the point-set Δ is
unique. In the general case, however, that is not so, the hypothesis on Δ
being in fact generally satisfied by infinitely many different point-sets. This is
always so when the total variation of the function P(x) over those sub-intervals
of (0, 1) on which it is monotone exceeds the unit value.

3 Cf. M. H. Stone, loc. cit., p. 343.
4 This was completely treated by M. H. Stone, loc. cit.

The distinguishing features of the present hypotheses as compared with those 
of the earlier theory applying to the same differential systems may be sum- 
marized briefly as follows: 

(i) the number of intervals constituting the set Δ is not restricted to be finite
but may be enumerably infinite;

(ii) a set of intervals Δ need not completely cover the fundamental interval;

(iii) the function p(x) is subject to no hypothesis other than that of
summability over the point-set complementary to Δ;

(iv) the function p(x) is not restricted to be bounded.


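4. The relation of “orthogonality”. Since the differential systems (4.1) and
(4.2) are both explicitly integrable, their characteristic values and solutions
are directly obtainable. These are found to be for the system (4.1)

\[
\begin{aligned}
\lambda_0 &= 0, & u_0(x) &= 1,\\
\lambda_{\pm n} &= \pm 2n\pi i, & u_{\pm n}(x) &= e^{\lambda_{\pm n} P(x)}, \qquad n = 1, 2, 3, \cdots,
\end{aligned}
\tag{9.1}
\]

and for the system (4.2)

\[
\lambda_{\pm n} = \pm (2n - 1)\pi i, \qquad
u_{\pm n}(x) = e^{\lambda_{\pm n} P(x)}, \qquad n = 1, 2, 3, \cdots.
\tag{9.2}
\]

In either case the set of solutions as a whole has the property of satisfying
the relations

\[
\int_0^1 p(x)\, u_h(x)\, u_k(x)\,dx =
\begin{cases}
0, & \text{if } h \neq -k,\\
1, & \text{if } h = -k.
\end{cases}
\]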


This property of weighted normality and orthogonality with respect to the 
interval determined by the points at which the boundary condition applies is, 
of course, wholly analogous to those which obtain for the characteristic solu- 
tions of differential systems generally, and which have been made fundamental 
to many deductions of the associated expansion theories. Precisely this prop- 
erty, however, will find no application in the present discussion, and it is 
principally in this point that the present theory for systems of the first order 
departs from that which is to be found in the literature. 

The point-set Ω will easily be recognized to be of such a form that a function
defined upon it may still consistently be specified to be periodic or quasi-periodic
with respect to the unit distance. Such a specification moreover then extends
the definition of the function to almost all values of t. It will be convenient to
reserve the notations F_1(t) and F_{-1}(t) to designate respectively functions whose
definitions on Ω are extended by the relations

\[
\text{(a)}\quad F_1(t + 1) = F_1(t), \qquad
\text{(b)}\quad F_{-1}(t + 1) = -F_{-1}(t).
\tag{10}
\]

In the case of a function of the type F_1(t) the following will readily be verified,
namely, that if the function is summable, then

\[
\sum_j \int_{|\omega_j|} F_1(t)\,dt = \int_0^1 F_1(t)\,dt, \tag{11}
\]

the symbol |ω_j| designating the interval ω_j redirected if necessary so that its
sense is positive. At the same time it is known5 that in virtue of the
transformation (7)

\[
\int_{\omega_j} F_1(t)\,dt = \int_{\delta_j} p(x)\, F_1(P(x))\,dx, \qquad j = 1, 2, 3, \cdots,
\]

and that conversely, if p(x)f(x) is summable over Δ, then under the
transformation (8)

\[
\int_{\delta_j} p(x) f(x)\,dx = \int_{\omega_j} f(P^{-1}(t))\,dt, \qquad j = 1, 2, 3, \cdots.
\]

Since in the case of each j the replacement of ω_j by |ω_j| may be compensated
for by the replacement of p(x) by |p(x)| on δ_j, it will be seen that the relations
above when summed with respect to j lead in virtue of the formula (11) to the
result

\[
\int_0^1 F_1(t)\,dt = \int_\Delta |p(x)|\, f(x)\,dx, \tag{12}
\]

the existence of either integral implying that of the other, and the functions
involved being related thus:

\[
f(x) = F_1(P(x)) \ \text{on } \Delta, \qquad
F_1(t) = f(P^{-1}(t)) \ \text{on } \Omega, \quad \text{and} \quad F_1(t + 1) = F_1(t).
\tag{13}
\]

Now whether the differential system in question be (4.1) or (4.2), it is seen
from the appropriate formulas (9.1) or (9.2) that for every choice of the
subscripts h and k, and as a function of t, the product {u_h(x)u_k(x)}_{x = P^{-1}(t)} admits
the unit as a period, and that

\[
\int_0^1 \{u_h(x)\, u_k(x)\}_{x = P^{-1}(t)}\,dt =
\begin{cases}
0, & \text{if } h \neq -k,\\
1, & \text{if } h = -k.
\end{cases}
\]

It follows, therefore, from (12) that

\[
\int_\Delta |p(x)|\, u_h(x)\, u_k(x)\,dx =
\begin{cases}
0, & \text{if } h \neq -k,\\
1, & \text{if } h = -k.
\end{cases}
\tag{14}
\]


It is upon this new relation of orthogonality that the present theory will be based. 
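
The relation (14) lends itself to a numerical check. The following sketch is illustrative only: the piecewise-constant p(x), the open point-set (named DELTA in the code), and all function names are assumptions made for the example. The integral over the point-set is approximated by a midpoint rule, for the characteristic solutions of both normalized systems.

```python
import cmath
import math

# Hypothetical piecewise-constant p(x): (left end, right end, value); integral over (0, 1) is 1.
PIECES = [(0.0, 0.5, 2.0), (0.5, 0.75, -2.0), (0.75, 1.0, 2.0)]
# A point-set fulfilling the hypothesis of section 3 for this p(x): its images under
# t = P(x) are (0, 1/2) and (1/2, 1).
DELTA = [(0.0, 0.25), (0.5, 0.75)]


def p(x):
    for left, right, value in PIECES:
        if left <= x < right:
            return value
    return PIECES[-1][2]


def P(x):
    """P(x) = integral of p from 0 to x."""
    total = 0.0
    for left, right, value in PIECES:
        if x <= left:
            break
        total += value * (min(x, right) - left)
    return total


def lam(n, periodic=True):
    """Characteristic value: 2n*pi*i for (4.1), +-(2|n| - 1)*pi*i for (4.2)."""
    if periodic:
        return 2j * math.pi * n
    if n == 0:
        raise ValueError("system (4.2) has no characteristic value of index 0")
    return math.copysign(2 * abs(n) - 1, n) * 1j * math.pi


def u(n, x, periodic=True):
    return cmath.exp(lam(n, periodic) * P(x))


def weighted_product(h, k, periodic=True, steps=2000):
    """Midpoint approximation of the integral over DELTA of |p| u_h u_k."""
    total = 0.0
    for left, right in DELTA:
        width = (right - left) / steps
        for m in range(steps):
            x = left + (m + 0.5) * width
            total += abs(p(x)) * u(h, x, periodic) * u(k, x, periodic) * width
    return total


print(abs(weighted_product(2, -2)))          # close to 1
print(abs(weighted_product(2, 3)))           # close to 0
print(abs(weighted_product(1, -1, False)))   # close to 1
print(abs(weighted_product(1, 2, False)))    # close to 0
```

Note that the weight is |p(x)| and the range of integration is the point-set alone, not the whole interval (0, 1).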


5 Cf. M. H. Stone, loc. cit., or E. W. Hobson, The Theory of Functions of a Real Variable, 
second edition, 1921, pp. 592-595. 







5. The expansion of an “arbitrary” function. The expansions of a given
function f(x) in series of characteristic solutions differ somewhat in their details
for the systems (4.1) and (4.2). Let the attention be focused first, therefore,
upon the system (4.1). If f(x) is a function given arbitrarily except for the
restriction that p(x)f(x) be summable over the point-set Δ, it may be regarded
as formally associated with a series of characteristic solutions in the manner

\[
f(x) \sim a_0 u_0(x) + \sum_{n=1}^{\infty} \{a_n u_n(x) + a_{-n} u_{-n}(x)\}. \tag{15}
\]

The operations of multiplying this by |p(x)| u_{-n}(x) and integrating term by term
over Δ lead formally, and because of the relations (14), to a familiar “evaluation”
of coefficients, i.e.,

\[
a_n = \int_\Delta |p(x)|\, f(x)\, u_{-n}(x)\,dx. \tag{16}
\]

With these coefficients, and with the explicit forms of the solutions as given
by the formulas (9.1), the relation (15) takes the form

\[
\begin{aligned}
f(x) &\sim A_0 + \sum_{n=1}^{\infty} \{A_n \cos 2n\pi P(x) + B_n \sin 2n\pi P(x)\},\\
A_0 &= \int_\Delta |p(x)|\, f(x)\,dx,\\
A_n &= 2 \int_\Delta |p(x)|\, f(x) \cos 2n\pi P(x)\,dx,\\
B_n &= 2 \int_\Delta |p(x)|\, f(x) \sin 2n\pi P(x)\,dx.
\end{aligned}
\tag{17}
\]

Let F_1(t) now designate the periodic function associated with f(x) by the
relation (13). The transformation (8) is found then, in virtue of the formula
(12), to convert the expansion (17) into the Fourier’s series for F_1(t). The
heuristic motivation for this deduction may now, of course, be abandoned.
Whenever f(x) is a function such that p(x)f(x) is summable over Δ the formulas
(16) associate with it a set of coefficients and thereby an expansion (15). The
result deduced above may be formulated then as follows:

THEOREM. If p(x)f(x) is summable over the point-set Δ, there is associated
with f(x) an expansion (15), (16) in terms of the characteristic solutions of the
differential system (4.1). This expansion is the transform under the relation
t = P(x) of the Fourier’s series of the function F_1(t) which is related to f(x) in the
manner (13).
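
The theorem can be illustrated numerically. The sketch below is not from the paper: the function p(x), the point-set (DELTA in the code), the function f(x) = x, and every name are hypothetical choices. The coefficients are computed from the formulas (17) by a midpoint rule and a partial sum is evaluated; since the partial sum depends on x only through t = P(x), its values are those of a partial sum of the Fourier series of F_1(t).

```python
import math

# Hypothetical piecewise-constant p(x) and an admissible point-set (see section 3).
PIECES = [(0.0, 0.5, 2.0), (0.5, 0.75, -2.0), (0.75, 1.0, 2.0)]
DELTA = [(0.0, 0.25), (0.5, 0.75)]


def p(x):
    for left, right, value in PIECES:
        if left <= x < right:
            return value
    return PIECES[-1][2]


def P(x):
    """P(x) = integral of p from 0 to x."""
    total = 0.0
    for left, right, value in PIECES:
        if x <= left:
            break
        total += value * (min(x, right) - left)
    return total


def coefficients(f, n_max, steps=1000):
    """A_0, A_n, B_n of (17), by the midpoint rule on each interval of DELTA."""
    samples = []                      # pairs (t, |p(x)| f(x) dx) with t = P(x)
    for left, right in DELTA:
        width = (right - left) / steps
        for m in range(steps):
            x = left + (m + 0.5) * width
            samples.append((P(x), abs(p(x)) * f(x) * width))
    A0 = sum(w for _, w in samples)
    A = [2 * sum(w * math.cos(2 * math.pi * n * t) for t, w in samples)
         for n in range(1, n_max + 1)]
    B = [2 * sum(w * math.sin(2 * math.pi * n * t) for t, w in samples)
         for n in range(1, n_max + 1)]
    return A0, A, B


def partial_sum(x, A0, A, B):
    t = P(x)
    return A0 + sum(a * math.cos(2 * math.pi * n * t) + b * math.sin(2 * math.pi * n * t)
                    for n, (a, b) in enumerate(zip(A, B), start=1))


A0, A, B = coefficients(lambda x: x, 60)
print(partial_sum(0.1, A0, A, B))   # x = 0.1 lies in DELTA; the value is near f(0.1) = 0.1
print(partial_sum(0.6, A0, A, B))   # x = 0.6 lies in DELTA; the value is near f(0.6) = 0.6
```

With sixty terms the agreement at interior points of the point-set is only to two or three decimals, as is to be expected of a Fourier series of a function with jumps.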

The system (4.2) may be similarly considered. If the expansion associated
with f(x) is indicated thus,

\[
f(x) \sim \sum_{n=1}^{\infty} \{a_n u_n(x) + a_{-n} u_{-n}(x)\}, \tag{18}
\]

the coefficients are found, in the manner above, to be again evaluated by the
formulas (16). With these coefficients and the explicit forms (9.2) of the
characteristic solutions the expansion takes the form

\[
\begin{aligned}
f(x) &\sim \sum_{n=1}^{\infty} \{A_n \cos (2n - 1)\pi P(x) + B_n \sin (2n - 1)\pi P(x)\},\\
A_n &= 2 \int_\Delta |p(x)|\, f(x) \cos (2n - 1)\pi P(x)\,dx,\\
B_n &= 2 \int_\Delta |p(x)|\, f(x) \sin (2n - 1)\pi P(x)\,dx.
\end{aligned}
\tag{19}
\]

Let F_{-1}(t) designate now the quasi-periodic function to which f(x) corresponds
under the transformation (8), i.e.,

\[
F_{-1}(t) = f(P^{-1}(t)) \ \text{on } \Omega, \qquad F_{-1}(t + 1) = -F_{-1}(t). \tag{20}
\]

Then under this transformation the expansion (19) is found to become the
Fourier’s series for the function F_{-1}(t).

THEOREM. If p(x)f(x) is summable over the point-set Δ, there is associated with
f(x) an expansion (18), (16) in terms of the characteristic solutions of the differential
system (4.2). This expansion is the transform under the relation t = P(x) of
the Fourier’s series of the function F_{-1}(t) which is related to f(x) in the manner (20).


6. Conclusions. Inasmuch as an arbitrary function and its expansion in 
characteristic solutions transform together into a Fourier’s series and its gen- 
erating function, it follows that these expansions admit of a theory which is 
coextensive with that of the Fourier’s series. Moreover, the facts of this theory 
relative to convergence, summability, etc., either at points or over intervals,
are evidently deducible simply by appropriate considerations of the manner 
in which the transformation (7), (8) affects the relative properties of the 
Fourier’s series. In the present paper these considerations will be entered into 
only to the extent which will suffice to emphasize the differences between the 
present theory and the theory as given heretofore. The theorems chosen and 
given below differ sharply in these two theories, and in the present one are 
much more nearly in consonance with their analogues in the existing theories 
for differential systems of the second or higher orders. 

Consider any function f(x) and its expansion relative to a point-set Δ. Under
the relation (7) they are transformed into the associated function F_1(t) (or
F_{-1}(t), as the case may be) and its Fourier’s series. Now if x is any point of
the set Δ, then since this set is open it lies upon some sub-interval which is a
neighborhood of it and which is wholly included in one of the intervals δ_j.
Under the relation (7) this point x and this neighborhood correspond to a
point t and a neighborhood of it. As is familiar, however, the behavior of the
Fourier’s series of F_1(t) (or F_{-1}(t)) at the point t is completely determined by
the values of this function in such a neighborhood. Under the return
transformation (8) this leads to the

THEOREM. If p(x)f(x) is summable over the point-set Δ, the behavior of the
expansion of f(x) in terms of the characteristic solutions of either of the differential
systems (4.1) or (4.2) is determined at any point of Δ by the values of f(x) in an
arbitrarily small neighborhood of that point.

Since the function P(x) is monotone on each of the intervals δ_j, it follows that
every point of Δ is possessed of some neighborhood upon which the
transformation (7) is monotone. If upon such a neighborhood the function f(x) is also
monotone, or, more generally, of bounded variation, the same will, therefore,
be true of the associated function F_{±1}(t) upon some interval which includes the
corresponding point t. Under this condition, however, the Fourier’s series is
known to converge at the point t to the value ½[F_{±1}(t+) + F_{±1}(t−)], a fact
which leads to the

THEOREM. If p(x)f(x) is summable over the point-set Δ, then at any point x
of Δ in some neighborhood of which the function f(x) is of bounded variation, the
expansion of f(x) in terms of the characteristic solutions of either of the differential
systems (4.1) or (4.2) converges to the value

\[
\tfrac{1}{2}\,[f(x+) + f(x-)].
\]


Consider now the case of a point x_1 which does not belong to the point-set Δ,
but which is a boundary point of it. Under the transformation (7) it
corresponds to a point t_1 which is a boundary point of Ω. The behavior of the
Fourier’s series of either of the functions F_1(t) or F_{-1}(t) at the point t_1 is
completely determined locally, i.e., by the values of the function in any ordinary
neighborhood of the point. In virtue of their definitions (13) and (20),
however, these functions are themselves determined in any ordinary neighborhood
of t_1 by their values on Ω, and this is immediately recognized to mean by their
values on “a neighborhood of t_1 in Ω”, a term which by definition is to signify
that set of intervals ω_j, and parts of such, which are congruent (mod 1) to the
parts of an ordinary neighborhood of t_1. Under the return transformation (8)
a neighborhood of t_1 in Ω corresponds to “a neighborhood of x_1 in Δ”, a term
which is to be analogously defined to designate that set of intervals δ_j, and
parts of such, upon which the function P(x) takes on values which are congruent
(mod 1) to values in a neighborhood of P(x_1). This leads to the

THEOREM. If f(x) is any function such that p(x)f(x) is summable over the
point-set Δ, and if x_1 is any boundary point of Δ, then the behavior of the expansion
of f(x) in terms of the characteristic solutions of either of the differential systems
(4.1) or (4.2) at x_1 is determined by the values of f(x) in an arbitrarily small
neighborhood of x_1 in Δ.

The simplest case of a boundary point is that of a point, say x_i, which is an
end point of an interval δ_i, and which has another such end point, say x_j of
an interval δ_j, corresponding to it under the congruence P(x_j) ≡ P(x_i) (mod 1).
In this case the neighborhoods of x_i in Δ, and of x_j in Δ coincide, and if they
are sufficiently small they consist simply of the pair of intervals δ_i, δ_j or parts
of them which abut x_i and x_j respectively. It is seen at once that if f(x) is of
bounded variation in such a neighborhood of x_i in Δ, then F_{±1}(t) is of bounded
variation in the neighborhood of t_i. The Fourier’s series accordingly
converges to ½[F_{±1}(t_i+) + F_{±1}(t_i−)], i.e., in the case of F_1(t), to

\[
\tfrac{1}{2}\Bigl[\lim_{t \to t_i} F_1(t) + \lim_{t \to t_j} F_1(t)\Bigr]_{t \text{ on } \Omega},
\]

and in the case of F_{-1}(t) to

\[
\tfrac{1}{2}\Bigl[\lim_{t \to t_i} F_{-1}(t) + (-1)^k \lim_{t \to t_j} F_{-1}(t)\Bigr]_{t \text{ on } \Omega},
\]

where k is the integer which is determined by the relation t_j = t_i + k.

THEOREM. Let f(x) be any function such that p(x)f(x) is summable over the
point-set Δ, and let x_i and x_j be any pair of end points of intervals of Δ which are
related so that P(x_j) = P(x_i) + k, where k is an integer. Then if f(x) is of
bounded variation in an arbitrarily small neighborhood of x_i in Δ, its expansion in
terms of the characteristic solutions of the differential system (4.1) converges at x_i
to the value

\[
\tfrac{1}{2}\Bigl[\lim_{x \to x_i} f(x) + \lim_{x \to x_j} f(x)\Bigr]_{x \text{ on } \Delta},
\]

and its expansion in terms of the characteristic solutions of the differential system
(4.2) converges at x_i to the value

\[
\tfrac{1}{2}\Bigl[\lim_{x \to x_i} f(x) + (-1)^k \lim_{x \to x_j} f(x)\Bigr]_{x \text{ on } \Delta}.
\]

If Δ contains intervals which abut the points x = 0 and x = 1, these points
stand in the relation of x_i and x_j to each other, with the integer k as 1 or −1
in virtue of the relation (5). Hence we have the following

COROLLARY. If p(x)f(x) is summable over Δ, if x = 0 and x = 1 are end
points of intervals of Δ, and if f(x) is of bounded variation in some neighborhoods
of these points, then the expansion of f(x) in terms of the characteristic solutions of
the differential system (4.1) converges at x = 0 and at x = 1 to the value

\[
\tfrac{1}{2}\,[f(0+) + f(1-)],
\]

while the expansion of f(x) in terms of the characteristic solutions of the differential
system (4.2) converges at x = 0 to the value

\[
\tfrac{1}{2}\,[f(0+) - f(1-)],
\]

and converges at x = 1 to the value

\[
\tfrac{1}{2}\,[f(1-) - f(0+)].
\]
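
As a worked illustration of the corollary (an example supplied here, not taken from the paper), take the simplest system, p(x) = 1 and hence P(x) = x with the point-set the whole interval 0 < x < 1, and the function f(x) = x. The formulas (17) give

\[
A_0 = \tfrac{1}{2}, \qquad A_n = 0, \qquad B_n = -\frac{1}{n\pi},
\qquad\text{so that}\qquad
x \sim \frac{1}{2} - \sum_{n=1}^{\infty} \frac{\sin 2n\pi x}{n\pi},
\]

and at x = 0 and at x = 1 every sine vanishes, leaving the value 1/2 = ½[f(0+) + f(1−)]. For the system (4.2) the formulas (19) give

\[
A_n = -\frac{4}{(2n-1)^2\pi^2}, \qquad B_n = \frac{2}{(2n-1)\pi}.
\]

At x = 0 the series reduces to

\[
\sum_{n=1}^{\infty} A_n = -\frac{4}{\pi^2} \sum_{n=1}^{\infty} \frac{1}{(2n-1)^2}
= -\frac{4}{\pi^2} \cdot \frac{\pi^2}{8} = -\frac{1}{2} = \tfrac{1}{2}\,[f(0+) - f(1-)],
\]

and at x = 1, where cos (2n − 1)π = −1, it reduces to +1/2 = ½[f(1−) − f(0+)], in agreement with the values stated.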


The function F_1(t) (or F_{-1}(t)) will evidently be null, i.e., will vanish almost
everywhere, if and only if the function f(x) is null on the point-set Δ. The
condition that the difference of two functions be null, which is necessary and
sufficient for the identity of all their Fourier’s coefficients, leads, therefore,
to the

THEOREM. If f_1(x) and f_2(x) are any two functions such that p(x)f_1(x) and
p(x)f_2(x) are summable over Δ, a necessary and sufficient condition that their
expansions in terms of the characteristic solutions of either of the differential systems
(4.1) or (4.2) be identical is that they coincide almost everywhere on Δ.

Since the expansion coefficients as given by the formulas (16) involve the
values of f(x) only over Δ, it is evident that every expansion is strictly relative
to a point-set Δ. For any specifically given function f(x) there will, therefore,
be as many expansions in terms of characteristic solutions of either of the
differential systems (4.1) or (4.2) as there are point-sets Δ which fulfill the
hypotheses stated in §3, and over which p(x)f(x) is summable. Any two such
point-sets Δ may, of course, have points in common, and at such points the
expansions of a given function relative to the two point-sets in question will be
seen at once to behave identically. For since a point common to two sets Δ
is an inner point of each, it possesses a neighborhood which is also contained
in each of the sets. This point and neighborhood correspond to a point on the
t-axis and a neighborhood of it which is contained in each of the corresponding
point-sets Ω. The functional values in this neighborhood suffice, however, for
the determination of the behavior of the Fourier’s series at the point in question,
and the result of this determination is, therefore, independent of which
point-set Ω is held in mind.

The case of a unique point-set Δ was found in §3 to be exceptional. It is
clear that the case of a unique expansion for a function f(x) such that p(x)f(x)
is summable over the interval (0, 1) is exceptional in precisely the same sense.
The case of such a unique expansion relative to the interval (0, 1) occurs only
in connection with the simplest systems, namely, those in which p(x) is almost
everywhere positive. In general, there is a multiplicity of expansions for each
given function, and this feature must be regarded as one in which the expansion
theory for differential systems of the first order as here given is at variance
with the existing analogous theories for differential systems of higher orders.


UNIVERSITY OF WISCONSIN.








TRANSFORMATION OF DIFFERENTIAL EQUATIONS IN THE 
NEIGHBORHOOD OF SINGULAR POINTS 


By C. I. LUBIN

1. Introduction. The singular points of the system of differential equations

\[
\frac{dx}{dt} = X(x, y), \qquad \frac{dy}{dt} = Y(x, y), \tag{1}
\]

where the functions X(x, y) and Y(x, y) both vanish at a point, have been
considered by many mathematicians. Beginning with Briot and Bouquet 
the list includes Poincaré, Picard, Bendixson, and continues with Dulac, Malm- 
quist, Perron, Birkhoff, and many others. A full bibliography, particularly for 
complex differential equations, can be found in Mémorial des Sciences Mathéma- 
tiques, Fascicule LXI, Points singuliers des équations différentielles, by H. M. 
Dulac. 

The discussion here assumes the functions X and Y to be real and analytic
in x and y, in the neighborhood of the singular point. The variables x, y, t
take on only real values, and only real transformations are introduced.
Furthermore, it is assumed that the singular point considered is at the origin and that
the first degree terms in X(x, y) and Y(x, y) are such that the differential
equation can be transformed by a linear transformation to the form1

\[
\begin{aligned}
\frac{dx}{dt} &= -y + \sum_{i,j=0}^{\infty} a_{ij}\, x^i y^j = X(x, y),\\
\frac{dy}{dt} &= x + \sum_{i,j=0}^{\infty} b_{ij}\, x^i y^j = Y(x, y)
\end{aligned}
\qquad (i + j \geq 2).
\tag{1.1}
\]

At such a point the solutions are closed curves or spirals and the point is called
a center or a focal point, respectively.

It is proposed here to find a canonical form for the system (1.1) and to discuss 
the properties of the transformation attaining that form. 


2. Failure of the usual method. It appears from the discussion below that
it is not always possible to set up formal power series

\[
\begin{aligned}
u &= x + \sum c_{ij}\, x^i y^j = f(x, y),\\
v &= y + \sum d_{ij}\, x^i y^j = g(x, y)
\end{aligned}
\tag{2}
\]

which transform formally the system (1.1) into

\[
\frac{du}{dt} = -v, \qquad \frac{dv}{dt} = u. \tag{2.1}
\]

Consequently this form cannot serve as the canonical form for the system of
differential equations (1.1).

Received July 15, 1936.
1 Poincaré, Jour. de Math., (4), vol. 1, p. 172; Picard, Traité d’Analyse, 1928, Chap. IX.

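If there are formal power series (2) leading to the system (2.1), the expressions
f(x, y) and g(x, y) of (2) would formally satisfy the partial differential equations

\[
\begin{aligned}
X(x, y)\,\frac{\partial f}{\partial x} + Y(x, y)\,\frac{\partial f}{\partial y} &= -g,\\
X(x, y)\,\frac{\partial g}{\partial x} + Y(x, y)\,\frac{\partial g}{\partial y} &= f.
\end{aligned}
\tag{2.2}
\]

Introduce the new variables (see Picard and Poincaré, loc. cit.)

\[
x = \rho \cos\theta, \qquad y = \rho \sin\theta. \tag{2.21}
\]

The partial differential equations (2.2) become

\[
\begin{aligned}
\Bigl(1 + \frac{\Omega}{\rho^2}\Bigr)\frac{\partial f}{\partial \theta} + \frac{R}{\rho}\,\frac{\partial f}{\partial \rho} &= -g,\\
\Bigl(1 + \frac{\Omega}{\rho^2}\Bigr)\frac{\partial g}{\partial \theta} + \frac{R}{\rho}\,\frac{\partial g}{\partial \rho} &= f,
\end{aligned}
\tag{2.3}
\]

where Ω and R are given by

\[
\begin{aligned}
\Omega &= \sum_{i=3}^{\infty} \rho^i\, \Omega_i(\theta) = \rho\,(-\sin\theta\, \bar X + \cos\theta\, \bar Y),\\
R &= \sum_{i=3}^{\infty} \rho^i\, R_i(\theta) = \rho\,(\cos\theta\, \bar X + \sin\theta\, \bar Y),
\end{aligned}
\tag{2.31}
\]

and

\[
\begin{aligned}
\bar X &= X(\rho\cos\theta, \rho\sin\theta) + \rho\sin\theta = \sum_{i=2}^{\infty} \rho^i X_i(\theta),\\
\bar Y &= Y(\rho\cos\theta, \rho\sin\theta) - \rho\cos\theta = \sum_{i=2}^{\infty} \rho^i Y_i(\theta).
\end{aligned}
\]

In the above expressions Ω_i and R_i, and X_i and Y_i stand for homogeneous
polynomials of degree i in sin θ and cos θ, and consequently can be written as
linear expressions in cos iθ, sin iθ, etc., i.e., in the form

\[
a_i \cos i\theta + a_{i-2} \cos (i - 2)\theta + \cdots + b_i \sin i\theta + b_{i-2} \sin (i - 2)\theta + \cdots. \tag{2.32}
\]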








The formal expansions (2) become in terms of ρ, sin θ, and cos θ

\[
f = \rho\cos\theta + \sum_{i=2}^{\infty} \rho^i f_i(\theta), \qquad
g = \rho\sin\theta + \sum_{i=2}^{\infty} \rho^i g_i(\theta),
\tag{2.33}
\]

where the f_i and g_i are trigonometric expressions of the type just described,
(2.32). The existence of the formal expansions (2) in terms of x and y implies
the existence of the formal expansions (2.33) in terms of ρ, cos θ, and sin θ, and
conversely, the existence of the formal expansions (2.33) implies the existence
of those of (2).

To obtain the f_i and g_i, substitute the formal series (2.33) in (2.3) and collect
terms in ρ^n. We get

\[
\frac{df_n}{d\theta} = -g_n + l_n, \qquad \frac{dg_n}{d\theta} = f_n + m_n, \tag{2.4}
\]

where l_n and m_n are trigonometric sums of the above type (2.32). Furthermore,
the l_n and m_n involve f_i and g_i only for i less than n.

Now suppose f_1, g_1, ···, f_{n−1}, g_{n−1} have all been computed and are of the
type (2.32). To compute f_n and g_n, set up from (2.4) the differential equation

\[
\frac{d^2 f_n}{d\theta^2} + f_n = \sum_j \bigl(A_j^n \cos j\theta + B_j^n \sin j\theta\bigr)
\qquad (j = n, n - 2, \cdots),
\tag{2.41}
\]

where the A_j^n, B_j^n arise from m_n and l_n and thus involve f_i and g_i only for i less
than n.
The solution of the differential equation (2.41) is

\[
f_n = \alpha_n \cos\theta + \beta_n \sin\theta + \sum_j \frac{A_j^n \cos j\theta + B_j^n \sin j\theta}{1 - j^2}, \tag{2.42}
\]

where α_n and β_n are arbitrary and where j = n, n − 2, ···, 0, or j = n, n − 2,
···, 1, depending on whether n is even or odd.
When j = 1, a case which can only occur when n is odd, and when either
A_1^n ≠ 0 or B_1^n ≠ 0, the solution (2.42) fails. Consequently, in general, the
system of partial differential equations (2.3) does not have a formal solution
in terms of ρ, sin θ, cos θ, of required type. By an expression of required type is
meant a sum of the form (2.33), i.e., Σρ^i f_i(θ), where the f_i(θ) are expressible in
the form (2.32). Thus the partial differential equations (2.2) do not always have
a formal power series solution.
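
The failure for j = 1 can be made explicit by a short computation (added here as an illustration; it is the standard resonance phenomenon and is not spelled out in the text). For the terms in cos θ and sin θ alone, the equation (2.41) reads

\[
\frac{d^2 f}{d\theta^2} + f = A \cos\theta + B \sin\theta ,
\]

and a particular solution is

\[
f = \frac{\theta}{2}\,\bigl(A \sin\theta - B \cos\theta\bigr),
\]

as is verified by differentiating twice:

\[
\frac{d^2}{d\theta^2}\Bigl[\frac{\theta}{2}\, A \sin\theta\Bigr] = A\cos\theta - \frac{\theta}{2}\, A \sin\theta,
\qquad
\frac{d^2}{d\theta^2}\Bigl[-\frac{\theta}{2}\, B \cos\theta\Bigr] = B\sin\theta + \frac{\theta}{2}\, B \cos\theta .
\]

Every solution therefore contains the factor θ outside the trigonometric functions unless A = B = 0, and so cannot be a trigonometric sum of the type (2.32).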

Because of this we modify the partial differential equations and consequently 
obtain a different normal form (v. §4 et seq.). 

Let us note here the form taken by f_n, g_n of the solution, when it exists.
Suppose for all n, say up to and including n′, the solution (2.42) has not failed.
Consider the solution for some odd number n less than n′. The expression
on the right in (2.41) then contains no term in sin θ or cos θ, i.e., both A_1^n and
B_1^n equal zero. If in (2.4)

\[
l_n = a_n \cos\theta + b_n \sin\theta + \cdots,
\]

we have

\[
m_n = -a_n \sin\theta + b_n \cos\theta + \cdots,
\]

where the terms in cos jθ and sin jθ for j > 1 have not been written.
If f_n has the form (2.42), g_n becomes

\[
g_n = -\frac{df_n}{d\theta} + l_n = \alpha_n \sin\theta - \beta_n \cos\theta + a_n \cos\theta + b_n \sin\theta + \cdots, \tag{2.43}
\]

where again only the terms in sin θ, cos θ have been written.


3. A preliminary transformation. In order to simplify subsequent discussion 
a real analytic transformation will be introduced in the theorem below. 

Suppose that the expressions f_i, g_i (i = 1, 2, ···, 2k) in f and g (2.33) when
computed by taking all the arbitrary constants α_i, β_i zero are of required type
but that the expressions for f_{2k+1}, g_{2k+1} fail to be. That is, let us suppose the
differential equation (2.41), for n = 2k + 1, has terms in sin θ or cos θ, i.e.,
A_1^n or B_1^n ≠ 0.

By a method of Dulac,2 we have the following

THEOREM. By a real analytic transformation

\[
\begin{aligned}
w &= x + f_2(x, y) + \cdots + f_{2k+1}(x, y) = f(x, y),\\
z &= y + g_2(x, y) + \cdots + g_{2k+1}(x, y) = g(x, y),
\end{aligned}
\tag{3}
\]

where f_i(x, y), g_i(x, y) are polynomials of degree i in x and y, obtained by substituting
x = ρ cos θ, y = ρ sin θ in ρ^i f_i(θ), ρ^i g_i(θ), (i = 2, 3, ···, 2k), of (2.42) and (2.43)
and in ρ^{2k+1} f_{2k+1}(θ), ρ^{2k+1} g_{2k+1}(θ) of (3.22) below, the differential equations (1.1) are
reduced to

\[
\begin{aligned}
\frac{dw}{dt} &= -z + A w (w^2 + z^2)^k - B z (w^2 + z^2)^k + \cdots = W(w, z),\\
\frac{dz}{dt} &= w + B w (w^2 + z^2)^k + A z (w^2 + z^2)^k + \cdots = Z(w, z),
\end{aligned}
\tag{3.1}
\]

where for the real analytic functions W(w, z) and Z(w, z) terms of higher degree
than (2k + 1) are not written and where A and B are constants, not both zero,
uniquely determined, in (3.41) below.


2 Dulac, Bull. Sci. Math., vol. 51 (1923), p. 74.














398 Cc. I. LUBIN 


Let the differential equations causing the difficulty in the solution of the partial differential equations (2.3) be written

(3.2)
    $\dfrac{df_{2k+1}}{d\theta} = -g_{2k+1} + a\cos\theta + b\sin\theta + l_{2k+1},$
    $\dfrac{dg_{2k+1}}{d\theta} = f_{2k+1} + c\cos\theta + d\sin\theta + m_{2k+1},$

where $l_{2k+1}$, $m_{2k+1}$ contain $\sin j\theta$, $\cos j\theta$ only for $j = 3, 5, \ldots, 2k+1$, and where either $a + d \neq 0$ or $b - c \neq 0$ or both subsist.
Introduce the trigonometric functions $f^*(\theta)$ and $g^*(\theta)$, of the required type (2.32), which satisfy the differential equations

(3.21)    $\dfrac{df^*}{d\theta} = -g^* + l_{2k+1}, \qquad \dfrac{dg^*}{d\theta} = f^* + m_{2k+1},$

where the arbitrary constants of the solution are chosen as zero. Since $l_{2k+1}$ and $m_{2k+1}$ contain no terms in $\sin\theta$ or $\cos\theta$, there are no such terms in $f^*$ or $g^*$.
Now write

(3.22)    $f_{2k+1} = f^* + \alpha\cos\theta, \qquad g_{2k+1} = g^* + \beta\cos\theta,$

where $\alpha$ and $\beta$ are taken as
    $\alpha = -\tfrac{1}{2}(b + c), \qquad \beta = \tfrac{1}{2}(a - d).$


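If we introduce the new variables by the substitution (3), we get as in (2.2)

(3.3)    $\dfrac{dw}{dt} = \dfrac{\partial f}{\partial x}X + \dfrac{\partial f}{\partial y}Y, \qquad \dfrac{dz}{dt} = \dfrac{\partial g}{\partial x}X + \dfrac{\partial g}{\partial y}Y.$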


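In terms of $\rho$ and $\theta$, the expressions on the right side become

(3.31)    $\dfrac{\partial f}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial f}{\partial\rho}\,\dfrac{R}{\rho}, \qquad \dfrac{\partial g}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial g}{\partial\rho}\,\dfrac{R}{\rho},$

where $\Omega$ and $R$ have the significance of (2.31). These quantities when expressed in terms of $\rho$ and $\theta$ are, up to terms of degree $2k+1$ in $\rho$, equal to $-g$ and $f$ respectively. The terms in $\rho^{2k+1}$ in each expression of (3.31) are, respectively,

    $\dfrac{df_{2k+1}}{d\theta} - a\cos\theta - b\sin\theta - l_{2k+1}, \qquad \dfrac{dg_{2k+1}}{d\theta} - c\cos\theta - d\sin\theta - m_{2k+1},$

which, on use of (3.22), become

(3.4)    $-g_{2k+1} + A\cos\theta - B\sin\theta, \qquad f_{2k+1} + B\cos\theta + A\sin\theta,$

where

(3.41)    $A = -\tfrac{1}{2}(a + d), \qquad B = \tfrac{1}{2}(b - c).$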
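
The passage from (3.22) to (3.4) can be checked mechanically. The following is a minimal sketch, not part of the original argument; it assumes the substitution $f_{2k+1} = f^* + \alpha\cos\theta$, $g_{2k+1} = g^* + \beta\cos\theta$ together with (3.21), and the function name `first_harmonic_residuals` is hypothetical. It tracks only the coefficients of $\cos\theta$ and $\sin\theta$ left over in the two terms of degree $2k+1$.

```python
from fractions import Fraction

def first_harmonic_residuals(a, b, c, d):
    """Return the (cos, sin) coefficients left in the two rho^(2k+1) terms
    after the substitution (3.22), together with A and B of (3.41)."""
    a, b, c, d = (Fraction(x) for x in (a, b, c, d))
    alpha = -(b + c) / 2          # choice stated with (3.22)
    beta = (a - d) / 2
    # df/dtheta - a cos - b sin - l = -g_{2k+1} + (beta - a) cos + (-alpha - b) sin
    first = (beta - a, -alpha - b)
    # dg/dtheta - c cos - d sin - m =  f_{2k+1} + (-alpha - c) cos + (-beta - d) sin
    second = (-alpha - c, -beta - d)
    A = -(a + d) / 2              # (3.41)
    B = (b - c) / 2
    return first, second, A, B

first, second, A, B = first_harmonic_residuals(3, 5, -1, 7)
# (3.4) asserts first = (A, -B) and second = (B, A)
print(first == (A, -B), second == (B, A))
```

Exact rational arithmetic is used so that the comparison with (3.4) is an identity rather than a floating-point approximation.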











In terms of $x$ and $y$ the expressions (3.3) become

(3.5)
    $\dfrac{\partial f}{\partial x}X + \dfrac{\partial f}{\partial y}Y = -g + (Ax - By)(x^2 + y^2)^k + \cdots,$
    $\dfrac{\partial g}{\partial x}X + \dfrac{\partial g}{\partial y}Y = f + (Bx + Ay)(x^2 + y^2)^k + \cdots.$

The expressions in (3) can be solved for $x$ and $y$ in terms of $z$ and $w$ and substituted in (3.5), whence we get the differential equations (3.1).


4. Second preliminary transformation. Let us start with the system of differential equations (3.1) and modify the corresponding partial differential equations as shown in (4.3) below in order to take care of the terms in $\rho^{2k+1}$. The system (4.3) so introduced still presents for terms of higher degree in $\rho$ the difficulty found in the last section for the terms in $\rho^{2k+1}$, so that a further transformation is useful.

It is convenient to consider this under two cases with a separate theorem in 
each case. 

Case I. $A \neq 0$.

THEOREM. By a real analytic transformation

(4.1)
    $u = w + h_2(w,z) + \cdots + h_{4k+1}(w,z) = h(w,z),$
    $v = z + q_2(w,z) + \cdots + q_{4k+1}(w,z) = q(w,z),$

where $h_i(w,z)$ and $q_i(w,z)$ are polynomials of degree $i$ obtained by the substitution $w = \rho\cos\theta$, $z = \rho\sin\theta$ in $\rho^i h_i(\theta)$ and $\rho^i q_i(\theta)$ as found below ($i = 2, 3, \ldots, 4k+1$), the differential equations (3.1) are reduced to

(4.2)
    $\dfrac{du}{dt} = -v + (Au - Bv)(u^2 + v^2)^k + Lu(u^2 + v^2)^{2k} + \cdots = U(u,v),$
    $\dfrac{dv}{dt} = u + (Bu + Av)(u^2 + v^2)^k + Lv(u^2 + v^2)^{2k} + \cdots = V(u,v),$

where in the real analytic functions $U(u,v)$ and $V(u,v)$ terms of degree higher than $(4k+1)$ have not been written and where $L$ is determined below, (4.563).
Following the method of the last section, the partial differential equations corresponding to (2.2) are

(4.3)
    $\dfrac{\partial h}{\partial w}W(w,z) + \dfrac{\partial h}{\partial z}Z(w,z) = -q + (Ah - Bq)(h^2 + q^2)^k,$
    $\dfrac{\partial q}{\partial w}W(w,z) + \dfrac{\partial q}{\partial z}Z(w,z) = h + (Bh + Aq)(h^2 + q^2)^k,$

where $W(w,z)$ and $Z(w,z)$ have the significance of (3.1).











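Write, as in (2.21), $w = \rho\cos\theta$, $z = \rho\sin\theta$, whence (4.3) become

(4.31)
    $\left(1 + \dfrac{\Omega}{\rho^2}\right)\dfrac{\partial h}{\partial\theta} + \dfrac{R}{\rho}\,\dfrac{\partial h}{\partial\rho} = -q + (Ah - Bq)(h^2 + q^2)^k,$
    $\left(1 + \dfrac{\Omega}{\rho^2}\right)\dfrac{\partial q}{\partial\theta} + \dfrac{R}{\rho}\,\dfrac{\partial q}{\partial\rho} = h + (Bh + Aq)(h^2 + q^2)^k,$

where
    $\Omega = \rho[-\sin\theta\, W(\rho\cos\theta, \rho\sin\theta) + \cos\theta\, Z(\rho\cos\theta, \rho\sin\theta)] - \rho^2 = B\rho^{2k+2} + \cdots,$
    $R = \rho[\cos\theta\, W(\rho\cos\theta, \rho\sin\theta) + \sin\theta\, Z(\rho\cos\theta, \rho\sin\theta)] = A\rho^{2k+2} + \cdots.$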
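
The leading terms of $\Omega$ and $R$ can be confirmed numerically. The sketch below is illustrative only: it keeps just the terms of $W$ and $Z$ that are written out in (3.1), and it assumes that the term subtracted in $\Omega$ is $\rho^2$, so that $d\theta/dt = 1 + \Omega/\rho^2$ and $d\rho/dt = R/\rho$. The function name `omega_and_R` is hypothetical.

```python
import math

def omega_and_R(rho, theta, A, B, k):
    """Evaluate Omega and R for W, Z of (3.1) truncated after degree 2k+1."""
    c, s = math.cos(theta), math.sin(theta)
    w, z = rho * c, rho * s
    r2k = (w * w + z * z) ** k
    W = -z + (A * w - B * z) * r2k
    Z = w + (B * w + A * z) * r2k
    # Assumption: the subtracted term is rho**2.
    omega = rho * (-s * W + c * Z) - rho ** 2
    R = rho * (c * W + s * Z)
    return omega, R

rho, A, B, k = 0.3, 2.0, -0.5, 2
omega, R = omega_and_R(rho, 1.1, A, B, k)
# Expected leading terms: Omega = B rho^(2k+2), R = A rho^(2k+2)
print(abs(omega - B * rho ** (2 * k + 2)) < 1e-12,
      abs(R - A * rho ** (2 * k + 2)) < 1e-12)
```

With the truncated $W$ and $Z$ the two identities hold exactly; the omitted higher terms of (3.1) would contribute only to the terms indicated by dots.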
Now let us seek a solution of the partial differential equations (4.31) of the form

(4.311)    $h = \rho\cos\theta + \sum \rho^i h_i(\theta), \qquad q = \rho\sin\theta + \sum \rho^i q_i(\theta).$

Using this form for the solution we arrive at the system of ordinary differential equations similar to (2.4):

(4.32)    $\dfrac{dh_i}{d\theta} = -q_i, \qquad \dfrac{dq_i}{d\theta} = h_i$

for $1 < i \le 2k+1$, while for $2k+1 < i$

(4.33)
    $\dfrac{dh_i}{d\theta} = -q_i - B\dfrac{dh_{i-2k}}{d\theta} - A(i-2k)h_{i-2k} + Ah_{i-2k} - Bq_{i-2k}$
        $+ 2k(A\cos\theta - B\sin\theta)(\cos\theta\, h_{i-2k} + \sin\theta\, q_{i-2k}) + m_i,$
    $\dfrac{dq_i}{d\theta} = h_i - B\dfrac{dq_{i-2k}}{d\theta} - A(i-2k)q_{i-2k} + Bh_{i-2k} + Aq_{i-2k}$
        $+ 2k(B\cos\theta + A\sin\theta)(\cos\theta\, h_{i-2k} + \sin\theta\, q_{i-2k}) + n_i,$

where $m_i$ and $n_i$ involve $h_j$ and $q_j$ for $j < (i - 2k)$.

We shall first consider the differential equations for $i \le 4k$ and then consider the set for $i = 4k+1$ separately. Let us suppose that for some integer $i = s$ ($s \le 4k$), the $h_i$ and $q_i$ for $i \le s - 1$ are of the required type. Furthermore let us suppose that the $h_i$, $q_i$ for $i = 2, 3, \ldots, (s - 2k - 1)$ have been determined completely but that for $i = (s - 2k), \ldots, (s - 1)$ there are terms in $\sin\theta$ and $\cos\theta$ with arbitrary coefficients which are still undetermined. The solutions of these differential equations for even subscripts exist and are of the required type. For odd subscripts the solution of the differential equations for $i < s$ by virtue of (2.42) and (2.43) is

(4.34)
    $h_i = M_i + \alpha_i\cos\theta + \beta_i\sin\theta,$
    $q_i = N_i + (a_i - \beta_i)\cos\theta + (b_i + \alpha_i)\sin\theta,$

where for $i \le s - 2k$, $\alpha_i$ and $\beta_i$ are determined as indicated below, (4.37), but are still left undetermined for $s - 2k < i \le s - 1$.






















From (4.32) we see that the solutions of the required type (4.34) exist for $i = s \le 2k+1$ with $M_i$, $N_i$, $a_i$, and $b_i$ all zero. We need then consider only the differential equations for those odd numbers $s$ such that $2k+1 < s < 4k+1$. In the expressions (4.34) take $i = s - 2k$ and substitute them in the differential equations (4.33) with $i = s$. We get

(4.35)
    $\dfrac{dh_s}{d\theta} = -q_s + \cos\theta\,(\alpha_{s-2k}A(-s+4k+1) + a_s)$
        $+ \sin\theta\,(-2kB\alpha_{s-2k} + \beta_{s-2k}A(-s+2k+1) + b_s) + m_s^*,$
    $\dfrac{dq_s}{d\theta} = h_s + \cos\theta\,(\beta_{s-2k}A(s-2k-1) + 2kB\alpha_{s-2k} + c_s)$
        $+ \sin\theta\,(A\alpha_{s-2k}(-s+4k+1) + d_s) + n_s^*,$

where $m_s^*$ and $n_s^*$ have no terms in $\sin\theta$ or $\cos\theta$ and involve $h_i$ and $q_i$ only for $i \le s - 2k$, and where also the coefficients $a_s$, $b_s$, $c_s$, $d_s$ do the same. Note that these quantities do not depend on $\alpha_{s-2k}$, $\beta_{s-2k}$.

The differential equations (4.35) lead to

(4.36)
    $\dfrac{d^2h_s}{d\theta^2} = -h_s + \sin\theta\,(2\alpha_{s-2k}A(s-4k-1) - F_s)$
        $+ \cos\theta\,(-4kB\alpha_{s-2k} - 2\beta_{s-2k}A(s-2k-1) + E_s) + M_s,$

where $E_s = b_s - c_s$, $F_s = a_s + d_s$ and $M_s = -n_s^* + dm_s^*/d\theta$.


Now choose

(4.37)
    $\alpha_{s-2k} = \dfrac{F_s}{2A(s-4k-1)},$
    $\beta_{s-2k} = \dfrac{1}{2A(s-2k-1)}\left[E_s - \dfrac{2kBF_s}{A(s-4k-1)}\right].$

This can be done for $s \neq 4k+1$.

With this choice of constants, the differential equation (4.36) has a solution of the required type where we now choose the arbitrary coefficients $\alpha_s$, $\beta_s$ to be zero (i.e., when $s > 2k$). Hence there exist the required functions $h_i$, $q_i$, $i = 2k+3, \ldots, 4k-1$, by virtue of the proper choice of the coefficients $\alpha_{i-2k}$, $\beta_{i-2k}$. These were the terms left undetermined in $h_i$, $q_i$, $i < 2k+1$, so now with this set we have the functions $h_i$, $q_i$, $i = 1, 2, \ldots, 2k, 2k+2, \ldots, 4k$. The arbitrary constants $\alpha_{2k+1}$, $\beta_{2k+1}$ are still to be determined.

To determine $h_{4k+1}$, $q_{4k+1}$, let us consider the differential equations (4.35) for $s = 4k+1$; they become

(4.4)
    $\dfrac{dh_{4k+1}}{d\theta} = -q_{4k+1} + a_{4k+1}\cos\theta + (-2kB\alpha_{2k+1} - 2kA\beta_{2k+1} + b_{4k+1})\sin\theta + m_{4k+1}^*,$
    $\dfrac{dq_{4k+1}}{d\theta} = h_{4k+1} + (2kA\beta_{2k+1} + 2kB\alpha_{2k+1} + c_{4k+1})\cos\theta + d_{4k+1}\sin\theta + n_{4k+1}^*,$

where the terms have the significance given above (4.35).
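
The role of the choice (4.37) for $2k+1 < s < 4k+1$ can be illustrated by a short computation. The sketch below is only a check under the reading of (4.36) and (4.37) used here, namely that the coefficient of $\sin\theta$ is $2\alpha_{s-2k}A(s-4k-1) - F_s$ and that of $\cos\theta$ is $-4kB\alpha_{s-2k} - 2\beta_{s-2k}A(s-2k-1) + E_s$; the function names are hypothetical.

```python
from fractions import Fraction

def choose_alpha_beta(A, B, E, F, k, s):
    """Constants alpha_{s-2k}, beta_{s-2k} as in (4.37).
    Requires A != 0 and 2k+1 < s < 4k+1."""
    A, B, E, F = (Fraction(x) for x in (A, B, E, F))
    if A == 0 or not (2 * k + 1 < s < 4 * k + 1):
        raise ValueError("need A != 0 and 2k+1 < s < 4k+1")
    alpha = F / (2 * A * (s - 4 * k - 1))
    beta = (E - 4 * k * B * alpha) / (2 * A * (s - 2 * k - 1))
    return alpha, beta

def first_harmonics_of_4_36(A, B, E, F, k, s, alpha, beta):
    """Coefficients of sin(theta) and cos(theta) on the right of (4.36)."""
    sin_coeff = 2 * alpha * A * (s - 4 * k - 1) - F
    cos_coeff = -4 * k * B * alpha - 2 * beta * A * (s - 2 * k - 1) + E
    return sin_coeff, cos_coeff

alpha, beta = choose_alpha_beta(A=3, B=2, E=5, F=7, k=2, s=7)
# Both resonant coefficients vanish, so (4.36) has a solution of the required type.
print(first_harmonics_of_4_36(3, 2, 5, 7, 2, 7, alpha, beta))
```

At $s = 4k+1$ the divisor $s - 4k - 1$ vanishes, which is why that case has to be treated separately.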













The second order differential equation becomes

(4.41)    $\dfrac{d^2h_{4k+1}}{d\theta^2} = -h_{4k+1} + E_{4k+1}\cos\theta - F_{4k+1}\sin\theta - 4k(B\alpha_{2k+1} + A\beta_{2k+1})\cos\theta + M_{4k+1}.$

We see that no choice of $\alpha_{2k+1}$, $\beta_{2k+1}$ suffices to give (4.41) a solution of the required type, unless $(a_{4k+1} + d_{4k+1}) = 0$, i.e., $F_{4k+1} = 0$.
If $(a_{4k+1} + d_{4k+1}) = 0$, choose

(4.42)    $B\alpha_{2k+1} + A\beta_{2k+1} = (b_{4k+1} - c_{4k+1})/4k = E_{4k+1}/4k,$

and the solution $h_{4k+1}$ of (4.41) will be of the required type, while the constant $L$ of the lemma is zero.

However, $F_{4k+1} = 0$ is not the general case, so we proceed under the assumption $F_{4k+1} \neq 0$.

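Now as in (3) use the functions $h_i$, $q_i$, $i = 2, 3, \ldots, 4k$, just found, to make the change of variable (4.1) which in terms of $\rho$ and $\theta$ becomes

(4.5)
    $u = h(\rho,\theta) = \rho\cos\theta + \rho^2 h_2(\theta) + \cdots + \rho^{4k+1} h_{4k+1}(\theta),$
    $v = q(\rho,\theta) = \rho\sin\theta + \rho^2 q_2(\theta) + \cdots + \rho^{4k+1} q_{4k+1}(\theta),$

where $h_{4k+1}(\theta)$, $q_{4k+1}(\theta)$, and the coefficients $\alpha_{2k+1}$, $\beta_{2k+1}$ in $h_{2k+1}(\theta)$, $q_{2k+1}(\theta)$ are determined below.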
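In terms of $\rho$ and $\theta$ we have, as before:

(4.51)
    (a) $\dfrac{du}{dt} = \dfrac{\partial h(\rho,\theta)}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial h(\rho,\theta)}{\partial\rho}\,\dfrac{R}{\rho},$
    (b) $\dfrac{dv}{dt} = \dfrac{\partial q(\rho,\theta)}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial q(\rho,\theta)}{\partial\rho}\,\dfrac{R}{\rho},$

where $\Omega$ and $R$ have the significance of (4.31).
But up to terms of degree $4k+1$ in $\rho$ we have the two expressions in (4.51) equal, respectively, to

(4.52)
    (a) $-q + (Ah - Bq)(h^2 + q^2)^k,$
    (b) $h + (Bh + Aq)(h^2 + q^2)^k.$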


The terms in $\rho^{4k+1}$ in (4.51a) and (4.51b) are

(4.53)
    (a) $\dfrac{dh_{4k+1}}{d\theta} + B(-\alpha_{2k+1}\sin\theta + \beta_{2k+1}\cos\theta) + a'\cos\theta + b'\sin\theta$
        $+ A(\alpha_{2k+1}\cos\theta + \beta_{2k+1}\sin\theta)(2k+1) + m'(\theta),$
    (b) $\dfrac{dq_{4k+1}}{d\theta} + B(\alpha_{2k+1}\cos\theta + \beta_{2k+1}\sin\theta) + c'\cos\theta + d'\sin\theta$
        $+ A(\alpha_{2k+1}\sin\theta - \beta_{2k+1}\cos\theta)(2k+1) + n'(\theta),$

while the terms in $\rho^{4k+1}$ in (4.52a) and (4.52b) are

(4.54)
    (a) $-q_{4k+1} + A(\alpha_{2k+1}\cos\theta + \beta_{2k+1}\sin\theta) + a''\cos\theta + b''\sin\theta$
        $- B(\alpha_{2k+1}\sin\theta - \beta_{2k+1}\cos\theta) + 2k(A\cos\theta - B\sin\theta)\alpha_{2k+1} + m''(\theta),$
    (b) $h_{4k+1} + \beta_{2k+1}(-A\cos\theta + B\sin\theta) + c''\cos\theta + d''\sin\theta$
        $+ \alpha_{2k+1}(B\cos\theta + A\sin\theta)(2k+1) + n''(\theta),$

where $\sin\theta$, $\cos\theta$, $\alpha_{2k+1}$, $\beta_{2k+1}$ occur only where explicitly written.
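Now introduce the functions $h^*(\theta)$ and $q^*(\theta)$ satisfying the differential equations

(4.55)    $\dfrac{dh^*}{d\theta} = -q^* + m_{4k+1}^*(\theta), \qquad \dfrac{dq^*}{d\theta} = h^* + n_{4k+1}^*(\theta),$

where
    $m''(\theta) - m'(\theta) = m_{4k+1}^*(\theta),$
    $n''(\theta) - n'(\theta) = n_{4k+1}^*(\theta),$
as written in (4.4). This system (4.55) has a solution $h^*(\theta)$, $q^*(\theta)$ of the required type.
Now, for $h_{4k+1}(\theta)$, $q_{4k+1}(\theta)$ of (4.5) write

    $h_{4k+1}(\theta) = h^*(\theta) + \mu\sin\theta,$
    $q_{4k+1}(\theta) = q^*(\theta) + \lambda\sin\theta,$

and substitute in (4.53a). This becomes

(4.56)
    $\dfrac{dh^*}{d\theta} + \mu\cos\theta + \alpha_{2k+1}[-B\sin\theta + (2k+1)A\cos\theta] + a'\cos\theta$
        $+ b'\sin\theta + \beta_{2k+1}[B\cos\theta + (2k+1)A\sin\theta] + m'(\theta)$
    $= -q^* - \lambda\sin\theta + \lambda\sin\theta + m_{4k+1}^* + \mu\cos\theta$
        $+ \beta_{2k+1}[B\cos\theta + (2k+1)A\sin\theta] + a'\cos\theta + b'\sin\theta$
        $+ \alpha_{2k+1}[-B\sin\theta + (2k+1)A\cos\theta] + m'(\theta)$
    $= -q_{4k+1} + m''(\theta) + a''\cos\theta + b''\sin\theta + A(\alpha_{2k+1}\cos\theta + \beta_{2k+1}\sin\theta)$
        $- B(\alpha_{2k+1}\sin\theta - \beta_{2k+1}\cos\theta) + 2k\alpha_{2k+1}(A\cos\theta - B\sin\theta)$
        $+ \cos\theta\,(-a_{4k+1} + \mu) + (-b_{4k+1} + 2kB\alpha_{2k+1} + 2kA\beta_{2k+1} + \lambda)\sin\theta;$

i.e., (4.53a) becomes equal to (4.54a) except for the expression

(4.561)    $(-a_{4k+1} + \mu)\cos\theta + (-b_{4k+1} + 2kB\alpha_{2k+1} + 2kA\beta_{2k+1} + \lambda)\sin\theta.$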


Hence the expression (4.51a) equals the expression (4.52a) up to and including terms of degree $4k+1$ in $\rho$ except for (4.561).

In the same way it is found that the expression (4.51b) equals the expression (4.52b) up to and including terms of degree $4k+1$ in $\rho$ except for the terms

(4.562)    $(\lambda - c_{4k+1} - 2kA\beta_{2k+1} - 2kB\alpha_{2k+1})\cos\theta + (-\mu - d_{4k+1})\sin\theta.$










Now choose the constants as follows:
    $B\alpha_{2k+1} + A\beta_{2k+1} = (b_{4k+1} - c_{4k+1})/4k,$
    $\mu = \tfrac{1}{2}(a_{4k+1} - d_{4k+1}),$
    $\lambda = \tfrac{1}{2}(b_{4k+1} + c_{4k+1}),$
whence (4.561) and (4.562) become $L\cos\theta$ and $L\sin\theta$ respectively, where

(4.563)    $L = \tfrac{1}{2}(-a_{4k+1} - d_{4k+1}).$


The theorem follows now as in the preceding section.
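Case II. $A = 0$, $B \neq 0$.
The partial differential equations (4.3) now become

(4.6)
    $\dfrac{\partial h}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial h}{\partial\rho}\,\dfrac{R}{\rho} = -q - Bq(h^2 + q^2)^k,$
    $\dfrac{\partial q}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial q}{\partial\rho}\,\dfrac{R}{\rho} = h + Bh(h^2 + q^2)^k.$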


Again let the formal solution be (4.311) and let us follow the procedure of Case I. If we attempt now to solve the differential equations corresponding to (4.35) in $h_s$, $q_s$, we are led to

(4.61)    $\dfrac{d^2h_s}{d\theta^2} = -h_s - F_s\sin\theta + (-4kB\alpha_{s-2k} + E_s)\cos\theta + M_s,$

where the terms here have the significance of (4.36).
We thus see that a solution of the required type exists provided $F_s = 0$ and provided the arbitrary constant is chosen as follows:

    $\alpha_{s-2k} = E_s/4kB.$

However in the general case, for some $s = 2p + 1$,

    $F_{2p+1} \neq 0, \qquad p > k,$

in which case we have the following theorem:
THEOREM. By a real analytic transformation

    $u = w + \cdots,$
    $v = z + \cdots,$

the differential equations (3.1) are reduced to

(4.7)
    $\dfrac{du}{dt} = -v - Bv(u^2 + v^2)^k + Au(u^2 + v^2)^p + \cdots,$
    $\dfrac{dv}{dt} = u + Bu(u^2 + v^2)^k + Av(u^2 + v^2)^p + \cdots.$









The discussion here is much the same as in the preceding lemmas and will not be repeated.
Let us note a further change which can be introduced both in (4.2) and (4.7). It is essentially a change in the parameter $t$ such that

(4.71)    $dt = d\tau/\{1 + B(u^2 + v^2)^k\}.$

The system (4.2) becomes

(4.72)
    $\dfrac{du}{d\tau} = -v + Au(u^2 + v^2)^k + L'u(u^2 + v^2)^{2k} + \cdots,$
    $\dfrac{dv}{d\tau} = u + Av(u^2 + v^2)^k + L'v(u^2 + v^2)^{2k} + \cdots,$

where $L' = L - AB$. The system (4.72) is the form sought under the preliminary transformations.
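
The relation $L' = L - AB$ follows from a two-term series division. Writing $x = (u^2 + v^2)^k$, the right members of (4.2) divided by $1 + Bx$ have the form $-v + u\,(Ax + Lx^2)/(1 + Bx)$ and $u + v\,(Ax + Lx^2)/(1 + Bx)$, apart from the terms not written. The sketch below, with hypothetical function names, expands that quotient in powers of $x$ using exact arithmetic.

```python
from fractions import Fraction

def series_quotient(num, den, order):
    """Coefficients of num(x)/den(x) through x**order; den[0] must be nonzero."""
    out = []
    for n in range(order + 1):
        acc = Fraction(num[n]) if n < len(num) else Fraction(0)
        for j in range(1, min(n, len(den) - 1) + 1):
            acc -= den[j] * out[n - j]
        out.append(acc / den[0])
    return out

def reparametrized_coefficients(A, B, L):
    """Expand (A x + L x^2) / (1 + B x) through x^2, with x = (u^2 + v^2)^k."""
    return series_quotient([0, A, L], [1, B], 2)

# The coefficient of x is A and that of x^2 is L - A*B, i.e. L' of (4.72).
print(reparametrized_coefficients(2, 3, 5))
```

The coefficient $B$ itself disappears from the terms of degree $2k+1$, which is the reason the parameter change is worth making.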
However, the system (4.7) under (4.71) becomes

    $\dfrac{du}{d\tau} = -v + Au(u^2 + v^2)^p + \cdots,$
    $\dfrac{dv}{d\tau} = u + Av(u^2 + v^2)^p + \cdots,$

which is in the form (3.1) with $A \neq 0$ so that Case I of the theorem in this section applies. Thus the ultimate form in each case is (4.72) above.

Note that after the proper change of parameter, above, has been found, a corresponding one could be made in the initial set of differential equations (1.1) at once. Beginning with the set (1.1) so modified no further change in parameter is necessary and we arrive at (4.72) under changes in the dependent variables only.

The set of differential equations (1.1) transforms under the 1-1 analytic transformations (3) and (4.1), which contain arbitrary constants, into a system of the above form with unique $k$, $A$, $B$, and $L$. The system so obtained is spoken of as unique. When in addition the change of parameter (4.71) is introduced, the system attained (4.72) again has a unique set of constants, $k$, $A$, $L'$.

These statements can be established by observing that if there were a different 
system attained, there would exist a 1-1 analytic transformation taking the 
one system into the other. This leads to a contradiction. 


5. Convergent case. It is of interest to examine what occurs when $A_i = 0$. For the case where the functions $X(x, y)$ and $Y(x, y)$ are polynomials, Poincaré* showed that if the partial differential equation

(5.1)    $\dfrac{\partial f}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial f}{\partial\rho}\,\dfrac{R}{\rho} = 0$


* Poincaré, loc. cit.













has a formal solution of the required type, that solution is also convergent. The existence of this integral of (5.1) implies the existence of analytic solutions of (2.2)* so that for these cases there exist analytic functions which accomplish the transformation.
This case can arise in (2.2) or (4.7). In the former the system of differential equations becomes

    $\dfrac{du}{dt} = -v, \qquad \dfrac{dv}{dt} = u.$

In the case of (4.7) the system becomes

    $\dfrac{du}{dt} = -v(1 + B(u^2 + v^2)^k),$
    $\dfrac{dv}{dt} = u(1 + B(u^2 + v^2)^k).$
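
For the second system, $u\,du/dt + v\,dv/dt = 0$, so every solution stays on a circle $u^2 + v^2 = \text{const}$ and only the angular speed depends on the radius. The following is a minimal numerical illustration, not part of the original argument; the classical fourth-order Runge-Kutta step, the step size, and the sample values of $B$ and $k$ are choices made here, and the function names are hypothetical.

```python
def rhs(u, v, B, k):
    """Right members of du/dt = -v(1 + B r^(2k)), dv/dt = u(1 + B r^(2k))."""
    s = 1.0 + B * (u * u + v * v) ** k
    return -v * s, u * s

def rk4_step(u, v, h, B, k):
    """One classical Runge-Kutta step of size h."""
    k1 = rhs(u, v, B, k)
    k2 = rhs(u + 0.5 * h * k1[0], v + 0.5 * h * k1[1], B, k)
    k3 = rhs(u + 0.5 * h * k2[0], v + 0.5 * h * k2[1], B, k)
    k4 = rhs(u + h * k3[0], v + h * k3[1], B, k)
    return (u + h * (k1[0] + 2 * k2[0] + 2 * k3[0] + k4[0]) / 6,
            v + h * (k1[1] + 2 * k2[1] + 2 * k3[1] + k4[1]) / 6)

def radius_drift(u0, v0, B=0.5, k=1, h=1e-3, steps=5000):
    """Largest deviation of the radius from its initial value along the orbit."""
    u, v = u0, v0
    r0 = (u0 * u0 + v0 * v0) ** 0.5
    worst = 0.0
    for _ in range(steps):
        u, v = rk4_step(u, v, h, B, k)
        worst = max(worst, abs((u * u + v * v) ** 0.5 - r0))
    return worst

# The drift is at the level of the integration error only.
print(radius_drift(0.4, 0.3) < 1e-8)
```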


6. The formal transformation. If we attempt now to transform the system of differential equations (4.2) or (4.72), we find there does exist a formal transformation of the required type. The method followed in establishing this is precisely that of §4; in fact, the partial differential equations here

    $\dfrac{\partial h}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial h}{\partial\rho}\,\dfrac{R}{\rho} = -q + (Ah - Bq)(h^2 + q^2)^k + Lh(h^2 + q^2)^{2k},$
    $\dfrac{\partial q}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial q}{\partial\rho}\,\dfrac{R}{\rho} = h + (Bh + Aq)(h^2 + q^2)^k + Lq(h^2 + q^2)^{2k}$

differ but slightly from those of (4.31) and the related expressions corresponding to (4.35), (4.36), and (4.37) likewise differ very slightly from them. However the term in $\rho^{4k+1}$ which caused the difficulty before is now taken care of by the term involving the constant $L$. This term also contributes the slight differences between the expressions here and the corresponding ones of §4. The result is stated in the following theorem, whose proof will not be given:
THEOREM. The system of real differential equations

(6.1)
    $\dfrac{du}{dt} = -v + (Au - Bv)(u^2 + v^2)^k + Lu(u^2 + v^2)^{2k} + \cdots = U(u,v),$
    $\dfrac{dv}{dt} = u + (Bu + Av)(u^2 + v^2)^k + Lv(u^2 + v^2)^{2k} + \cdots = V(u,v),$

where $U$ and $V$ are the real analytic functions of (4.2), can be transformed to the unique (see §4) system

(6.2)
    $\dfrac{du_1}{dt} = -v_1 + (Au_1 - Bv_1)(u_1^2 + v_1^2)^k + Lu_1(u_1^2 + v_1^2)^{2k},$
    $\dfrac{dv_1}{dt} = u_1 + (Bu_1 + Av_1)(u_1^2 + v_1^2)^k + Lv_1(u_1^2 + v_1^2)^{2k}$


* Birkhoff, Am. Jour. Math., vol. 49 (1927), p. 37. 













by the formal transformation

(6.3)
    $u_1 = u + (\alpha_{2k+1}u + \beta_{2k+1}v)(u^2 + v^2)^k + \sum a_{ij}u^iv^j,$
    $v_1 = v + (-\beta_{2k+1}u + \alpha_{2k+1}v)(u^2 + v^2)^k + \sum b_{ij}u^iv^j,$

where
    $i + j > 2k + 1,$
    $A\beta_{2k+1} + B\alpha_{2k+1} = 0.$

Note that, if desired, the quantity $B$ can be taken equal to zero by virtue of the form (4.72), and here also there is but one form for the transformed differential equations.


7. Auxiliary functions. The series (6.3) of the last section, which accomplish the desired transformations, are formal and not necessarily convergent. Consequently these expressions are not satisfactory and we seek more appropriate ones.

Following Birkhoff,⁵ introduce the associated series

(7.1)
    $f(u,v) = u + (\alpha u + \beta v)(u^2 + v^2)^k + \sum a_{ij}\left(1 - e^{-1/[(1 + |a_{ij}|)(u^2 + v^2)^k]}\right)u^iv^j,$
    $g(u,v) = v + (-\beta u + \alpha v)(u^2 + v^2)^k + \sum b_{ij}\left(1 - e^{-1/[(1 + |b_{ij}|)(u^2 + v^2)^k]}\right)u^iv^j.$

These series are of a type considered by Ritt⁶ and by Birkhoff,⁷ so that only an outline of the proof of the following theorem, which is completely analogous to a theorem of Birkhoff's (loc. cit.), is given here.
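
The effect of the exponential factor can be seen on a single coefficient. The sketch below is illustrative only and assumes the factor in (7.1) is $1 - e^{-1/B}$ with $B = (1 + |a|)(u^2 + v^2)^k$; for real $u$, $v$ and $r^2 = u^2 + v^2 > 0$ the elementary inequality $1 - e^{-x} \le x$ ($x \ge 0$) then gives $|a|(1 - e^{-1/B}) \le |a|/B < r^{-2k}$ however large $|a|$ is, while the damped coefficient tends to $a$ as $r \to 0$. The function name `damped_coefficient` is hypothetical.

```python
import math

def damped_coefficient(a, r, k):
    """a * (1 - exp(-1/B)) with B = (1 + |a|) * r**(2k), for real r > 0."""
    B = (1.0 + abs(a)) * r ** (2 * k)
    # -expm1(-x) equals 1 - exp(-x) and stays accurate when x is tiny.
    return a * (-math.expm1(-1.0 / B))

r, k = 0.5, 1
bound = r ** (-2 * k)
for a in (1.0, 1e3, 1e50):
    # The damped coefficient never exceeds r^(-2k), up to rounding.
    print(a, abs(damped_coefficient(a, r, k)) <= bound * (1 + 1e-12))

# Near the origin the damping factor is indistinguishable from 1.
print(damped_coefficient(3.0, 0.01, 1))
```

This uniform bound, independent of the size of the coefficients, is what allows a dominating series to be written down even when the formal series (6.3) diverges.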

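THEOREM. The real functions $f(u, v)$, $g(u, v)$ defined in (7.1) are analytic in the real variables $u$ and $v$ for

    $0 < u^2 + v^2 \le \rho_0$

and $C^\infty$ for

    $0 \le u^2 + v^2 \le \rho_0$

and such that

    $\left.\dfrac{\partial^{i+j}f}{\partial u^i\,\partial v^j}\right|_{u=v=0} = a_{ij}\,i!\,j!, \qquad \left.\dfrac{\partial^{i+j}g}{\partial u^i\,\partial v^j}\right|_{u=v=0} = b_{ij}\,i!\,j!.$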
In the region of complex space, 
jue +0°| S po, 
largu| < — largv| <= 
« i. as 9 | é i] | onan 
8k wee 
or 
| argu| <= | gev| <= 
ie | — Cae ) —y, 
i £ i Sk ’ x 1 Sk 


5 Birkhoff, Sitzungsberichte Preus. Akad. (1929), pp. 171-183. 
6 J. F. Ritt, Annals of Mathematics, (2), vol. 18 (1916), p. 18.
7 Birkhoff, loc. cit.













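we find

    $\left|a_{ij}u^iv^j\left(1 - e^{-1/B_{ij}}\right)\right| < \dfrac{2|u|^i|v|^j}{|u^2 + v^2|^k},$

where
    $B_{ij} = (1 + |a_{ij}|)(u^2 + v^2)^k.$
We also have
    $|u^2 + v^2| \ge \sqrt{2}\,\rho_1\rho_2,$
where
    $u = \rho_1 e^{i\theta_1}, \qquad v = \rho_2 e^{i\theta_2}.$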

For the terms under consideration, $i + j > 2k$, divide the treatment into the following three cases:

(a) $i \ge k$, $j \ge k$; then

    $\dfrac{2|u|^i|v|^j}{|u^2 + v^2|^k} \le \dfrac{2|u|^i|v|^j}{|u|^k|v|^k\,2^{k/2}} < 2|u|^{i-k}|v|^{j-k}.$

(b) $i < k$, $j > 2k - i$; then

    $\dfrac{2|u|^i|v|^j}{|u^2 + v^2|^k} < 2|v|^{i+j-2k}.$

(c) $j < k$, $i > 2k - j$.

As in (b), these terms are less than $2|u|^{i+j-2k}$.


Thus we have as a dominating series the expression

    $2\sum_{i \ge k,\ j \ge k}|u|^{i-k}|v|^{j-k} + 2\sum_{j < k}\ \sum_{i > 2k-j}|u|^{i+j-2k} + 2\sum_{i < k}\ \sum_{j > 2k-i}|v|^{i+j-2k}.$

Consequently we have the analyticity of the function in the open region and continuity at the origin.
Let us now consider the partial derivative $\partial/\partial u$ of the series. A typical term for this differentiation becomes

    $i\,a_{ij}u^{i-1}v^j\left(1 - e^{-1/B_{ij}}\right) - \dfrac{2k\,a_{ij}u^{i+1}v^j\,e^{-1/B_{ij}}}{(1 + |a_{ij}|)(u^2 + v^2)^{k+1}}.$

As above, we have

    $\left|i\,a_{ij}u^{i-1}v^j\left(1 - e^{-1/B_{ij}}\right)\right| \le 2i\,|u|^{i-k-1}|v|^{j-k}.$
















For the second part of the derivative of the term we find

    $\left|\dfrac{2k\,a_{ij}u^{i+1}v^j\,e^{-1/B_{ij}}}{(1 + |a_{ij}|)(u^2 + v^2)^{k+1}}\right| \le 2k\,|u|^{i-k}|v|^{j-k-1},$

provided $i > k + 1$, $j > k + 1$.

The terms for which these inequalities are not satisfied are treated as under (b) and (c) above. Thus again we have a dominating series and consequently the derived series converges uniformly and absolutely and hence this partial derivative has the stated properties of continuity and analyticity.

Higher derivatives are treated in the same manner.

The limits approached by the derived series are as stated. The proof merely depends on the observation that each term of the derived series $\partial^{i+j}/(\partial u^i\,\partial v^j)$ approaches zero with $(u^2 + v^2)$ except the one of the form

    $i!\,j!\,a_{ij}\left(1 - e^{-1/B_{ij}}\right),$

whose limit is $i!\,j!\,a_{ij}$.


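Introduce the variables $\rho$, $\theta$ in the functions (7.1), where
    $u = \rho\cos\theta, \qquad v = \rho\sin\theta.$
The functions become

(7.2)
    $f(\rho,\theta) = \rho\cos\theta + (\alpha\cos\theta + \beta\sin\theta)\rho^{2k+1} + \sum a_{ij}\,\rho^{i+j}\left(1 - e^{-1/A_{ij}}\right)\cos^i\theta\,\sin^j\theta,$
    $g(\rho,\theta) = \rho\sin\theta + (\alpha\sin\theta - \beta\cos\theta)\rho^{2k+1} + \sum b_{ij}\,\rho^{i+j}\left(1 - e^{-1/B_{ij}}\right)\cos^i\theta\,\sin^j\theta,$

where
    $A_{ij} = \{1 + |a_{ij}|\}\rho^{2k}, \qquad B_{ij} = \{1 + |b_{ij}|\}\rho^{2k},$
which are analytic in the open region
    $0 < \rho \le \rho_0, \qquad 0 \le \theta \le 2\pi,$
and of class $C^\infty$ in the closed region
    $0 \le \rho \le \rho_0, \qquad 0 \le \theta \le 2\pi.$

Considered as power series in $\sin\theta$, $\cos\theta$, the expressions (7.2) are uniformly and absolutely convergent in the closed region. Furthermore, these functions are characterized by the following theorem: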










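THEOREM. The functions (7.2) can be expressed as power series in $\theta$:

(7.21)
    $f(\rho,\theta) = f_0(\rho) + f_1(\rho)\theta + f_2(\rho)\theta^2 + \cdots,$
    $g(\rho,\theta) = g_0(\rho) + g_1(\rho)\theta + g_2(\rho)\theta^2 + \cdots,$

convergent for
    $0 \le \theta \le 2\pi, \qquad 0 \le \rho \le \rho_0,$
where the functions $f_i(\rho)$, $g_i(\rho)$ are analytic for $0 < \rho \le \rho_0$ and of class $C^\infty$ for $0 \le \rho \le \rho_0$.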
Consider the first expression in (7.2); we have, as seen above,

    $\left|a_{ij}\,\rho^{i+j}\left(1 - e^{-1/A_{ij}}\right)\right| < 2\rho^{i+j-2k}.$

Consequently the series obtained by expanding

    $2\rho^{i+j-2k}\cosh^i\theta\,\sinh^j\theta$

in powers of $\theta$ dominates the series obtained by expanding the term

    $a_{ij}\,\rho^{i+j}\left(1 - e^{-1/A_{ij}}\right)\cos^i\theta\,\sin^j\theta$

in powers of $\theta$.
Furthermore, the series

    $a_0(\rho) + a_1(\rho)\theta + a_2(\rho)\theta^2 + \cdots = \sum 2\rho^{i+j-2k}\cosh^i\theta\,\sinh^j\theta$

converges uniformly and absolutely for $\rho \le \rho_1$ sufficiently small and $0 \le \theta \le 2\pi$ and all the $a_i(\rho)$ are positive. Then, since

    $|f_i(\rho)| < a_i(\rho), \qquad |g_i(\rho)| < a_i(\rho),$

the series (7.21) converge. It is readily shown that the $f_i(\rho)$, $g_i(\rho)$ have the additional properties stated in the theorem.

A similar discussion holds for the partial derivatives, and we have the second⁸

THEOREM. The first partial derivatives of the functions $f(\rho,\theta)$, $g(\rho,\theta)$ exist and can be expressed as the convergent series made up of the corresponding partial derivatives of the terms of the series (7.21).

Now introduce new variables $\bar u$, $\bar v$ by means of (7.1), as follows:

(7.3)    $\bar u = f(u, v), \qquad \bar v = g(u, v).$

The variables $u$, $v$ can be expressed as

(7.31)    $u = f^*(\bar u, \bar v), \qquad v = g^*(\bar u, \bar v),$

where the functions $f^*$, $g^*$ have the same properties as $f(u, v)$ and $g(u, v)$ of the above theorem.


8 See Bôcher, On semi-analytic functions of two variables, Annals of Mathematics, vol. 12 (1910-11), p. 18.










Put $\bar u = r\cos\varphi$, $\bar v = r\sin\varphi$ and use (7.2). Then we have

(7.32)
    $r\cos\varphi = f(\rho,\theta),$
    $r\sin\varphi = g(\rho,\theta).$

The properties of the inverse transformation, the expressions for $\rho$, $\theta$ or $\rho$, $\sin\theta$ (or $\cos\theta$), can be established in the following theorems. (The proofs are omitted.)

THEOREM. The system of equations (7.32) can be solved for $\rho$, $\theta$ in the form

(7.4)    $\theta = \varphi + \sum h_i(r)\varphi^i, \qquad \rho = r + \sum m_i(r)\varphi^i,$

convergent for $0 \le \varphi \le 2\pi$, $0 \le r \le r_0$, where the $h_i(r)$ and $m_i(r)$ have the properties of $f_i(\rho)$, $g_i(\rho)$ in (7.21) above.

For the second theorem we have:

THEOREM. The system of equations (7.32) can be solved for $\rho$ and $\sin\theta$ or $\rho$ and $\cos\theta$ in the form

(7.41)
    $\rho = r + \sum h_{ij}(r)\cos^i\varphi\,\sin^j\varphi,$
    $\sin\theta = \sin\varphi + \sum n_{ij}(r)\cos^i\varphi\,\sin^j\varphi,$

which are uniformly and absolutely convergent considered as power series in $\sin\varphi$, $\cos\varphi$, for

    $0 \le \varphi \le 2\pi, \qquad 0 \le r \le r_0,$

where $h_{ij}(r)$ and $n_{ij}(r)$ are analytic in $r$ for $0 < r \le r_0$ and of class $C^\infty$ for $0 \le r \le r_0$, and are such that

    $h_{ij}(0) = 0, \qquad n_{ij}(0) = 0.$


8. Auxiliary transformation. In the system of differential equations (6.1) put
    $u = \rho\cos\theta, \qquad v = \rho\sin\theta,$
and then introduce the new variables $r$, $\varphi$ by means of the transformation (7.32)

(8.1)    $r\cos\varphi = f(\rho,\theta), \qquad r\sin\varphi = g(\rho,\theta),$

whence

(8.11)    $r = \sqrt{f^2 + g^2} = M(\rho,\theta).$

The system becomes

(8.2)
    $dr/dt = Ar^{2k+1} + Lr^{4k+1} + R^*(r,\varphi),$
    $d\varphi/dt = 1 + Br^{2k} + \Omega^*(r,\varphi),$

where the properties of the functions $R^*(r,\varphi)$ and $\Omega^*(r,\varphi)$ are given in the following theorems.












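THEOREM. The function $R^*(r,\varphi)$ of (8.2) is analytic in $r$, $\varphi$ for $0 < r \le r_0$, $0 \le \varphi \le 2\pi$, of class $C^\infty$ for $0 \le r \le r_0$, $0 \le \varphi \le 2\pi$, and can be written $R^*(r,\varphi) = R_0(r) + R_1(r)\varphi + \cdots$, which converges for $0 \le r \le r_0$, $0 \le \varphi \le 2\pi$.

To prove the theorem, let us note the form of $R^*(r,\varphi)$. Introducing $r$ by means of (7.41), we get

(8.3)    $\dfrac{dr}{dt} = \dfrac{\partial M}{\partial\theta}\left(1 + \dfrac{\Omega(\rho,\theta)}{\rho^2}\right) + \dfrac{\partial M}{\partial\rho}\left(\dfrac{R(\rho,\theta)}{\rho}\right),$

an expression in $\rho$ and $\theta$, where $\Omega$ and $R$ arise from $U$ and $V$ here, as they do from $X$ and $Y$ in (2.31). In terms of $r$, $\varphi$ this becomes

(8.31)    $Ar^{2k+1} + Lr^{4k+1} + R^*(r,\varphi),$

where the expression $R^*(r,\varphi)$ can be written in terms of $\rho$ and $\theta$ as

(8.32)    $\left(1 + \dfrac{\Omega}{\rho^2}\right)\dfrac{\partial M}{\partial\theta} + \dfrac{R}{\rho}\,\dfrac{\partial M}{\partial\rho} - AM^{2k+1} - LM^{4k+1} = R^*(\rho,\theta),$

whence from the theorems of §7, this expression in terms of $\rho$ and $\theta$ has the required properties. Consequently, it also has the required properties in terms of $r$ and $\varphi$. Note that where $\varphi$ enters in $R^*(r,\varphi)$, it can be expressed entirely as powers of $\sin\varphi$ and $\cos\varphi$.

A similar result holds for $\Omega^*(r,\varphi)$.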

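A second theorem is as follows:

THEOREM. For any positive integer $p$,

    $\lim_{r \to 0}\dfrac{R^*(r,\varphi)}{r^p} = 0.$

For the expression in (8.11), write

(8.4)    $M(\rho,\theta) = M'(\rho,\theta) + M''(\rho,\theta),$

where $M'(\rho,\theta)$ is made up only of those terms of $M(\rho,\theta)$ which involve the terms of (7.2) with coefficients $a_{ij}$, $b_{ij}$ such that $i + j \le p$, while $M''(\rho,\theta)$ is made up of the rest of the expansion of $M(\rho,\theta)$.

Now put

(8.41)    $M'(\rho,\theta) = M_1(\rho,\theta) + M_2(\rho,\theta),$

where $M_1(\rho,\theta)$ is of the form

    $\sum a_{ij}\cos^i\theta\,\sin^j\theta\,\rho^{i+j} \qquad (i + j \le p),$

while $M_2(\rho,\theta)$ is made up of terms, each of which contains at least one factor of the type $e^{-1/A_{ij}}$.
We have

(8.42)    $\dfrac{\partial M_1}{\partial\theta}\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial M_1}{\partial\rho}\,\dfrac{R}{\rho} = AM_1^{2k+1} + LM_1^{4k+1} + N(\rho,\theta),$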










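where $N(\rho,\theta)$ is a power series in $\rho$, $\cos\theta$, $\sin\theta$, such that each term is of degree greater than $p$ in $\rho$.
Now, for the function $R^*(\rho,\theta)$ of (8.32) we have

(8.5)
    $R^*(\rho,\theta) = \dfrac{\partial}{\partial\theta}(M_1 + M_2 + M'')\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial}{\partial\rho}(M_1 + M_2 + M'')\,\dfrac{R}{\rho}$
        $- A(M_1 + M_2 + M'')^{2k+1} - L(M_1 + M_2 + M'')^{4k+1}$
    $= N(\rho,\theta) + AM_1^{2k+1} + LM_1^{4k+1}$
        $- A(M_1 + M_2 + M'')^{2k+1} - L(M_1 + M_2 + M'')^{4k+1}$
        $+ \dfrac{\partial}{\partial\theta}(M_2 + M'')\left(1 + \dfrac{\Omega}{\rho^2}\right) + \dfrac{\partial}{\partial\rho}(M_2 + M'')\,\dfrac{R}{\rho}.$

Consequently, we see

    $\lim_{\rho \to 0}\dfrac{R^*(\rho,\theta)}{\rho^p} = 0,$

and thus

    $\lim_{r \to 0}\dfrac{R^*(r,\varphi)}{r^p} = 0.$

A similar result holds for $\Omega^*(r,\varphi)$.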

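Finally introduce into the differential equations the variables $\bar u$, $\bar v$ of §7. We immediately have the following facts:

THEOREM. (1) Under the transformation (7.3) the system of differential equations becomes

(8.6)
    $\dfrac{d\bar u}{dt} = -\bar v + (A\bar u - B\bar v)(\bar u^2 + \bar v^2)^k + L\bar u(\bar u^2 + \bar v^2)^{2k} + \bar U(\bar u, \bar v),$
    $\dfrac{d\bar v}{dt} = \bar u + (B\bar u + A\bar v)(\bar u^2 + \bar v^2)^k + L\bar v(\bar u^2 + \bar v^2)^{2k} + \bar V(\bar u, \bar v).$

(2) The functions $\bar U(\bar u, \bar v)$, $\bar V(\bar u, \bar v)$ are analytic in the open region $0 < (\bar u^2 + \bar v^2)^{1/2} < \rho_1$ and of class $C^\infty$ in the closed region.
(3) For any integer $p$,

    $\lim_{\bar u = 0,\ \bar v = 0}\dfrac{\bar U(\bar u, \bar v)}{(\bar u^2 + \bar v^2)^p} = 0, \qquad \lim_{\bar u = 0,\ \bar v = 0}\dfrac{\bar V(\bar u, \bar v)}{(\bar u^2 + \bar v^2)^p} = 0.$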


9. Final transformation. The final transformation is introduced in the following

THEOREM. There exist functions

(9.1)    $f_1 = \bar u + \cdots, \qquad g_1 = \bar v + \cdots,$

analytic in the open region $\bar u^2 + \bar v^2 > 0$, of class $C^\infty$ in the closed region $\bar u^2 + \bar v^2 \ge 0$, such that the differential equations (8.6), under the transformation $w = f_1$, $z = g_1$, become

(9.2)
    $\dfrac{dw}{dt} = -z + (Aw - Bz)(z^2 + w^2)^k + Lw(z^2 + w^2)^{2k},$
    $\dfrac{dz}{dt} = w + (Bw + Az)(z^2 + w^2)^k + Lz(z^2 + w^2)^{2k}.$


To establish this theorem let us first prove the lemma below: 
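LEMMA. The partial differential equation

(9.3)    $\dfrac{\partial F}{\partial\varphi}\left(1 + Br^{2k} + \Omega^*\right) + \dfrac{\partial F}{\partial r}\left(Ar^{2k+1} + Lr^{4k+1} + R^*\right) = AF^{2k+1} + LF^{4k+1},$

where $A$, $B$, $L$, $R^*(r,\varphi)$, $\Omega^*(r,\varphi)$ have the same significance as in (8.2), has a solution

(9.31)    $F = r + \sum F_{ij}(r)\cos^i\varphi\,\sin^j\varphi,$

analytic in $r$ for $r > 0$, of class $C^\infty$ for $r = 0$ and such that

    $\lim_{r \to 0}\dfrac{F_{ij}(r)}{r^p} = 0$

for any positive integer $p$.
Consider the two cases $L = 0$ and $L \neq 0$.
Case 1. $L = 0$.
If $L = 0$, the partial differential equation (9.3) becomes

(9.4)    $\dfrac{\partial F}{\partial\varphi}\left(1 + Br^{2k} + \Omega^*\right) + \dfrac{\partial F}{\partial r}\left(Ar^{2k+1} + R^*\right) = AF^{2k+1}.$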
Make the substitution 
1 
(9.41) ig 


in the partial differential equation (9.4). We get 


af * ) af [4 2+ =) 2kk* 

9. a a me S-—t Ar — —_——— = 0 

(9.42) ag ( vm F a ar " + r + pies f 

and f = 1 + ---, where the part of the expression for f which is omitted 


vanishes with r to a power higher than the first. 
Now introduce the variables 


I B 
log r, 


v=o + o¢Aart ~ A 


(9.43) 


p=r. 



















The partial differential equation (9.4) becomes 


af (a* — R*(1 + Bp”) Off, xn , R*\ , 2kR* 
(9.44) (= ~ ek 2 ees, Ss Se tn @ 
ay p° Ap** + dp p + p + yr f 

The expressions R*(r, ¢), 2*(r, ¢) above become, in terms of p, ¥, R*(p, W), 
0*(p, y). These expressions have the same properties in terms of p and y that 
the original expressions, R*, 2*, have in terms of r and ¢. 

Finally write (9.44) as 

(1 + Bp”) R* _ & 


af —2kR* Apt pe of 











(9.45) 


where f(0, y) = 1. 
Now let us consider the other case. 
Case2. L # 0. 
Here make in (9.3) the substitution 


l L 1 L ] L 1 L 
05 i, ak eee ote pes pa ened Ue * aa ioe paeed 
(9.5) <A logu = pe + i log E + A + tale log | + 4. 


The partial differential equation becomes 














k 2 
2k — 
x du 2k 0* du 2k+1 Ak+l R* ae r 
(9.51) (14 B +2) 4% (ar + Li +™). APA Den? 
where u = 1 + ---. Introduce in (9.51) the new variables 
AB-L, A+I". 1) 1 
. 54 (Bah gg A tL, UY 1 
( 52) v co) + | 4 I 2k + pk DA’ 
p=", 
whence 
R* 
au — 1 7 2k ry Uu 
dp Ap™ 4. Lp! 4 R* Ap" + Lp" 
(9.53) 
mute mt ee) 
+ av Lp Ap*™* + Lp®* p |’? 


where 2*(p, y), R*(p, ¥) arise from 2*(r, ¢), R*(r, 9). 

By virtue of the lemma below, each of the partial differential equations (9.45) and (9.53) has a solution in terms of $\cos\psi$, $\sin\psi$, $\rho$ of the form (9.71) below, convergent for

    $\rho \le \rho_0, \qquad 0 \le \psi \le 2\pi,$

where the coefficients of the $\cos^i\psi\,\sin^j\psi$ have the property (9.72) and where $u(0, \psi) = 1$.

Returning to the variables $r$, $\varphi$ by means of (9.43) or (9.52), the corresponding function $f(r,\varphi)$ is also of the form (9.71), convergent for $r \le r_0$, $0 \le \varphi \le 2\pi$.

Finally introduce the function $F(r,\varphi)$ by means of (9.5) or (9.4). We find in both cases that this function, a solution of (9.3), likewise can be expressed in the form (9.71) with the above properties. Consequently we have the lemma established.

If the same transformation (9.43) or (9.52) is made in the partial differential equation (9.6) the following lemma can be established:

LEMMA. The partial differential equation

(9.6)    $\dfrac{\partial G}{\partial\varphi}\left(1 + Br^{2k} + \Omega^*\right) + \dfrac{\partial G}{\partial r}\left(Ar^{2k+1} + Lr^{4k+1} + R^*\right) = 1 + BF^{2k}$

has a solution

    $G = \varphi + \sum g_{ij}\cos^i\varphi\,\sin^j\varphi,$

analytic in $r$ for $r > 0$, of class $C^\infty$ in $r$ for $r = 0$ and such that

    $\lim_{r \to 0}\dfrac{g_{ij}}{r^p} = 0,$

$p$ any integer.
Now writing $w = f_1 = F\cos G$, $z = g_1 = F\sin G$, and expressing $F$ and $G$ in terms of $u$ and $v$ by $u = r\cos\varphi$, $v = r\sin\varphi$, the theorem of this section can be established.
Finally, the lemma appealed to above can be stated as follows:
LEMMA. In the partial differential equation

(9.7)    $\dfrac{\partial f}{\partial\rho} = S(\rho,\theta)f + T(\rho,\theta)\dfrac{\partial f}{\partial\theta}$

let the functions $S(\rho,\theta)$, $T(\rho,\theta)$ be analytic in $\rho$ and $\theta$, $\rho > 0$, class $C^\infty$, $\rho = 0$, expressible as

(9.71)
    (a) $S(\rho,\theta) = \sum S_i(\rho)\theta^i,$
    (b) $S(\rho,\theta) = \sum S_{ij}(\rho)\cos^i\theta\,\sin^j\theta,$

both uniformly convergent for $\rho_0 \ge \rho \ge 0$ and such that

(9.72)    $\lim_{\rho \to 0}\dfrac{S_i(\rho)}{\rho^p} = 0, \qquad \lim_{\rho \to 0}\dfrac{S_{ij}(\rho)}{\rho^p} = 0$

for any integer $p$, with similar properties for $T(\rho,\theta)$. Then there exists a function $f(\rho,\theta)$, analytic in $\rho$ and $\theta$, $\rho > 0$, of class $C^\infty$, $\rho = 0$, expressible as $S(\rho,\theta)$ is above, (9.71) and (9.72), and satisfying the partial differential equation (9.7) and

    $f(0, \theta) = 1.$

This lemma can be established by setting up the sequence

(9.73)
    $f_1 = 1,$
    $f_{n+1} = 1 + \displaystyle\int_0^\rho \left\{S f_n + T(\rho,\theta)\dfrac{\partial f_n}{\partial\theta}\right\} d\rho.$

The functions $S$ and $T$ are expressed as under (a) above to prove the convergence. The proof follows in the way used for ordinary differential equations.⁹


10. Conclusion. Combining the transformations of §8 and §9, we take the
system of differential equations (6.1) into the final form (9.2) using a
transformation analytic in the open region, $x^2 + y^2 > 0$, and of class $C^\infty$ in the closed
region $x^2 + y^2 \ge 0$ for $(x^2 + y^2)$ not greater than, say, $d^2$.

As indicated above, the form (6.1) is attained by an analytic transformation 
from the initial form (1.1) or by an analytic transformation and the change of 
parameter indicated in §4. 

Consequently the final form of the system of differential equations (9.2) is
attained from the initial system (1.1) by a transformation analytic in the open
region, of class $C^\infty$ in the closed region including the origin, or by such a
transformation and a change of parameter.


UNIVERSITY OF CINCINNATI. 


® Picard, Traité d’ Analyse, vol. 2, 1925, p. 368. 











THE MINIMA OF FUNCTIONALS WITH ASSOCIATED SIDE 
CONDITIONS 


By HERMAN H. GOLDSTINE


In his doctoral dissertation the author obtained a generalization of the cal- 
culus of variations which does not include either the problem of Lagrange with 
fixed end-points or the more general problem of Bolza. That is to say, the 
theory therein presented includes only the ordinary problem of the calculus 
of variations and certain non-calculus of variations problems which have fixed 
end-points and no side conditions.¹ To remedy this defect a more general
situation is considered in the present paper; more specifically it is proposed to 
find conditions that a functional having certain differentiability properties be a 
minimum in a class of functions satisfying a system of integro-differential equa- 
tions and passing through two fixed points. 

This problem is transformed into one having only generalized end conditions 
and is then treated by a method which is a generalization of the technique 
adopted in the author’s thesis. Analogues of the Lagrange multiplier rule, the 
Clebsch condition, and the Jacobi-Mayer condition are obtained for the trans- 
formed problem. 

The analogue of the Jacobi-Mayer condition is especially interesting, since 
the theory of the fixed end-point problem of Lagrange with integro-differential 
side conditions does not contain such a condition.² Moreover this condition
does not reduce to the ordinary Jacobi condition for the simple problem of the
calculus of variations, as has been shown.³


1. The problems and the transformation. We shall start with the following
problem: to find necessary conditions that an arc

$\xi_i = \xi_i(s)$ $(i = 1, \cdots, n;\ 0 \le s \le 1)$

minimize a functional J in the class of arcs satisfying the integro-differential
conditions


Received October 26, 1936. 

¹ Conditions for a minimum of a functional, Chicago Doctoral Dissertation (1936); it is
expected that this paper will soon appear in the third volume of the Contributions to the 
Calculus of Variations (Chicago). In the subsequent footnotes it will be indicated as 
Paper I. 

² See, e.g., L. M. Graves, A transformation of the problem of Lagrange in the calculus of
variations, Transactions of the American Mathematical Society, vol. 35 (1933), pp. 675-682. 

³ See Paper I, pp. 32-37.














(1) $\varphi_\alpha[x, \xi(x), u(x)] = 0$ $(\alpha = 1, \cdots, m < n)$,
    $u_\gamma(x) = \int_0^x P_\gamma[x, s, \xi(s)]\, ds$ $(\gamma = n + 1, \cdots, n + q)$,

(2) $u_i(x) = a_i + \int_0^x \xi_i(s)\, ds$ $(i = 1, \cdots, n)$

and the end conditions $u_i(1) = b_i$ $(i = 1, \cdots, n)$. The functions $\varphi_\alpha$ are
assumed to have continuous partial derivatives of the second order with respect
to all their arguments in a given region $\mathfrak{R}_1$ of $(2n + q + 1)$-dimensional space.
The functions $P_\gamma$ are supposed to possess the same differentiability properties
as the functions $\varphi_\alpha$ on a region $\mathfrak{R}_2$ of $(n + 2)$-dimensional space. The curves
to be discussed belong to a certain region $\mathfrak{S}_0$ of the space $\mathfrak{S}$ of n-uples of
continuous functions defined on $0 \le x \le 1$. Moreover on this region $\mathfrak{S}_0$
the functional I has a second differential.⁴

The region $\mathfrak{R}$ of $(2n + q + 2)$-dimensional space is, by definition, the set of
all points $(x, s, y_1, \cdots, y_n, u_1, \cdots, u_{n+q})$ for which the point $(x, s, y_1, \cdots, y_n)$
is in $\mathfrak{R}_2$ and the point $(x, y_1, \cdots, y_n, u_1, \cdots, u_{n+q})$ is in $\mathfrak{R}_1$. Then the
region $\mathfrak{S}_0$ has the property that the point

$[x, s, \xi(s), u(s)]$ $(0 \le s \le x \le 1)$

is interior to the region $\mathfrak{R}$ whenever the element $(\xi_1, \cdots, \xi_n)$ is in $\mathfrak{S}_0$. Lastly
along the minimizing "curve"

$\xi_{0i} = \xi_{0i}(s)$, $u_\delta = u_\delta(s)$ $(i = 1, \cdots, n;\ \delta = 1, \cdots, n + q)$,

the matrix of partial derivatives $(\varphi_{\alpha y_i})$ $(\alpha = 1, \cdots, m;\ i = 1, \cdots, n)$ has
rank m.


The second problem to be treated is the following: to find necessary
conditions that an element $\xi = (\xi_1, \cdots, \xi_r)$, lying in a certain region $\mathfrak{S}_0$ of the
space of r-uples of continuous functions defined on $0 \le x \le 1$, minimize a
functional J in a class of elements satisfying functional equations

$A_\mu(\xi) = 0$ $(\mu = 1, \cdots, p)$.

The functionals J and $A_\mu$ are supposed to have second differentials at each
point of $\mathfrak{S}_0$.

It is simple to verify that the transformation adopted by Graves in treating 
the problem of Lagrange⁵ carries the first problem into a special case of the
second; moreover the transformation is such that necessary conditions for one 
problem are necessary for the other. 


⁴ For definitions of the terms used above see, e.g., L. M. Graves, Topics in the functional
calculus, Bulletin of the American Mathematical Society, vol. 41 (1935), pp. 641-662;
T. H. Hildebrandt and L. M. Graves, Implicit functions and their differentials in general
analysis, vol. 29 (1927), pp. 127-153; or Paper I.

⁵ A transformation of the problem of Lagrange in the calculus of variations, loc. cit.













However, before proceeding to a treatment of the latter problem, we shall 
show that it can be transformed into an equivalent problem which is nota- 
tionally simpler to analyse. To effect this transformation we make the fol- 
lowing definitions: 


(3) $\xi(x) = \xi_i(x - i + 1)$ $(i - 1 \le x < i;\ i = 1, \cdots, r)$, $\xi(r) = \xi_r(1)$,

(4) $K(\xi) = J[(\xi_1, \cdots, \xi_r)]$, $B_\mu(\xi) = A_\mu[(\xi_1, \cdots, \xi_r)]$ $(\mu = 1, \cdots, p)$,

where $\xi$ is given by equations (3). Then the domain of K and $B_\mu$ is a region
$\mathfrak{S}$ of the space of functions $\xi$ which are defined on $0 \le x \le r$ and are
continuous except possibly at $x = i$ $(i = 1, \cdots, r - 1)$, where they have finite
right and left hand limits; we define the norm of an element $\xi$ to be the largest
of the norms of its sections,⁶ whence K and $B_\mu$ have second differentials on this
region.

Making use of the Riesz theorem⁷ and the differentiability of K, one can
verify that there exist regular functions $\kappa$ and $\beta_\mu$ of limited variation⁸ such
that $\kappa(0) = \beta_\mu(0) = 0$ $(\mu = 1, \cdots, p)$, $\kappa$ and $\beta_\mu$ are continuous at $x = i$
$(i = 1, \cdots, r - 1)$, and

(5) $dK(\xi_0; \zeta) = \int_0^r \zeta(x)\, d\kappa(x)$, $dB_\mu(\xi_0; \zeta) = \int_0^r \zeta(x)\, d\beta_\mu(x)$ $(\mu = 1, \cdots, p)$

for every admissible variation $\zeta$, i.e., for every function $\zeta$ whose sections are
continuous functions.


2. The multiplier rule.⁹ Since K is a minimum at $\xi_0$ in the class of all $\xi$
in $\mathfrak{S}$ satisfying the equations $B_\mu(\xi) = 0$, the usual considerations of the
calculus of variations suffice to show that there exist constants $l_0, c_1, \cdots, c_p$,
not all zero, such that

$l_0\, dK(\xi_0; \zeta) + c_\mu\, dB_\mu(\xi_0; \zeta) = 0$

for every admissible variation $\zeta$. Hence defining $\lambda(x)$ to be $l_0\kappa(x) + c_\mu\beta_\mu(x)$,
we have

(6) $\int_0^r \zeta(x)\, d\lambda(x) = 0,$

from which follows


⁶ By definition, the i-th section $\xi_i$ of $\xi$ is $\xi(x)$, where $i - 1 \le x < i$, and $\xi_i(i) = \lim_{x < i} \xi(x)$;
hence each section of $\xi$ is a continuous function on a finite and closed interval.

⁷ See, e.g., S. Banach, Théorie des Opérations Linéaires, Warsaw, 1932, pp. 59 ff.

⁸ A function f of limited variation will be said to be regular in case
$f(x) = [f(x - 0) + f(x + 0)]/2$;
every function of limited variation can be so regularized.

⁹ See, e.g., H. Hahn, Ueber die Lagrange'sche Multiplikatorenmethode, Sitzungsberichte
der Akademie Wien, vol. 131 (1922), pp. 531-550.













THE MULTIPLIER RULE. If the function $\xi_0(x)$ minimizes the functional K in
the class of all functions $\xi$ in $\mathfrak{S}$ which satisfy the equations

$B_\mu(\xi) = 0,$

then there exist constants $l_0, c_1, \cdots, c_p$, not all zero, such that

(7) $\lambda(x) = l_0\kappa(x) + c_\mu\beta_\mu(x) = 0$ $(0 \le x \le r)$,

where $\kappa$ and $\beta_\mu$ are the functions appearing in equations (5).

To prove the theorem it suffices to note that equation (6) implies the
vanishing of $\lambda$ at $x = r$ and to use the method indicated in the author's thesis.¹⁰

An element $\xi_0$ in $\mathfrak{S}$ will be said to be normal in case there exist p admissible
variations $\zeta_\sigma$ $(\sigma = 1, \cdots, p)$ such that the determinant $|dB_\mu(\xi_0; \zeta_\sigma)|$ $(\mu, \sigma =
1, \cdots, p)$ does not vanish. By a proof similar to the one made in the calculus
of variations it can be shown that an element $\xi_0$ is normal if and only if it has
no set of multipliers $l_0 = 0, c_1, \cdots, c_p$ with which it satisfies equation (7).
As usual we can take $l_0 = 1$ for a normal element and hence uniquely determine
the remaining multipliers.


3. The second differential. The usual considerations show that, if our
minimizing element $\xi_0$ is normal, then $dK(\xi_0; \zeta) = 0$ and $d^2K(\xi_0; \zeta, \zeta) \ge 0$ for
every admissible variation satisfying

(8) $dB_\mu(\xi_0; \zeta) = 0$ $(\mu = 1, \cdots, p)$.

We find it convenient to impose a restriction on the functionals $B_\mu$.
(H.1) The matrix $(dB_\mu(\xi_0; x^{\sigma-1}))$ $(\mu, \sigma = 1, \cdots, p)$ is of rank p.
Therefore the vectors

(9) $dB_\mu(\xi_0; 1),\ dB_\mu(\xi_0; x),\ \cdots,\ dB_\mu(\xi_0; x^{p-1})$ $(\mu = 1, \cdots, p)$

are linearly independent, and $\xi_0$ is a normal element. Then there is no loss of
generality in supposing that

(10) $B_\mu(0) = 0$, $dB_\mu(\xi_0; x^{\sigma-1}) = \delta_{\mu\sigma}$ $(\mu, \sigma = 1, \cdots, p)$,

where $(\delta_{\mu\sigma})$ is the Kronecker delta. Since the vectors (9) are linearly
independent, there exists a non-singular matrix $(c_{\mu j})$ such that

$\delta_{\mu\sigma} = c_{\mu j}\, dB_j(\xi_0; x^{\sigma-1}).$

Then the functionals $C_\mu = c_{\mu j}B_j$ have the desired properties; the system of
equations $dC_\mu(\xi_0; \zeta) = 0$ has the same solutions as the system $dB_\mu(\xi_0; \zeta) = 0$,
and $dC_\mu(\xi_0; x^{\sigma-1}) = \delta_{\mu\sigma}$. It will be supposed throughout the sequel that the
system (8) has been put into this canonical form. Next it will be shown that

LEMMA 1. If $d^2K(\xi_0; v, v) \ge 0$ for every function v whose sections have a
continuous p-th derivative and which satisfies $dB_\mu(\xi_0; v) = 0$, then $d^2K(\xi_0; \zeta, \zeta) \ge 0$
for every admissible variation $\zeta$ satisfying $dB_\mu(\xi_0; \zeta) = 0$.


¹⁰ Paper I, pp. 21 ff.










Suppose that there is an admissible variation $\zeta_0$ for which the lemma is
false. Then applying the Weierstrass approximation theorem to the sections
of $\zeta_0$, we get a sequence of functions $\{v_n\}$ whose sections have continuous p-th
derivatives and which converges in the sense of the norm to $\zeta_0$. Let $\bar v_n(x) =
v_n(x) + c_{1n} + c_{2n}x + \cdots + c_{pn}x^{p-1}$. Then for each n and each set $c_{1n}, \cdots, c_{pn}$
of constants, $\bar v_n$ has the desired differentiability properties. Furthermore by
choosing the $c_{1n}, \cdots, c_{pn}$ so that $c_{\mu n} = -dB_\mu(\xi_0; v_n)/dB_\mu(\xi_0; x^{\mu-1})$, we see that $\bar v_n$
satisfies $dB_\mu(\xi_0; v) = 0$ for every n. Moreover, since $\zeta_0$ satisfies these equations,
it is evident that for each $\mu$ the sequence $c_{\mu n}$ converges to zero; $\bar v_n$ converges in
the sense of the norm to $\zeta_0$. This leads to a contradiction.¹¹

We proceed to a determination of the second differential. It can be seen
from the proofs of Fréchet¹² that there exists a symmetric function k(x, y) of
limited variation in the sense of Fréchet¹³ such that $k(x, 0) = 0 = k(0, y)$
$(0 \le x \le r;\ 0 \le y \le r)$, k is continuous in both variables together along the
lines $x = i$, $y = j$ $(i, j = 1, \cdots, r - 1)$, and for every admissible $\zeta_1$ and $\zeta_2$¹⁴

(11) $d^2K(\xi_0; \zeta_1, \zeta_2) = \int_0^r \int_0^r \zeta_1(x)\zeta_2(y)\, d_x d_y k(x, y).$

Then adopting the method of the author's thesis,¹⁵ we define functions $k_\mu$:


r) Bi(x)Bily) _ k(2, r) Bi (y) _ k(r F ) B(x) . 


Bi(r)3i(r) B(r) "W Br) 


I ds [ k;_a(s, Odt 
0 Jo 
*, . [ (x — s)’ *B,(s das f° (y — s)’ “B(s)ds 
he | as [i *;_a(s8, dt 
” y [ (r — yg Aes (r — s)**BAs)ds 





k(a, y) + k(r, 





hy (a, y) 


k(x, y) 





en s)*,; (s)ds 


r ; (x — s)’" ‘Bils)ds 
-[ as fs ‘;1(s, dt 
0 0 


’ — s)" 8; (s)ds 


I (y — s)’*B,(s)ds 
- [a of Vj— i(s, t)dt 


¹¹ Paper I, pp. 11 ff.

¹² Sur les fonctionnelles bilinéaires, Transactions of the American Mathematical Society,
vol. 16 (1915), pp. 215-244.

¹³ Ibid., pp. 223 ff.

¹⁴ See, e.g., Paper I, pp. 4 ff. The continuity property stated above results from the
arbitrariness with which the values of k may be chosen.

¹⁵ See Paper I, pp. 6-8.













Then the functions $k_\mu$ $(\mu = 1, \cdots, p)$ are symmetric, vanish on the boundary
of the square $(0 \le x \le r;\ 0 \le y \le r)$; $k_1$ is bounded, Riemann integrable,¹⁶
and continuous on the lines $x = i$, $y = j$ $(i, j = 1, \cdots, r - 1)$. Moreover
by successively integrating by parts,¹⁷ and using equations (9) and (10), we
find that

(12) $d^2K(\xi_0; v_1, v_2) = \int_0^r \int_0^r v_1^{(p)}(x)\, k_p(x, y)\, v_2^{(p)}(y)\, dx\, dy$ $(v_1$ in $\mathfrak{N}$, $v_2$ in $\mathfrak{N})$,

where $\mathfrak{N}$ is the class of all admissible variations v satisfying equations (8), such
that the sections of v have continuous p-th derivatives.

We find it convenient to make a second restriction on the generality of our
problem.
(H.2) If p = 1, it is assumed that $k_1$ is continuous in both variables together.


4. The Jacobi-Mayer condition. We proceed to establish the
ANALOGUE OF THE JACOBI-MAYER CONDITION. If $\xi_0$ is a normal minimizing
element for K, then the integral equation

(13) $\int_0^r k_p(x, y)\varphi(y)\, dy = \sigma\varphi(x),$

where $k_p$ is the unique continuous function appearing in equation (12), can have
no negative characteristic values. Moreover if $d^2K(\xi_0; \zeta, \zeta)$ is not equal to zero for
every admissible variation satisfying $dB_\mu(\xi_0; \zeta) = 0$ $(\mu = 1, \cdots, p)$, then equation
(13) always has at least one non-zero characteristic value.¹⁸
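A condition of this type can be checked numerically once a kernel is given. The sketch below is illustrative only: it assumes the hypothetical kernel $k(x, y) = \min(x, y) - xy$ on the unit square (chosen because its characteristic values $1/(m\pi)^2$ are known in closed form, not because the paper derives it), replaces the integral in (13) by a sum over equally spaced points, and tests positivity by a Cholesky factorization. All function names are assumptions introduced for the example.

```python
import math


def example_kernel(x, y):
    """Hypothetical symmetric kernel min(x, y) - x*y on the unit square."""
    return min(x, y) * (1.0 - max(x, y))


def discretized_kernel(kernel, n_points):
    """Matrix h * k(x_i, x_j) on the interior grid x_i = i * h, h = 1/(n_points + 1)."""
    h = 1.0 / (n_points + 1)
    xs = [(i + 1) * h for i in range(n_points)]
    return [[h * kernel(x, y) for y in xs] for x in xs]


def is_positive_definite(a):
    """Attempt a Cholesky factorization; it succeeds only for positive definite a."""
    n = len(a)
    low = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            s = sum(low[i][k] * low[j][k] for k in range(j))
            if i == j:
                d = a[i][i] - s
                if d <= 0.0:
                    return False
                low[i][j] = math.sqrt(d)
            else:
                low[i][j] = (a[i][j] - s) / low[j][j]
    return True


def largest_eigenvalue(a, iterations=200):
    """Power iteration followed by a Rayleigh quotient (a symmetric)."""
    n = len(a)
    v = [1.0] * n
    for _ in range(iterations):
        w = [sum(a[i][j] * v[j] for j in range(n)) for i in range(n)]
        norm = math.sqrt(sum(c * c for c in w))
        v = [c / norm for c in w]
    av = [sum(a[i][j] * v[j] for j in range(n)) for i in range(n)]
    return sum(vi * avi for vi, avi in zip(v, av))


if __name__ == "__main__":
    matrix = discretized_kernel(example_kernel, 50)
    print(is_positive_definite(matrix))            # no negative characteristic values
    print(largest_eigenvalue(matrix), 1 / math.pi ** 2)
```

For this model kernel the discrete matrix is positive definite and its largest eigenvalue is close to $1/\pi^2$, so both halves of the condition (no negative characteristic value, at least one non-zero one) are visible at once.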

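We prove first that $k_p$ is unique. Let $\kappa_1$ and $\kappa_2$ be two continuous functions
effective in equation (12) and let $\kappa = \kappa_1 - \kappa_2$. Then

$\int_0^r \int_0^r v_1^{(p)}(x)\, \kappa(x, y)\, v_2^{(p)}(y)\, dx\, dy = 0$ $(v_1$ in $\mathfrak{N}$; $v_2$ in $\mathfrak{N})$.

Consider any two continuous functions

$\varphi_1(x)$, $\varphi_2(x)$;

then define two functions $v_1(x)$, $v_2(x)$ as follows:

(14) $v_j(x) = \int_0^x \int_0^{x_1} \cdots \int_0^{x_{p-1}} \varphi_j(x_p)\, dx_p \cdots dx_1 + c_{p-1,j}x^{p-1} + \cdots + c_{0j}$

$(j = 1, 2;\ 0 \le x \le r)$.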


¹⁶ Ibid., pp. 8-10.

¹⁷ Ibid., p. 11.

¹⁸ Ibid., pp. 23-24, 26-41. In this reference the condition above is obtained for the
case $r = p = 1$, $\beta_1(x) = x$; the result is interpreted for two special cases, one of which is
the fixed end-point problem of the calculus of variations.













We can determine the numbers $c_{0j}, \cdots, c_{p-1,j}$ so that they satisfy the equations


me l l i ha I gi(tp)dxy ~~~ dxydB,(t) + Cy-1,¢dBy (bo ; 2*’) 
0 0 
(j= 1,2;4=1,---,p). 


Then both $v_1$ and $v_2$ are in $\mathfrak{N}$. Hence we have proved that

$\int_0^r \int_0^r \varphi_1(x)\, \kappa(x, y)\, \varphi_2(y)\, dx\, dy = 0$

for every pair $(\varphi_1, \varphi_2)$ of continuous functions; this implies that $\kappa$ is
identically zero.

Suppose next that $\sigma$ is a negative characteristic value for equation (13).
Then in (14) replace $\varphi_j$ by $\varphi$, a normed characteristic solution corresponding
to $\sigma$. Hence we have defined a function v in $\mathfrak{N}$ such that

$\int_0^r k_p(x, y)\, v^{(p)}(y)\, dy = \sigma v^{(p)}(x)$, $\int_0^r [v^{(p)}(x)]^2\, dx = 1.$

This leads immediately to a contradiction.
Finally if $k_p$ is identically zero, $d^2K(\xi_0; v, v) = 0$ for every v in $\mathfrak{N}$, whence
by a method analogous to that employed in Lemma 1 we find that

$d^2K(\xi_0; \zeta, \zeta) = 0$

for every admissible variation satisfying equations (8). Consequently the last
statement in the condition follows immediately from the symmetry of $k_p$ and
the Hilbert-Schmidt theory.

COROLLARY 1. If $\xi_0$ is a normal minimizing element for K, then

$k_{p,n}(x, x) \ge 0$ $(n = 1, 2, \cdots;\ 0 \le x \le r)$,

where $k_{p,n}(x, y)$ is the n-th iterated kernel for $k_p(x, y)$.

The proof is the same as the one in the author's thesis.¹⁹ Apparently neither
the condition above nor its corollary appears in the literature for any special
case of the theory herein developed.²⁰


5. The Clebsch condition. We are able to state a simple generalization of 
the familiar Clebsch necessary condition. 

ANALOGUE OF THE CLEBSCH CONDITION. Let $\xi_0$ be a normal element which
minimizes K in the class of all admissible curves satisfying $B_\mu(\xi) = 0$, and let

(15) $\eta(x) = \psi(x \mid \zeta)$ $(0 \le x \le r)$

be a Lebesgue square integrable function of x for each admissible variation $\zeta$ satis-

¹⁹ See Paper I, p. 24.

²⁰ I.e., excepting the case $r = p = 1$, $\beta_1(x) = x$, which appears in the author's
dissertation. See, e.g., Graves, A transformation of the problem of Lagrange in the calculus of
variations, loc. cit., p. 675.



















fying $dB_\mu(\xi_0; \zeta) = 0$ $(\mu = 1, \cdots, p)$; further suppose that $\psi$ is a homogeneous
functional of $\zeta$ for $0 \le x \le r$. Then the lower bound $B(\xi_0)$ of $d^2K(\xi_0; \zeta, \zeta)$ in the
class of all admissible variations $\zeta$ such that $dB_\mu(\xi_0; \zeta) = 0$ $(\mu = 1, \cdots, p)$ and

(16) $\int_0^r \eta(x)\, dx = 1$

is finite; this number is also the lower bound of $d^2K(\xi_0; v, v)$ for all v in $\mathfrak{N}$ satisfying
equation (16).

The first part of this theorem follows immediately from the homogeneity
of $\psi$ as a function of $\zeta$ and the normality of $\xi_0$. In the proof of Lemma 1 a
method for constructing sequences of elements in $\mathfrak{N}$ which converge to any
admissible $\zeta$ with $dB_\mu(\xi_0; \zeta) = 0$ was given. By means of this convergence
property an indirect proof of the latter part of the condition can be made
readily.²¹

In Paper I the author has shown how the finiteness of the bound $B(\xi_0)$ is
equivalent to the Legendre condition in the special case of the ordinary problem
of the calculus of variations. Further, the condition above in conjunction
with the Jacobi-Mayer condition implies that the bound $B(\xi_0)$ is both finite
and non-negative. The problem of obtaining sufficient conditions has been
solved only for the restricted case which is considered in the author's thesis.


THE UNIVERSITY OF CHICAGO.


²¹ See Paper I, p. 23.











ON THE ZEROS OF JACOBI POLYNOMIALS, WITH APPLICATIONS 
By M. S. WEBSTER


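Introduction. This paper deals principally with the generalized Jacobi
polynomials

(1) $J_n(x; \alpha, \beta) \equiv J_n(x) = (1 + x)^{1-\alpha}(1 - x)^{1-\beta} \frac{d^n}{dx^n}\left[(1 + x)^{n+\alpha-1}(1 - x)^{n+\beta-1}\right] = (-1)^n n! \binom{2n + \alpha + \beta - 2}{n} \phi_n(x; \alpha, \beta),$

    $\phi_n(x; \alpha, \beta) \equiv \phi_n(x) = x^n - S_n x^{n-1} + \cdots$ $(n = 0, 1, 2, \cdots)$,

defined (except for constant factors) for all real α, β as the polynomial solutions
of the differential equation

(2) $(1 - x^2) J_n''(x) + [\alpha - \beta - (\alpha + \beta)x] J_n'(x) + n(n + \alpha + \beta - 1) J_n(x) = 0$ $(n = 0, 1, \cdots)$.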


For arbitrary α, β several authors have discussed the number of real zeros of
$J_n(x; \alpha, \beta)$. Stieltjes [1]¹ gave a method of finding the number of zeros in the
intervals $(-\infty, -1)$, $(-1, 1)$, $(1, \infty)$ but he stated the result only when α, β > 0.
Shibata [2] gave a table for the number of zeros when they are all real and α, β
are not negative integers or zero. Lawton [3] gave complete results for the
closed interval (−1, 1) when n is sufficiently large. The results of Hilbert [4],
Klein [5], Van Vleck [6], and Hurwitz [7] for the zeros of the hypergeometric
function may also be applied to Jacobi polynomials.

Here we find the number of zeros of $J_n(x; \alpha, \beta)$ inside the intervals $(-\infty, -1)$,
$(-1, 1)$, $(1, \infty)$ and, in addition, at $x = \pm 1$ (from which the number of
imaginary zeros is easily obtained) when α, β are arbitrary. The method employed is
new and other properties of $J_n(x; \alpha, \beta)$ are developed as well.

In case α, β > 0, the $J_n(x)$ form, as is known, an orthogonal system

(3) $\int_{-1}^{1} p(x) J_m(x) J_n(x)\, dx = 0$, $p(x) = (1 + x)^{\alpha-1}(1 - x)^{\beta-1}$
$(m \ne n;\ m, n = 0, 1, \cdots)$.


Received December 7, 1936; presented to the American Mathematical Society, April 20, 
1935. The author wishes to express his gratitude to Professor J. Shohat for his valuable 
suggestions. 

¹ The numbers in brackets refer to the bibliography at the end of the paper.
















Here the zeros $x_{i,n}(\alpha, \beta) \equiv x_{i,n}$ $(i = 1, 2, \cdots, n)$ of $J_n(x)$ have the following
properties:


A. —l, Zan—>l(n— @); 
3 < Tint < Zin < V2, n4+1 —* & has Tn+i,n4l < &. 


We obtain upper and lower bounds for $x_{i,n}$ by the geometrical method which
E. R. Neumann [8], Winston [9], and W. Hahn [10] used for Laguerre and
Hermite polynomials. We further employ a superior method, based on Markoff's
theorem [11], which enables us to find better bounds and even asymptotic
expressions $(n \to \infty)$ for certain $x_{i,n}$. We then apply these results to the
coefficients $H_{i,n}$ in the mechanical quadratures formula

(4) $\int_{-1}^{1} p(x) f(x)\, dx = \sum_{i=1}^{n} H_{i,n} f(x_{i,n}) + R_n(f), \qquad H_{i,n} = \int_{-1}^{1} \frac{p(x)\phi_n(x)}{(x - x_{i,n})\phi_n'(x_{i,n})}\, dx,$

with the characteristic property that

$\int_{-1}^{1} p(x) G_{2n-1}(x)\, dx = \sum_{i=1}^{n} H_{i,n} G_{2n-1}(x_{i,n}),$ i.e., $R_n(G_{2n-1}(x)) = 0,$

$G_s(x)$ denoting an arbitrary polynomial of degree $\le s$.
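The characteristic property of the quadrature formula can be observed numerically in the simplest case. The sketch below assumes the weight $p(x) = 1$ (the Legendre case, which in the notation above corresponds to α = β = 1), computes the nodes by Newton's method on the Legendre three-term recurrence, and uses the standard weight formula $2 / ((1 - x^2) P_n'(x)^2)$; the function names are hypothetical and not taken from the paper.

```python
import math


def legendre(n, x):
    """Return (P_n(x), P_n'(x)) for |x| < 1 via the three-term recurrence."""
    if n == 0:
        return 1.0, 0.0
    p_prev, p = 1.0, x
    for k in range(2, n + 1):
        p_prev, p = p, ((2 * k - 1) * x * p - (k - 1) * p_prev) / k
    dp = n * (x * p - p_prev) / (x * x - 1.0)
    return p, dp


def gauss_legendre(n):
    """Nodes (zeros of P_n) and weights of the n-point rule for p(x) = 1."""
    nodes, weights = [], []
    for i in range(1, n + 1):
        x = math.cos(math.pi * (i - 0.25) / (n + 0.5))  # standard starting guess
        for _ in range(100):
            p, dp = legendre(n, x)
            dx = p / dp
            x -= dx
            if abs(dx) < 1e-15:
                break
        _, dp = legendre(n, x)
        nodes.append(x)
        weights.append(2.0 / ((1.0 - x * x) * dp * dp))
    return nodes, weights


def quadrature_error(n, k):
    """Absolute error of the n-point rule applied to the monomial x^k on (-1, 1)."""
    nodes, weights = gauss_legendre(n)
    approx = sum(w * x ** k for x, w in zip(nodes, weights))
    exact = 0.0 if k % 2 else 2.0 / (k + 1)
    return abs(approx - exact)


if __name__ == "__main__":
    for k in range(10):
        print(k, quadrature_error(4, k))
```

With n = 4 the error is at rounding level for every monomial of degree at most 2n − 1 = 7 and becomes visible at degree 8, which is exactly the statement $R_n(G_{2n-1}) = 0$.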

We consider also the coefficients H;,, for Laguerre polynomials. 

Finally, we establish the theorem, recently proved in a different way by 
Meixner [12], that the system of Hermite polynomials is the only case where 
systems of orthogonal and Appell [13] polynomials coincide. 

1. The following relations, being identities in α, β, remain valid for the
generalized $J_n(x)$ (as well as for the orthogonal case), $n = 1, 2, \cdots$.

(5) $J_n'(x; \alpha, \beta) = -n(n + \alpha + \beta - 1) J_{n-1}(x; \alpha + 1, \beta + 1)$,

(6) $J_n(-x; \alpha, \beta) = (-1)^n J_n(x; \beta, \alpha)$; $x_{i,n}(\alpha, \beta) = -x_{n-i+1,n}(\beta, \alpha)$ $(i = 1, 2, \cdots, n)$,

(7) $J_n(-1; \alpha, \beta) = 2^n n! \binom{n + \alpha - 1}{n}$; $J_n(1; \alpha, \beta) = (-2)^n n! \binom{n + \beta - 1}{n}$.

With Lawton, we let ¥(z) = (1 + sine z)"**" so that


n—l 


£1 + v@)] = 1 + 2) vo) + nS ve), 


In virtue of (1, 2), we obtain by successive differentiation

(8) $[n + \alpha + \beta - 1][J_n(x; \alpha + 1, \beta) - J_n(x; \alpha, \beta)] = (x - 1) J_n'(x; \alpha, \beta)$,

(9) $[n(x + 1) + 2\alpha] J_n(x; \alpha + 1, \beta) - 2(n + \alpha) J_n(x; \alpha, \beta) = (x^2 - 1) J_n'(x; \alpha + 1, \beta)$.











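By repeated use of (5, 7), we have

(10) $J_n(x; \alpha, \beta) = \sum_{i=0}^{n} J_n^{(i)}(-1; \alpha, \beta) \frac{(x + 1)^i}{i!} = \sum_{i=0}^{n} \gamma_i (x + 1)^i = n! \sum_{i=0}^{n} (-1)^i 2^{n-i} \binom{n + \alpha + \beta + i - 2}{i} \binom{n + \alpha - 1}{n - i} (x + 1)^i.$

Hereafter, r represents a negative integer or zero; $[x] = 0$ if $x < 1$, and equals
the largest positive integer $\le x$ if $x \ge 1$. Introduce non-negative integers
p, q, $n_0$ as follows:

$p = [1 - \alpha]$, $q = [1 - \beta]$; $n_0 = [1 - n - \alpha - \beta]$, if $n + \alpha + \beta \le 1$.

Let $N_j$ $(j = 1, 2, 3)$ denote the number of zeros of $J_n(x; \alpha, \beta)$ (assumed $\not\equiv 0$)
in the intervals $(-\infty < x < -1)$, $(-1 < x < 1)$, $(1 < x < \infty)$ respectively,
and $K_i$ denote the number of zeros of $J_n(x; \alpha + i, \beta)$ $(\not\equiv 0)$ in $(-\infty < x < -1)$,
where the smallest and largest such zeros (if they exist) are $\lambda_i$, $\eta_i$ respectively.
Then, $K_0 = N_1$, $\lambda_i \le \eta_i < -1$. By (2, 10),

(11) $(n - i)(n + i + \alpha + \beta - 1)\gamma_i + 2(i + 1)(i + \alpha)\gamma_{i+1} = 0$ $(i = 0, 1, \cdots, n - 1)$.

Substituting $J_n(x; \alpha, \beta) = (1 + x)^{1-\alpha} g(x)$ in (2), we obtain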

Jn(z; 1 — p, B) = (-1"-pt(" bs : aTc + 1)’ Jn-p(2; 1 + p,B) 
(n 


IV 


P), 


(12) J,.(z;a,1—q) = (—at(" + . _ ') (x — 1)* Jn_,(2;3 a, 1 + Q) 
(n 29), 


n 


p+ 
Inpotil+pl+q (n2pt4). 


Jx(z;1 — p, 1 — q) = (-1)""“ + at( ) (x + 1)"(x — 1)° 


These formulas show the effect produced by the indicated changes in α, β.
It is interesting to observe that the orthogonality relation (3) (derived for
α, β > 0) is still true if α, β are non-positive integers provided $m, n \ge p + q$;
for example, using (12), we have, by (3),


1 
| (1 +2)" — x) ‘J,.(2; 1 — p, 1 — g)J,(a; 1 — p, 1 — gdr 
1 


- "2 m n ' P/1 _ w\? oe 
\(p + 9)! (, 8 + )f.¢ + x) — 2)" I np; 1 + p, 1 +9) 


J n—p-y(t3 1 + p, 1 + g)dx = 0 (m,n 2 p+ q). 













Formulas (2, 6, 10, 12) yield the following properties of $J_n(x; \alpha, \beta)$: (i) it $\equiv 0$
if, and only if, $n + n_0 + \alpha + \beta = 1$, $\alpha + p = 1$, $n \ge p > n_0$; (ii) it has no
multiple zeros, except possibly at $x = \pm 1$; (iii) it has a zero of multiplicity p at
$x = -1$ if, and only if, $\alpha + p = 1$, $n \ge p$; (iv) it has a zero of multiplicity q at
$x = 1$ if, and only if, $\beta + q = 1$, $n \ge q$; (v) it has a zero of multiplicity $[n - n_0]$
at infinity if, and only if, $n + n_0 + \alpha + \beta = 1$.

In what follows we may assume $n > 0$ and $J_n(x; \alpha, \beta) \not\equiv 0$.
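The expansion of $J_n$ in powers of $(x + 1)$ can be checked against the differential equation with exact rational arithmetic. The sketch below assumes the coefficient recurrence $(n - i)(n + i + \alpha + \beta - 1)\gamma_i + 2(i + 1)(i + \alpha)\gamma_{i+1} = 0$ and the equation $(1 - x^2)y'' + [\alpha - \beta - (\alpha + \beta)x]y' + n(n + \alpha + \beta - 1)y = 0$ rewritten in the variable $t = x + 1$. For simplicity it normalizes $\gamma_0 = 1$, which differs from the paper's normalization by a constant factor; the function names are hypothetical.

```python
from fractions import Fraction


def gamma_coefficients(n, alpha, beta):
    """Coefficients gamma_0..gamma_n in powers of (x + 1), normalized so gamma_0 = 1."""
    gammas = [Fraction(1)]
    for i in range(n):
        denominator = 2 * (i + 1) * (i + alpha)
        if denominator == 0:
            raise ValueError("alpha is a non-positive integer; the recurrence breaks down")
        gammas.append(-gammas[i] * (n - i) * (n + i + alpha + beta - 1) / denominator)
    return gammas


def ode_residual(n, alpha, beta, gammas):
    """Coefficients, in powers of t = x + 1, of
    t(2 - t) y'' + (2*alpha - (alpha + beta) t) y' + n(n + alpha + beta - 1) y."""
    s = alpha + beta
    residual = []
    for k in range(n + 1):
        g_k = gammas[k]
        g_next = gammas[k + 1] if k < n else Fraction(0)
        residual.append(
            2 * (k + 1) * k * g_next - k * (k - 1) * g_k
            + 2 * alpha * (k + 1) * g_next - s * k * g_k
            + n * (n + s - 1) * g_k
        )
    return residual


if __name__ == "__main__":
    n, alpha, beta = 5, Fraction(1, 2), Fraction(3, 2)
    g = gamma_coefficients(n, alpha, beta)
    print(g)
    print(ode_residual(n, alpha, beta, g))  # every entry should be zero
```

Because the arithmetic is exact, a residual of all zeros confirms that the recurrence and the differential equation agree term by term; the same check works for negative non-integral α, where the polynomials are no longer orthogonal.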


2. For the further study of zeros of $J_n(x)$ we make use of the following lemma
[10]:

LEMMA 1. If $f(x) \not\equiv 0$ satisfies the differential equation $r(x)f''(x) + s(x)f'(x)
+ t(x)f(x) = 0$ in a certain interval (c, d), where r(x), s(x), t(x) are continuous
and $r(x) \cdot t(x) < 0$, then f(x) can have at most one zero inside (c, d).

We now consider two cases.

Case 1. $n + \alpha + \beta > 1$.

THEOREM 1. If $n + \alpha + \beta > 1$, then $N_2 = [n - p - q]$ and $N_1$, $N_3$ are
each either 0 or 1. Furthermore, if $\alpha + p = 1$, $n \ge p$, then $N_1 = 0$; otherwise,
$N_1 \equiv \min(n, p) \pmod 2$.² If $\beta + q = 1$, $n \ge q$, then $N_3 = 0$; otherwise, $N_3 \equiv
\min(n, q) \pmod 2$.

Proof. $N_j$ $(j = 1, 3)$ is 0 or 1 by Lemma 1.

(i) $n \le p$. (a) $n < p$ or $n = p$ and $\alpha \ne r$. By (10, 11), $\gamma_i\gamma_{i+1} > 0$ $(i = 0,
1, \cdots, n - 1)$. It follows from Descartes' rule of signs that $N_2 = N_3 = 0$
and $N_1 \equiv n \pmod 2$.

(b) $n = p$, $\alpha + p = 1$. From (12), $N_1 = N_2 = N_3 = 0$.

(ii) $n \le q$. This reduces to (i) by means of (6).

(iii) $n > p$, $n > q$. Hence, $n \ge p + q$. Lawton's method³ shows $N_2 =
n - p - q$. (6, 10, 11) enable us to complete the proof as in (i).

Case 2. $n + \alpha + \beta \le 1$. It suffices to illustrate the method for $0 < n + n_0
+ \alpha + \beta < 1$, $\alpha \ne r$. By Theorem 1,

(13) $K_{n_0+1}$ is 0 or 1 and $\equiv \min(n, [p - n_0 - 1]) \pmod 2$.

Let i be an integer such that $1 \le i \le n_0 + 1$. If $K_i \ge 1$, it follows from (8, 9):
(i) $J_n(x; \alpha + i - 1, \beta)$ and $J_n(x; \alpha + i, \beta)$ have no common zero inside $(-\infty,
-1)$, (ii) $J_n(x; \alpha + i - 1, \beta)$ has exactly $K_i - 1$ zeros which separate the $K_i$
zeros of $J_n(x; \alpha + i, \beta)$ inside $(-\infty, -1)$, (iii) $J_n(x; \alpha + i - 1, \beta)$ has at most
one zero less than $\lambda_i$ and at most one zero inside $(\eta_i, -1)$. Furthermore, making
use of (1) and of the fact that $J_n(x)$ is a polynomial, we get:


² $c \equiv \min(a, b) \pmod 2$ means $c \equiv d \pmod 2$, where $d = \min(a, b)$.

³ For $n \ge p + q + 1$, Lawton proved that $J_n(x; \alpha, \beta)$ has (i) exactly $n - p - q$ zeros
inside (−1, 1), (ii) a zero of multiplicity p at $x = -1$ if $\alpha + p = 1$, (iii) a zero of
multiplicity q at $x = 1$ if $\beta + q = 1$.













Ki1=Ki+1 (i>p), Ki GS p<nt+i), |Ki-1| n+is p) 
(lSiim+1 Sn), 

(14) Kin =K;, (@>p), |Ki-1| GS p<n+i, Ki (n+iS p) 
(Il SsisSnm—n+1,n S n), 

Kin=Ki+1>p), K GSpt+tntid, |Ki—1| (n +78 p) 


(mM —-n+2S5iSm+1,2n S m). 


(14) remains valid if $K_i = 0$. By a repeated application of (14) and (13), we
derive the following results:

(i) $n \ge n_0 + 1$. (a) $p \ge n_0 + 1$. $N_1$ is 0 or 1 and $\equiv n_0 + 1 - \min(n, p)
\pmod 2$. (b) $p < n_0 + 1$. $N_1 = n_0 - p + 1$.

(ii) $n \le n_0$. $N_1 = [n - p]$.

We summarize our conclusions in

THEOREM 2. (i) $0 < n + n_0 + \alpha + \beta < 1$. If $n \ge n_0 + 1$, $n_0 + 1 < p$ and
$\alpha \ne r$ or $\alpha + p = 1$, $n < p$, then $N_1$ is 0 or 1 and $\equiv n_0 + 1 - \min(n, p)
\pmod 2$. If $n \ge p > n_0 + 1$, $\alpha + p = 1$, then $N_1 = 0$. If $n \ge n_0 + 1 \ge p$,
then $N_1 = n_0 - p + 1$. If $n \le n_0$, then $N_1 = [n - p]$. (ii) $n + n_0 + \alpha + \beta =
1$. Here, $N_1 = [\min(n, n_0) - p]$.

THEOREM 3. If $n + \alpha + \beta \le 1$, then $N_2$ is 0 or 1.

Proof. If $n + \alpha + \beta < 1$, this follows from Lemma 1; if $n + \alpha + \beta = 1$,
we use (2).

Analogous results regarding $N_3$ are obtained (see (6)) by interchanging α and
β, p and q. Theorem 2 and the corresponding one for $N_3$ yield

COROLLARY. (i) If $0 < n + n_0 + \alpha + \beta < 1$, it is impossible to have both
$N_1 > 1$, $N_3 > 1$; if, in addition, $\alpha = \beta$, then $N_1 = N_3 = 1$. (ii) If $n + n_0 +
\alpha + \beta = 1$, then at least one of the numbers $N_1$, $N_3$ is zero; if, in addition, $\alpha = \beta$,
then $N_1 = N_3 = 0$.

If $n + \alpha + \beta \le 1$, we have determined the number of real zeros of $J_n(x; \alpha, \beta)$
which do not lie inside (−1, 1). Hence, we may find $N_2$ from Theorem 3,
since the total number of real zeros is even or odd according as n is even or odd.
Illustrations, such as the one below, show that the conclusions of the above
theorems and corollary cannot be improved. We note that these results differ
from the case of Laguerre polynomials [10] in that we may have either $N_1$ or $N_3$
greater than unity.


Illustration: n = 3, a = -5, B= 5. Here, p = 1, gq = 4, m = 2, 


J;(x) = - (3°-5(x + 1)° + 2-3°-5°%(a + 1)? + 2-3*-5(a@ + 1) - 2.3.5}. 


Since the discriminant of the equation $J_3(x) = 0$ is positive, all of its roots are
real; thus, $N_1 = 2$, $N_2 = 1$, $N_3 = 0$, in accordance with the preceding theorems.




















3. Hereafter, our analysis is confined to the study of the zeros $x_{i,n}$ (all inside
(−1, 1)) of $J_n(x; \alpha, \beta)$ for the case α, β > 0. It is known [11] that

(15) $x_{i,n}(\alpha, \beta) < x_{i,n}(\alpha + 1, \beta) < x_{i+1,n}(\alpha, \beta)$ $(i = 1, 2, \cdots, n - 1)$;

furthermore, by (8, 9),

$(n + \alpha + \beta)(x + 1) J_n(x; \alpha + 2, \beta) = [(2n + \alpha + \beta)(x + 1) + 2\alpha] J_n(x; \alpha + 1, \beta) - 2(n + \alpha) J_n(x; \alpha, \beta).$

Thus, we obtain the following interesting inequalities, to be used later:

(16) $x_{i,n}(\alpha, \beta) < x_{i,n}(\alpha + 1, \beta) < x_{i,n}(\alpha + 2, \beta) < x_{i+1,n}(\alpha, \beta)$,
     $x_{i,n}(\alpha, \beta) < x_{i+1,n}(\alpha, \beta + 2) < x_{i+1,n}(\alpha, \beta + 1) < x_{i+1,n}(\alpha, \beta)$
     $(i = 1, 2, \cdots, n - 1)$.


By means of (10), we see that 2;,,(a, 8) — —1 + aa | ~. Ti 3 





k (2 0, finite or infinite). In particular, fork = <, 
Xinla, B)-lifa-« (¢ = 1,2, ---, 2). 
An interesting relation for the $x_{i,n}$ may be obtained by putting $\Phi_n(x; \alpha, \beta) \equiv
\Phi_n(x) = x^n + \cdots$, where

$\int_0^1 x^{\alpha-1}(1 - x)^{\beta-1} \Phi_m(x)\Phi_n(x)\, dx = 0$ $(m \ne n;\ m, n = 0, 1, \cdots)$.

If we make the substitution $x = (1 + y)/2$, thus reducing the interval (0, 1) to
(−1, 1), and compare with the orthogonality relation (3) for $\phi_n(x; \alpha, \beta)$, we find
that $\phi_n(x)$ and $\Phi_n[(1 + x)/2]$ differ by a constant factor only. Thus

$\phi_n(x) = 2^n \Phi_n\left(\frac{1 + x}{2}\right)$, $x_{i,n} = 2X_{i,n} - 1$,

where $X_{i,n} \equiv X_{i,n}(\alpha, \beta)$ are the zeros of $\Phi_n(x; \alpha, \beta)$. Making use of the known
relations [11]:

$x_{i,2n}^2(\beta, \beta) = X_{i-n,n}(\tfrac12, \beta)$ $(n + 1 \le i \le 2n)$,
$x_{i,2n+1}^2(\beta, \beta) = X_{i-n-1,n}(\tfrac32, \beta)$ $(n + 2 \le i \le 2n + 1)$,

we get

(17) $1 + x_{i,n}(\tfrac12, \beta) = 2x_{n+i,2n}^2(\beta, \beta)$, $1 + x_{i,n}(\tfrac32, \beta) = 2x_{n+i+1,2n+1}^2(\beta, \beta)$
$(i = 1, 2, \cdots, n)$,

interesting relations connecting the symmetric case (α = β) with two particular
non-symmetric cases corresponding to the same interval (−1, 1). We shall
make important use of (17).

With W. Hahn, let 










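$p_k = \sum_{i=1}^{n} \frac{1}{(1 + x_{i,n})^k}$ $(k = 1, 2)$, so that $\frac{1}{p_1} < 1 + x_{1,n} < \frac{p_1}{p_2}$.

Evaluating $p_1$, $p_2$ by means of (10) and using (6), we get

(18) $\frac{2\alpha}{n(n + \alpha + \beta - 1)} < 1 + x_{1,n} < \frac{2\alpha(\alpha + 1)}{n(n + \alpha + \beta - 1) + \alpha(\alpha + \beta)}$,

     $\frac{2\beta}{n(n + \alpha + \beta - 1)} < 1 - x_{n,n} < \frac{2\beta(\beta + 1)}{n(n + \alpha + \beta - 1) + \beta(\alpha + \beta)}$,

     $x_{1,n} = -1 + O\left(\frac{1}{n^2}\right)$, $x_{n,n} = 1 + O\left(\frac{1}{n^2}\right)$ $(\alpha, \beta > 0,\ n \to \infty)$.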
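The bounds for the smallest zero can be checked numerically for a few parameter values. The sketch below assumes the reading $2\alpha / A < 1 + x_{1,n} < 2\alpha(\alpha + 1) / (A + \alpha(\alpha + \beta))$ with $A = n(n + \alpha + \beta - 1)$, builds $J_n$ (up to a constant factor) in powers of $t = x + 1$ from the recurrence $(n - i)(n + i + \alpha + \beta - 1)\gamma_i + 2(i + 1)(i + \alpha)\gamma_{i+1} = 0$, and locates the first sign change on a fine grid followed by bisection. The grid step and the function names are assumptions made for the example; for n = 1 both bounds collapse to the single zero, so only $n \ge 2$ is tried.

```python
def shifted_coefficients(n, alpha, beta):
    """Coefficients of J_n (up to a constant factor) in powers of t = x + 1."""
    g = [1.0]
    for i in range(n):
        g.append(-g[i] * (n - i) * (n + i + alpha + beta - 1)
                 / (2.0 * (i + 1) * (i + alpha)))
    return g


def smallest_shifted_zero(n, alpha, beta, step=1e-4):
    """Return 1 + x_{1,n}: the smallest zero of the polynomial in t on (0, 2)."""
    g = shifted_coefficients(n, alpha, beta)

    def poly(t):
        value = 0.0
        for c in reversed(g):  # Horner evaluation
            value = value * t + c
        return value

    lo, f_lo = 0.0, poly(0.0)
    for j in range(1, int(round(2.0 / step)) + 1):
        hi = j * step
        f_hi = poly(hi)
        if f_lo * f_hi <= 0.0:
            # Sign change found: refine by bisection.
            for _ in range(80):
                mid = 0.5 * (lo + hi)
                f_mid = poly(mid)
                if f_lo * f_mid <= 0.0:
                    hi = mid
                else:
                    lo, f_lo = mid, f_mid
            return 0.5 * (lo + hi)
        lo, f_lo = hi, f_hi
    raise ValueError("no zero found in (0, 2)")


def bounds_for_smallest_zero(n, alpha, beta):
    """Lower and upper bounds for 1 + x_{1,n} as read from (18)."""
    a = n * (n + alpha + beta - 1)
    return 2 * alpha / a, 2 * alpha * (alpha + 1) / (a + alpha * (alpha + beta))


if __name__ == "__main__":
    for n, alpha, beta in [(2, 1.0, 1.0), (3, 1.0, 1.0), (5, 0.5, 1.5), (6, 2.0, 0.7)]:
        lower, upper = bounds_for_smallest_zero(n, alpha, beta)
        print(n, alpha, beta, lower, smallest_shifted_zero(n, alpha, beta), upper)
```

For α = β = 1 and n = 2 the zero is $1 - 1/\sqrt{3}$, which lies between 1/3 and 1/2 as the bounds predict.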


These results may be greatly improved for some special α, β (see (32)). We
note also that


2 
1+ nn = el oa 1) + a(n) (a — (0), 
2 
1 — Fan = Al * e-') oF “| (g = 0), 
$x_{1,n}(0, \beta) = -1$, $x_{n,n}(\alpha, 0) = 1$


(see (12)). 
From (17) and (18) we obtain 


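(19) $\frac{\sqrt{2}}{2\sqrt{m(m + \alpha - \frac12)}} < -x_{m,2m} = x_{m+1,2m} < \frac{\sqrt{3}}{2\sqrt{m(m + \alpha - \frac12) + \frac12(\alpha + \frac12)}}$,

     $\frac{\sqrt{6}}{2\sqrt{m(m + \alpha + \frac12)}} < -x_{m,2m+1} = x_{m+2,2m+1} < \frac{\sqrt{15}}{2\sqrt{m(m + \alpha + \frac12) + \frac32(\alpha + \frac32)}}$

$(\alpha = \beta > 0)$.
In some cases these results may be improved (see (29)).
4. We proceed along the lines of E. R. Neumann and Winston to obtain
bounds for the general $x_{i,n}$. (5, 6, 8, 9) yield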


1 , (2n + a+ B)(x + a0) | 
J n+ a . e r) — (4 c . al Zz) 
ssi" (7) ae Wy ae J (a) 2n + a+ B)J,(2 


(20) 
m= ’ 
2Qn+a+e 


(2n+a+6)(1—2°) » at 
(21) n+1 J, ila) = (2n+ a + B)(a + Xo)J nai(x) 
— 4(n + a)(n + B)J n(x), 


⁴ It is to be noted that these results were obtained before the publication of Buell's [14]
paper which supplements but does not supplant formulas (18), (19), ete. Formulas (31), 
(32), ete. were not given by Buell. 



















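$J_{n+1}(x; \alpha - 1, \beta - 1) = (1 - x^2) J_n'(x; \alpha, \beta) + [\alpha - \beta - (\alpha + \beta - 2)x] J_n(x; \alpha, \beta).$

Let $\mu_{i,n}(\alpha, \beta) \equiv \mu_{i,n}$ $(i = 1, 2, \cdots, n - 1)$ be the zeros of $J_n'(x; \alpha, \beta)$. In view
of (5), it is known that

$\mu_{i,n}(\alpha, \beta) = x_{i,n-1}(\alpha + 1, \beta + 1), \qquad \begin{Bmatrix} x_{i,n+1} \\ \mu_{i-1,n} \end{Bmatrix} < \begin{Bmatrix} x_{i,n} \\ \mu_{i,n+1} \end{Bmatrix} < \begin{Bmatrix} x_{i+1,n+1} \\ \mu_{i,n} \end{Bmatrix},$

where either of the terms in each bracket may be used. It may be shown by
induction on i that $J_n(x_{i,n+1}) \cdot J_{n+1}(x_{i,n}) < 0$, so that (20), (21) give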


(22) 


2 j = 
Vin <= Bi,nti according as Xin = “= 


= : 4 
Titi net S Min ACCOPdING as Tizin > To. 
We see from (2) that there is no point of inflection of the curve y = J,(z) inside 
B 


(2:2, Hin) if Zin S at and no such point inside (ui.n, Zis1.n) if in 2 <7 
By (20), (21), (22), let 
I, = (Qn + a + B) | Sins J,(x)dx 
pee 5 ot iia | 
—= (n re 1)(n + a ry 8 cee 2) [J nsa(2i,n) J ng t(Zis1,n)), 
n+a+B-1 ' 
hl > Gp esate —H Tones! 





"If tin S e~-& then eft —. js less than the area of the triangle formed 
a+p, 2n+a+B8B 

by the z-axis, the tangent to y = J,(z) at x = 2;,,, and the perpendicular 

to the z-axis at x = x;,;,, , 80 that by (20, 21), 


gc (2m + a + BY itn = Fin) | yh (g, | 


th ; 
-te4 aa Ug = iY Tia) | 
Hence (simitarly if z;,. 2 =—6), 
(23) sone? leer BRB ton 5 8 
ees E zieeees - ae rin BO at 








434 M. S. WEBSTER 


The inequalities (23) lead to the following bounds for $x_{i,n}$:
(i + 2V/a2 — 1)’ aBi a—B — 
=e a Fis —, = 3, 
4(in+1l)(n+a+B8—2) am a+, 
(n —i + 2v/pe)” aiB a—B roe 

a ee a A. -°-——, 2 —, n = 3. 

4(n+1)(n+a+B8—2) pr a+8 
9 9 
a = min (1,2), 6; = min (1,8), a@2 = max (a, w), 
Bs = max (8, w), o= (? + v2) = (0.364 ---. 


It suffices to outline the proof for the first inequality (24), when a 2 8. 


Casel. a 2w. Since z;,, S anf it follows that 1 — 2;,, 2 2B/(@ + 8) 
Qa 


with 0 < 28/(a+ 8) = 1. Hence, by (23), 


IA 








l + Tin > 
(24) 


IV 


1+2%n<2— 


a , 20 op Zi,n) - ie 26 ' 
(1 + 241.0) (1 + 2,n) > le + Hin+a +8 — 9) a+ 5 ? 


Pist > pi + V2p;, i= “te (n+ 1I)(n + @ + B— 2)(1 + %,n). 
Following E. R. Neumann, we prove (by induction) that 


a>eqnitive=s 





at least ifn = 3. Thus, 


(i + 2Va — 1) 26 (n = 3) 


1 Sia - — * 
+ Hi, > iatlateats-® a+B 


Case 2. 0 <a <w. Here, we prove that 


’ 


pp 2.2 TE Uy +1)(n+a+B8—-2)14 2%.) >a = : £eye~ 2 
a 28 4 
(@§ + 2VYw—1)° a 2B ’ 

i - a 5 a. A fe. (n = 3). 

om > in+Dn+a+p—2) wo a+, ae 





Continuing with E. R. Neumann’s method, we may, by means of a complicated analysis, find certain upper and lower bounds for the negative $x_{i,n}$ in the symmetric case ($\alpha = \beta$) as well as in the non-symmetric case. Instead of giving these results, we proceed to develop a method which yields much more.


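5. Since $x_{i,n}(\alpha, \beta)$ for $\alpha, \beta = 1/2, 3/2$ are known [11] explicitly, Markoff’s theorem’ immediately gives

$$\frac{\partial x_{i,n}}{\partial \alpha}(\alpha, \beta) > 0, \qquad \frac{\partial x_{i,n}}{\partial \beta}(\alpha, \beta) < 0 \quad (\alpha \neq \beta); \qquad \frac{\partial x_{i,n}}{\partial \alpha}(\alpha, \alpha) \gtrless 0 \ \text{ according as } \ x_{i,n} \lessgtr 0.$$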
















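$$-\cos\frac{2i-1}{2n+1}\,\pi \;\le\; x_{i,n}(\alpha, \beta) \;\le\; -\cos\frac{2i}{2n+1}\,\pi \qquad \left(\tfrac12 \le \alpha, \beta \le \tfrac32;\ i = 1, 2, \dots, n\right), \tag{25}$$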
Zi,n(a, 8B) = —cos ST m+ (4). 
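The bounds in (25), read here as $-\cos\frac{(2i-1)\pi}{2n+1} \le x_{i,n}(\alpha,\beta) \le -\cos\frac{2i\pi}{2n+1}$, can be sanity-checked in the two symmetric cases where the zeros are explicit. The sketch below is only an illustration. Its function names are ours, and it assumes that $\alpha = \beta = 1/2$ and $\alpha = \beta = 3/2$ correspond to the Chebyshev polynomials of the first and second kind, i.e. that the symmetric weight is $(1 - x^2)^{\alpha - 1}$.

```python
import math

def bounds_25(i, n):
    """Lower and upper bound of (25) for the i-th zero, i = 1..n (our reading of the formula)."""
    lower = -math.cos((2 * i - 1) * math.pi / (2 * n + 1))
    upper = -math.cos(2 * i * math.pi / (2 * n + 1))
    return lower, upper

def first_kind_zero(i, n):
    # alpha = beta = 1/2 (assumed normalization): zeros of T_n in increasing order.
    return -math.cos((2 * i - 1) * math.pi / (2 * n))

def second_kind_zero(i, n):
    # alpha = beta = 3/2 (assumed normalization): zeros of U_n in increasing order.
    return -math.cos(i * math.pi / (n + 1))

def check_25(n, eps=1e-12):
    """True if both explicit families of zeros lie inside the bounds for every i."""
    for i in range(1, n + 1):
        lower, upper = bounds_25(i, n)
        for z in (first_kind_zero(i, n), second_kind_zero(i, n)):
            if not (lower - eps <= z <= upper + eps):
                return False
    return True
```

Calling `check_25(n)` for a range of degrees exercises only the two explicit cases; it says nothing about intermediate parameter values, for which the text relies on Markoff's theorem.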

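We may generalize this result for all $\alpha, \beta$ by means of (16), Markoff’s theorem, and [11]

$$x_{i,n+1}(\alpha, \beta) < x_{i,n}(\alpha, \beta + 1), \qquad x_{i,n}(\alpha + 1, \beta) < x_{i+1,n+1}(\alpha, \beta). \tag{26}$$

For example, if $\alpha, \beta \ge 1/2$, there exist non-negative integers $r, s$ such that

$$\tfrac12 \le \alpha - 2r - \sigma_1 < \tfrac32, \qquad \tfrac12 \le \beta - 2s - \sigma_2 < \tfrac32 \qquad (\sigma_1, \sigma_2 = 0, 1).$$

If $s + 1 < i \le n - r - 1$, then

$$x_{i-s-1,n}(\alpha - 2r - \sigma_1, \beta - 2s - \sigma_2) \le x_{i-s-1,n}(\alpha, \beta - 2s - \sigma_2) < x_{i-s,n}(\alpha, \beta - 2s) \le x_{i,n}(\alpha, \beta)$$
$$\le x_{i+r,n}(\alpha - 2r, \beta) < x_{i+r+1,n}(\alpha - 2r - \sigma_1, \beta) \le x_{i+r+1,n}(\alpha - 2r - \sigma_1, \beta - 2s - \sigma_2).$$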
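Hence, for all $\alpha, \beta \ge 1/2$,

$$-\cos\frac{2i - 2s - 3}{2n+1}\,\pi < x_{i,n}(\alpha, \beta) \le -\cos\frac{2i + 2r + 2}{2n+1}\,\pi \tag{27}$$

$$\left(\tfrac12 \le \alpha - 2r - \sigma_1 < \tfrac32,\ \tfrac12 \le \beta - 2s - \sigma_2 < \tfrac32;\ \sigma_1, \sigma_2 = 0, 1;\ s + 1 < i \le n - r - 1\right).$$

It follows that

$$x_{i,n}(\alpha, \beta) \to -\cos t\pi \quad \text{if } \frac{i}{n} \to t \text{ as } n \to \infty \qquad (\alpha, \beta > 0). \tag{28}$$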
Markoff’s theorem and (16) give, further, 
2 . oF 
_— < Lm+1,2m (a, a) ~ Lm+1,2m(3; 3) = sin 4m <. ix (a = $), 
sin a = Lm+1,2m(3, 3) %. Tm+1,2m(at, a) < Lm+1,2m(4, a) 
ioe 3 
< Xms2,em(3, 4) = sin < rm (0<a<}), 
(29) wa 
6 . T T 
a. « ee ai Pf, ae 
2m +at) < Imi2,2 sila, alsa 42,2 iG, 3) in am +1 < am +1 
(a = 3), 
sin ssi = Lmi2,2m41(3 5 3) < Ln42,2m41(@e, a) < Lm42,2m41(3, a) 


cil an 0<a<}). 





< 2m43,2m1(4, 3) = sin 


Im +1 ~2m+1 










If « = 3, the upper bounds in (29) are better than in (19). Moreover, 


‘ 2-—-l+o fr 
sin (2 > l+e . 5) = Im+ite.am+e(, 3) < Lm bi4e,2m+e(@, a) 


= S Im+item+e(3, 3) = Sin And Se Be 


(§ Sa Ss }3;0 =0,1;¢ = 1,2,---,m). 
We thus obtain the following asymptotic relations: 
. (/2—-l+ornr i 
m i-+o,2m-+0\ , ) =s “war ee * O a 
sean ai in(Poite ) 4 5) 
(31) (} Sa S3;0=0,1;1 >0), 





2m+1+¢ 
2i—l+o 


, ws? 
Im+i+e2m+e(a, a) a 9 if op 0 asm-— @®, 
s m 


Making use of (17), we have 


Qn+l+ocf ase. al ; a 7 
R24 + xin(} + ¢, B)] > if - O asn—-o 
(3 =ps };¢ = 0,1), 

(32) 2 | 2 1) 

rind, B) = 147420, 2.G,6) = -14 =) Osesp, 

Xn nla, §) = 1+ te SF Ta nla, 3) = 1+ . +“ (} Sa S 3). 
In view of (16, 17, 29, 32), we have 
= nf ae i 
(33) HB [1 + 2;,,(a, B)| — 3 if i;sn— x, af 0 (0 <a, 8 S 3). 


Although derived for $\alpha, \beta \le 3/2$, (33) remains valid, in view of (16), for all $\alpha, \beta > 0$ (as in the case of (28)).


6. We apply the foregoing bounds for the zeros $x_{i,n}$ to the mechanical quadratures coefficients $H_{i,n}(\alpha, \beta) = H_{i,n}$ (see (4)) by means of the Tchebycheff inequality

$$0 < H_{i,n} < \int_{x_{i-1,n}}^{x_{i+1,n}} p(x)\,dx \qquad (x_{0,n} = -1,\ x_{n+1,n} = 1;\ i = 1, 2, \dots, n). \tag{34}$$
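The inequality (34), read here as $0 < H_{i,n} < \int_{x_{i-1,n}}^{x_{i+1,n}} p(x)\,dx$ with $x_{0,n} = -1$ and $x_{n+1,n} = 1$, can be illustrated in the one case where everything is explicit. The sketch below assumes the weight $p(x) = (1 - x^2)^{-1/2}$ (taken to be the case $\alpha = \beta = 1/2$), for which the quadrature coefficients all equal $\pi/n$ and the zeros are $-\cos\frac{(2i-1)\pi}{2n}$. The function name is ours, and the check is restricted to $n \ge 2$, since for $n = 1$ the interval is all of $(-1, 1)$ and the two sides coincide.

```python
import math

def separation_check(n):
    """Check 0 < H_i < integral of p over (x_{i-1}, x_{i+1}) for p(x) = (1 - x^2)^(-1/2)."""
    # Write x_i = -cos(theta_i); the endpoints -1 and 1 correspond to theta = 0 and pi.
    theta = [0.0] + [(2 * i - 1) * math.pi / (2 * n) for i in range(1, n + 1)] + [math.pi]
    h = math.pi / n  # every Gauss-Chebyshev coefficient equals pi/n
    # With x = -cos(theta), the integral of dx / sqrt(1 - x^2) over (x_{i-1}, x_{i+1})
    # is simply theta_{i+1} - theta_{i-1}.
    return all(0 < h < theta[i + 1] - theta[i - 1] for i in range(1, n + 1))
```

For interior indices the right-hand side is $2\pi/n$, and for $i = 1$ or $i = n$ it is $3\pi/(2n)$, so the bound is within a factor of two of the true coefficient in this case.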


It is known [9] that $H_{i,n}(\alpha, \beta) = H_{n-i+1,n}(\beta, \alpha)$. Also, by a method similar to that which leads to (17), we get


Hin(}, B) = 2 Has i2n(B, B), 


(35) 
H;,»(3, B) = Ph ro iatengs(B, B)H n+ i+1,2n+1(B, B) (i = 1,2,---,n). 













In what follows, we obtain an upper bound of the order i for all H;,, such 
that 1 <i S C (arbitrarily fixed constant) in which case Winston [9] gave a 


ew : 
lower bound of the order — if a > 3. Hence, the true order, with respect to n, 
n? 


i . : . ? , 
of Hy,» is = (a > 3,1 <¢*SC). It is sufficient to illustrate the procedure in 


Hin ‘ (Tisa.n 2 Ti-1.n)(1 + Zi-1.2)" (1 = isan)? 


the special case } S a, 8 <1. We have (by (34)), 


=~ £ , ‘ 
- and increasing 





(Using the fact that p(x) is decreasing for —1 < x < - 





5-9 
for =f. < x < 1 does not give any essential improvement.) By (25), 
a+ fs—2 
4-1-7 5 1 . &#—-3 ef” 
Hi. ™ a E hs | 4 [ an +1 | 
< in sai 3 In ntl 9 a ey 9 
2(3—1) 
+2 1 tA 4 
° | cos 2 =e 4] = O( stun) fz= O(n ) (0 < ée< 1). 


Likewise, if 0 < a = B < $, then Hasjomse = o(+), flsisC(e¢ =0, 1). 


Since Winston gave a lower bound for H;,,, of the same order, this is its true order. 

H,,, and H,,,, may be treated as follows. ((34) gives an nope bound in case 
a = 1 but p(x) > ~ asx — —1 for0 <a< 1.) Fora > } and n sufficiently 
large, Winston showed th: at Hy, < He,,. He gave also an upper bound of the 





1 ; , ‘ 
order sz for Hi, if a S 3, except when B < @ in which case H,,, < He,,, so 
n? 


] - mm , : 

that H,,, = (4). On the other hand, if ¢g,(x) is the normalized Jacobi poly- 
ie 

nomial of degree n, 


gots "T(@)I(a + IP(n + IT (n + B) =O (4) 
nea}? 





a os a)I(a + 1) 
a oa? 97 "Ta eet Dletese 


[11], since Ki,(x) 


Il 


(= e(e)) < Oforx < x,,. Hence, the true order of H,,,, is 
i=0 





1 ; ; , 
—— foralla > 0. Likewise, the true order of H,,,,, is - for all 8 > 0. 


The above upper bounds enable us, in case the behavior of $f(x)$ near $x = \pm 1$ is properly specified, to estimate how much the mechanical quadratures formula (4) is affected if we omit even infinitely many of the terms $H_{i,n} f(x_{i,n})$ corresponding to the $x_{i,n}$ near $\pm 1$.

Illustration. Consider f(x) such that 









| f(z) | = (a) (0 <1—{2| < 6; 4,1 positive constants). 
Then we can omit in (4) all terms H;,,,f(z;,,) corresponding to 
(36) i < Cin‘ ori > n — Czn‘ (C; > 0, C2 > 0, € given constants, 0 < « < 1), 
provided’ 8 = a, 2a — «(2a — 1) > 2. In fact, for all such 7, | z;,. | — 1 


(n —» ©) (see (27)), so that, by (18), for n sufficiently large, 


‘ ; 2a 
O9<li= Tin | < 4, 1 -— | Zin > —_—_——_—_—_—___—_—_-.. 
| tial - 2inta+Bp—1) 


Thus | 2. FD inf (Xin) |, extended over all 7 as given in (36), is = O( <aears) 


>Oasn—- ~, 


7. The method of the preceding section can be applied to Laguerre polynomials, in which case $p(x) = x^{\alpha - 1} e^{-x}$ ($\alpha > 0$) and the interval of orthogonality is $(0, \infty)$.

(i) 0<a@< 1. Using (34) and the bounds for z;,,, given by Winston, we get 

(i+2V aq-—2)? a 


- tint a8 fe . z = 2 a 
Hin <e , je 2V/ a2 ay’ =\ 
\ a 





4(n + 1) : 


‘\GF Wee — 2°(n a) @ 
( 2 
a= max a, (+43) (i = 2,3,---,n— 1). 





f 16 +a+1)'(n+1) a _ i} 


(ii) a>. 


(i+2/ a—2)? 


1G+a+1%n+1) fs 


Gu 3 «+. 2=— 2). 








<a ast (i + 2Va — 2)*(n + a)| 
\ n+a 


In particular, 


, o(=) (1<i<C,a>0), 
a t C’a 
Hi... = O(e-"’n*) (0<csiv=F%,a>0). 

n 4ae 


(Since Hy... < Hy-1.., at least if n is sufficiently large [9], this remains valid for 


* This restriction is not essential, since we may always interchange a, 8. 













. " l ; 
i = n.) Winston gave a lower bound of the order — for Hin (a > 43% = 
n 


~) 


- ,n) and showed that? 
Hi. = o() (0<a@Z3); Hin< He, (a> 4,n sufficiently large). 


Here, [11], 





1 al*(@)PF(n +1) _ () 
H,,, a» K,(0) = Tintat+ 1) = O n@ (a > 0). 


Hence, the true order of H,,, is “3 (a > 0) and the true order of H;,, is - (a > }, 


1<iSC). We note, however, that H;,, is not of the same order with respect 
to n for all 7. 

In accordance with the results of Winston, similar results could be obtained 
for Hermite polynomials. 


8. The above classical orthogonal polynomials of Jacobi (J), Laguerre (L), and Hermite (H) possess the remarkable property, important for the preceding discussion, that the derived polynomials are again orthogonal polynomials J, L, and H respectively, but with new parameters. For Hermite polynomials, we have a still more remarkable result: $\phi_n'(x) = n\phi_{n-1}(x)$, i.e., the weight function remains unchanged, or, in other words, Hermite polynomials form a system of Appell [13] polynomials. We close by giving a simple proof that the Hermite polynomials are the only orthogonal polynomials with this property.⁸
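The Appell property $\phi_n'(x) = n\phi_{n-1}(x)$ is easy to confirm for the first few Hermite polynomials. The sketch below is illustrative only: it assumes the monic normalization generated by the recurrence $He_{k+1}(x) = x\,He_k(x) - k\,He_{k-1}(x)$ (weight $e^{-x^2/2}$), which may differ from the paper's normalization by a linear change of variable, and the function names are ours.

```python
def monic_hermite(n_max):
    """Coefficient lists (lowest degree first) of He_0, ..., He_{n_max}."""
    polys = [[1], [0, 1]]
    for k in range(1, n_max):
        shifted = [0] + polys[k]                                   # x * He_k
        prev = polys[k - 1] + [0] * (len(shifted) - len(polys[k - 1]))
        polys.append([s - k * p for s, p in zip(shifted, prev)])   # x*He_k - k*He_{k-1}
    return polys[: n_max + 1]

def derivative(coeffs):
    """Derivative of a polynomial given by its coefficient list (lowest degree first)."""
    return [j * c for j, c in enumerate(coeffs)][1:] or [0]

def has_appell_property(polys):
    """True if polys[n]' == n * polys[n-1] for every n >= 1."""
    return all(
        derivative(polys[n]) == [n * c for c in polys[n - 1]]
        for n in range(1, len(polys))
    )
```

For example, `monic_hermite(3)[3]` is `[0, -3, 0, 1]`, i.e. $x^3 - 3x$, whose derivative $3x^2 - 3$ is three times $x^2 - 1$.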

THEOREM 4. If $\{\phi_n(x) = x^n + \cdots\}$, $n = 0, 1, \dots$, is an orthogonal system of polynomials, i.e., $\int_{-\infty}^{\infty} \phi_m(x)\phi_n(x)\,d\psi(x) = 0$ ($m \neq n$, $\psi(x)$ monotone non-decreasing), such that $\phi_n'(x) = n\phi_{n-1}(x)$, then $\{\phi_n(x)\}$ is reducible, except for constant factors, to the Hermite system $\{H_n(x)\}$ of polynomials by means of a linear transformation on $x$.
Proof. Write with Appell [13]

$$\phi_n(x) = x^n + n a_1 x^{n-1} + \frac{n(n-1)}{2!}\, a_2 x^{n-2} + \cdots + n a_{n-1} x + a_n, \tag{37}$$

where $a_1, a_2, \dots, a_n, \dots$ are given constants uniquely determined when


7 Winston’s result (corrected for a slight misprint) is 


r(a)P(n) (1 pees 
H., ¢ TORO) _ of 4) O<as};i =1,2, ,n). 


8 The same result has recently been derived by Meixner [12] as a consequence of more 


general considerations. 
® The present proof was obtained before the proof recently published by Shohat [15]. 










$\{\phi_n(x)\}$ is given. It follows (by induction for $n > 1$) from the fundamental recurrence relation [11]

$$\phi_{n+1}(x) = (x - c_{n+1})\,\phi_n(x) - \lambda_{n+1}\,\phi_{n-1}(x) \qquad (n = 1, 2, \dots)$$

that

$$\phi_n(-x - a_1) = (-1)^n \phi_n(x - a_1) \qquad (n = 0, 1, \dots).$$


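Hence, $\phi_n(x - a_1)$ contains only even or only odd powers of $x$, so that

$$\phi_{2m+\sigma}(x - a_1) = \sum_{i=0}^{m} C_{i,m}\, x^{2(m-i)+\sigma}, \qquad \phi_{2m+\sigma}(x) = \sum_{i=0}^{m} C_{i,m}\,(x + a_1)^{2(m-i)+\sigma} \qquad (\sigma = 0, 1;\ C_{i,m} \text{ constants}). \tag{38}$$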


Comparing (37, 38), we obtain the important relation 


P2», igir) = > C; i + q (x / a) 2(m—i)+e 


(39) i=0 

(o = 0,1; m = 0,1, --- ; C; constants), 
where the C; are functions of the a; (¢ = 1, 2, --- ) and do not depend explicitly 
on m (Co = 1). We note the additional property of (39) that only even or only 
odd powers of x + a, occur. Moreover, by the orthogonality property, 


/ (x + a1) ,(x)db(r) = x’ o,(a — a)dy(x — a) = 0 


x J—e 


(40) 
(¢ = 0,1,---,n—1). 


Letting 


B; = / (2 + a;)' d(x) = / rdyiz—a), Bo>O0 (a = 0,1,---), 


a 2 


we have [11] 


Bo Bi: B 
Bi Be s Bri } Bo Bp | 
| | 
cea Bi. mer” PORT Cee. _ = 
B 1B , Bo, 1 |B, 1 Be, 2 
| @ r 
We see that the sequence ¢,(@ — a), n = 0, 1, --- , is a sequence of symmetric 
orthogonal polynomials whose moments are 8), 8, ---. If we take n = 1, 2, 


- successively in (40), it follows that 


(41) Bair = 0 (i = 0,1,---) 













Substituting (39) in the first integral (40), we obtain now (for « = 0, 1) the 2m 
equations 


42 2m + 1 } 
( ) (2m a DC», Be ao ( me )en 1B oe see oS be t ') Ci Bom + Bom+2 = 0, 


(em 1) CB 2 + Bam = 0 
In particular, 
C, Bo + Be = 0. 
By means of (42), we may eliminate C; , C2, --- ,C and express By; (¢ = 
2, 3, --- ) uniquely in terms of 6), Be. Evidently, we may assume 6) fixed 


(say = 1). Hence, 6, C; (¢ = 1,2, --- ) are determined as soon as Be (> 0) 
is give n. 

It is clearly sufficient to show that all systems of polynomials given in the 
form (39), which satisfy (40), are reducible to one system. Let {¢,(x)} be 
such a system where a; , 82 have certain given values d; , Be respectively. By the 
preceding, Ba; , C; for the system {¢,(zx)} will have certain values B, , C; (i = 
1, 2, --- ). -Consider now another such system {¢,(x)} determined by different 
a, Be. Set Bo = ch (c = certain positive constant). Then, by induction 
from (42), we get for the B:; , C; corresponding to the system {¢,(x)}, the im- 
portant relations 


Bo; = c Bai, C; = oC; (§ = 1,2,---). 
It follows by (39) that 


domse(t + dy — a) = > C; ‘e + ’) s+h-a +e)" 


i=0 
= > cd, ‘4 ’) (e+ emit 
a camte 2m + oc r+ a 2(m—i) +e 
7 yo.(, \(=") ; 


or, again, if we apply (39) to dense(x), 


demie(Cr + cay = a) == ent’ g demie(X). 













Thus all systems of orthogonal, Appell polynomials are reducible, by the linear 
transformation x/cx + ca, — a, to one system, which is necessarily the system 
of Hermite polynomials 
n(n —1) »-2 . n(n — 1)(n — 2)(n — 3) nag 
— ——$—=_—$ = 


HA2) = 2° -— — poe root ot 





since the latter is a system of Appell polynomials. 


BIBLIOGRAPHY 


1. Th. Stieltjes, Sur les polynomes de Jacobi, Comptes rendus, vol. 100 (1885), pp. 620-622.
2. K. Shibata, On the distribution of the roots of a polynomial satisfying a certain differential equation of the second order, Jap. J. Math., vol. 1 (1924), pp. 147-153.
3. W. Lawton, On the zeros of certain polynomials related to Jacobi and Laguerre polynomials, Bull. Amer. Math. Soc., vol. 38 (1932), pp. 442-448.
4. D. Hilbert, Ueber die Discriminante der im endlichen abbrechenden hypergeometrischen Reihen, J. für Math., vol. 103 (1888), pp. 337-345.
5. F. Klein, Ueber die Nullstellen der hypergeometrischen Reihe, Math. Annalen, vol. 37 (1890), pp. 573-590.
6. E. Van Vleck, A determination of the number of real and imaginary roots of the hypergeometric series, Trans. Amer. Math. Soc., vol. 3 (1902), pp. 110-131.
7. A. Hurwitz, Über die Nullstellen der hypergeometrischen Funktion, Math. Annalen, vol. 64 (1907), pp. 517-560.
8. E. R. Neumann, Beiträge zur Kenntnis der Laguerreschen Polynome, Jahresber. der Deut. Math. Ver., vol. 30 (1921), pp. 15-35.
9. C. Winston, On mechanical quadratures formulae involving the classical orthogonal polynomials, Annals of Math., vol. 35 (1934), pp. 658-677.
10. W. Hahn, Die Nullstellen der Laguerreschen und Hermiteschen Polynome, Schriften des Math. Sem. und des Inst. für angewandte Math. der Univ. Berlin, vol. 1 (1933), pp. 213-244.
11. J. Shohat, Théorie générale des polynomes orthogonaux de Tchebichef, Mémorial des Sc. Math., vol. 66, Paris, 1934.
12. J. Meixner, Orthogonale Polynomsysteme mit einer besonderen Gestalt der erzeugenden Funktion, Jour. London Math. Soc., vol. 9 (1934), pp. 6-13.
13. P. Appell, Sur une classe de polynomes, Ann. Sc. Ec. Norm. Sup., vol. 9 (1880), pp. 119-144.
14. C. Buell, The zeros of Jacobi and related polynomials, this Journal, vol. 2 (1936), pp. 304-316.
15. J. Shohat, The relation of the classical orthogonal polynomials to the polynomials of Appell, Am. Jour. Math., vol. 58 (1936), pp. 453-464.




THe UNIVERSITY OF PENNSYLVANIA. 








RINGS OF SETS 
By GARRETT BIRKHOFF 


1. Definitions. Following Hausdorff,¹ a family $\mathfrak{F}$ of subsets of a class $I$ is said to form a “ring” if and only if it contains, with any two sets² $S$ and $T$, their sum (or union) $S \cup T$ and their product (or intersection) $S \cap T$. Clearly a ring contains, with any finite number of subsets $S_1, \dots, S_n$, their sum $S_1 \cup \cdots \cup S_n$ and their product $S_1 \cap \cdots \cap S_n$.

The family $\mathfrak{F}$ is said to constitute a “complete ring” if and only if it contains, with any subfamily $\mathfrak{S}$ of sets $S_\alpha$, their sum $\bigvee_{\alpha \in \mathfrak{S}} S_\alpha$ and their product $\bigwedge_{\alpha \in \mathfrak{S}} S_\alpha$.

The family $\mathfrak{F}$ is also said to be a “$\sigma$-ring” if and only if it contains, with any countable subfamily $\mathfrak{S}$ of sets $S_\alpha$, their sum $\bigvee_{\alpha \in \mathfrak{S}} S_\alpha$ and their product $\bigwedge_{\alpha \in \mathfrak{S}} S_\alpha$.

It is obvious that rings containing only a finite number of sets, and $\sigma$-rings containing only a countable number of sets, are necessarily complete rings. These theorems can be improved by using chain conditions; however, the family $\mathfrak{E}$ of all finite sets of integers is a countable ring which is not a $\sigma$-ring (and a fortiori not complete), while the family $\mathfrak{C}$ of all countable subsets of the continuum is a $\sigma$-ring which is not complete.
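For a finite family, the defining condition of a ring of sets (closure under pairwise sums and products) can be tested directly. The following is a minimal sketch; the function name `is_ring` is ours, and for a finite family the notions of ring, $\sigma$-ring and complete ring coincide, so one check covers all three.

```python
from itertools import combinations

def is_ring(family):
    """True if the finite family of sets contains S | T and S & T for any two members."""
    fam = {frozenset(s) for s in family}
    return all((s | t) in fam and (s & t) in fam for s, t in combinations(fam, 2))
```

A chain such as `[{1}, {1, 2}, {1, 2, 3}]` is a ring, whereas `[{1}, {2}]` is not, since it contains neither the sum `{1, 2}` nor the empty product.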


2. The importance of the subject. Rings of sets are mathematically important for a number of reasons. They are conceptually important because one can define them so simply in terms of two fundamental operations. They are also important because the sets of any class $I$ carried within themselves by any one-valued transformation of $I$ into itself are a complete ring. (The proof of this will be left to the reader.) Also, as is well known, the open and closed subsets of any topological space constitute rings, and the measurable subsets of any Cartesian $n$-space constitute a $\sigma$-ring.

Again, the reader will immediately see that

(2α) The sets common to all the rings (resp. $\sigma$-rings or complete rings) of any aggregate of rings of subsets of any class $I$ themselves form a ring (resp. $\sigma$-ring or complete ring).

It follows that the closed subsets of any topological space $\Sigma$ invariant under any group of transformations constitute a ring. The study of these rings is important in dynamics,³ where, however, the existence of minimal closed and connected constituents introduces special considerations. It follows also that


Received January 16, 1937. 

1 Mengenlehre, 1927 (2d ed.), p. 77.

2 We shall systematically use small Latin letters to denote elements, Latin capitals to denote sets of elements, and German capitals to denote families of sets.

3 Especially in the theory of so-called “central motions”. Cf. G. D. Birkhoff, Dynamical Systems, 1927, Chap. VII, §6 ff.













the subsets of any class $I$ carried within themselves under any aggregate $A$ of one-valued transformations $\tau_\alpha$ of $I$ into itself form a complete ring.

Moreover, all complete rings of sets belong to at least one aggregate $A$ of transformations in this way. More precisely, any complete ring $\mathfrak{R}$ of sets belongs to the “groupoid”⁴ of all one-valued transformations carrying every $S \in \mathfrak{R}$ into itself. This shows that rings of sets play the same rôle in the theory of groupoids of one-valued transformations as is played by transitivity and intransitivity in the theory of groups of permutations (one-one transformations).⁵


3. Equivalent notions. If we add the empty set $O$ and the all-set $I$ to any complete ring of subsets of a class $I$, we still have a complete ring. Hence the theory of rings of sets is contained in that of rings containing $O$ and $I$.

THEOREM 1. The complete rings of subsets of $I$ which contain $O$ and $I$ can be identified with the different quasi-orderings of $I$ or with the different completely distributive topologies on $I$.

Explanation 1. By a “quasi-ordering” of $I$ is meant a binary relation $x \geq y$ satisfying

P1: $x \geq x$ (reflexiveness),

P2: $x \geq y$ and $y \geq z$ imply $x \geq z$ (transitivity).

By a “completely distributive topology” is meant a unary operation $S \to \bar{S}$ (called closure) on the subsets of $I$ which satisfies

C1: $\bar{S} \geq S$, C2: $\bar{O} = O$, C3: $\bar{\bar{S}} = \bar{S}$;

C4: if $S = \bigvee S_\alpha$ then $\bar{S} = \bigvee \bar{S}_\alpha$.

(These are related to well-known axioms of Hausdorff on “partial ordering”, and of Riesz-Kuratowski on closure.)

Explanation 2. By an “identification” we mean a one-one correspondence preserved under all permutations of the elements of $I$. It follows that if we call two families of sets of $I$ resp. two relations on $I$ resp. two operations in $I$ “equivalent” if and only if there exists a permutation of the elements of $I$ carrying one into the other, then the numbers of non-equivalent rings of sets, of non-equivalent quasi-orderings, and of non-equivalent completely distributive


4 A family $G$ of one-valued transformations of $I$ into itself is termed a “groupoid” if and only if it contains the identity $\iota: x \to x$ and the product $\sigma\tau: x \to \tau[\sigma(x)]$ of any two of its members $\sigma$ and $\tau$. The author is preparing an article on groupoids in collaboration with S. Ulam.

5 The sets invariant (i.e., the sets identical with, and not merely supersets of, their transforms) under any permutation or set of permutations constitute a “complete field”, i.e., a complete ring which contains, with any set, its complement. Moreover, any complete field belongs to the group of all transformations leaving its subsets invariant (“intransitive” on its subsets); this leads to the usual partial descriptions of groups of permutations through their “transitive systems”.

Actually, in the case of groups of permutations of $I$, any subset carried within itself under all their permutations is necessarily invariant.










topologies on $I$ are the same, as well as the numbers of distinct complete rings of sets, of distinct quasi-orderings, and of distinct completely distributive topologies.

Proof of theorem. Let $\mathfrak{R}$ be any complete ring of sets containing $O$ and $I$. Make the definitions: (1) $x \geq y$ $(\mathfrak{R})$ means that every $S \in \mathfrak{R}$ containing $x$ contains $y$, and (2) $\bar{S}$ is the product of all sets $T_\alpha \in \mathfrak{R}$ containing $S$. That the relation and operation so introduced satisfy P1-P2 and C1-C2 is obvious; it is also obvious that the correspondence between them and $\mathfrak{R}$ is preserved under all permutations of the elements of $I$.

To prove C3-C4, recall that, since $\mathfrak{R}$ is a complete ring, $\bar{S}$ is the least set in $\mathfrak{R}$ containing $S$. This proves C3 and

(3α) $S \in \mathfrak{R}$ if and only if $S = \bar{S}$.

Now suppose $S = \bigvee S_\alpha$. Clearly $\bar{S} \geq \bar{S}_\alpha$ irrespective of $\alpha$; hence $\bar{S} \geq \bigvee \bar{S}_\alpha$. But conversely $\bigvee \bar{S}_\alpha \in \mathfrak{R}$, since $\mathfrak{R}$ is a complete ring, and $\bigvee \bar{S}_\alpha \geq \bigvee S_\alpha = S$; hence $\bigvee \bar{S}_\alpha \geq \bar{S}$. This proves C4.

It remains to prove that every quasi-ordering and every completely distributive topology belong to such an $\mathfrak{R}$, and that distinct $\mathfrak{R}$ determine distinct quasi-orderings and distinct topologies: four assertions in all.

By (3α), if $\mathfrak{R} \neq \mathfrak{R}'$, then certainly $\mathfrak{R}$ and $\mathfrak{R}'$ yield distinct topologies. This proves one assertion. We next wish to prove that

(3β) Every completely distributive topology is determined by a suitable $\mathfrak{R}$.

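Under any such topology, consider the family $\mathfrak{F}$ of “closed” sets $S = \bar{S}$. Clearly $\mathfrak{F}$ contains $O$, $I$, and (by C4) $\bigvee S_\alpha$ if it contains every $S_\alpha$. But it also contains $\bigwedge S_\alpha$ under the same hypotheses.

Proof. If $S_\alpha = \bar{S}_\alpha$ for every $\alpha$, then $\bigwedge S_\alpha = \bigwedge \bar{S}_\alpha$, and so $\overline{\bigwedge S_\alpha} \leq \bar{S}_\alpha$ for all $S_\alpha$, whence $\overline{\bigwedge S_\alpha} \leq \bigwedge \bar{S}_\alpha = \bigwedge S_\alpha$, and so by C1 $\bigwedge S_\alpha$ is closed. Hence $\mathfrak{F}$ is a complete ring of sets with $O$ and $I$. Moreover, if $S$ is any set, then $\bar{S}$ is the product of the $T_\alpha \in \mathfrak{F}$ containing $S$: by C1, $\bigwedge T_\alpha = \bigwedge \bar{T}_\alpha \geq \bar{S}$, and, by C1-C3, $\bar{S}$ is a closed set $T_\alpha \geq S$. Thus $\mathfrak{F}$ “determines” the given topology. This proves (3β).

Again, if $\mathfrak{R}$ is given, then

(3γ) $S \in \mathfrak{R}$ if and only if $x \in S$ and $x \geq y$ $(\mathfrak{R})$ imply $y \in S$.

Proof. If $S \in \mathfrak{R}$, by definition the second statement holds. Conversely if the second statement holds, then $S$ contains, with every $x$, the set $S(x)$ of all $y \leq x$ $(\mathfrak{R})$, i.e., the product of the $S_\alpha \in \mathfrak{R}$ with $x \in S_\alpha$; obviously $S(x) \in \mathfrak{R}$, and so $S$ is the sum $\bigvee S(x)$ of the $S(x)$ of the $x \in S$, and is in $\mathfrak{R}$. By (3γ), if $\mathfrak{R} \neq \mathfrak{R}'$, then $\mathfrak{R}$ and $\mathfrak{R}'$ determine different quasi-orderings.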

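Finally, every quasi-ordering $\rho$ is determined by some $\mathfrak{R}$. For, given $\rho$, let $\mathfrak{R}(\rho)$ consist of all $S$ such that $x \in S$ and $x \geq y$ imply $y \in S$. Clearly $O \in \mathfrak{R}(\rho)$ and $I \in \mathfrak{R}(\rho)$. Also, if a family $\mathfrak{S}$ of $S_\alpha$ is in $\mathfrak{R}(\rho)$, then $(\bigvee S_\alpha) \in \mathfrak{R}(\rho)$ and $(\bigwedge S_\alpha) \in \mathfrak{R}(\rho)$. Thus $\mathfrak{R}(\rho)$ is a complete ring of sets containing $O$ and $I$. Further, if $x \geq y$, then obviously $x \geq y$ $(\mathfrak{R}(\rho))$ in the sense that $x \in S \in \mathfrak{R}(\rho)$ and $x \geq y$ imply $y \in S$. Conversely if $x \geq y$ $(\mathfrak{R}(\rho))$, then the set $S(x)$ of all $z \leq x$ (which is in $\mathfrak{R}(\rho)$ by P2 and contains $x$ by P1) contains $y$, by definition of $x \geq y$ $(\mathfrak{R}(\rho))$, and so $x \geq y$. This proves the fourth assertion.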
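On a finite class the correspondence of Theorem 1 between quasi-orderings and complete rings containing $O$ and $I$ can be verified by brute force. The sketch below does this for a three-element class. All names are ours, a quasi-ordering is represented as a set of pairs `(x, y)` meaning $x \geq y$, and the class `I = (0, 1, 2)` is an arbitrary illustrative choice.

```python
from itertools import combinations, product

I = (0, 1, 2)
SUBSETS = [frozenset(c) for r in range(len(I) + 1) for c in combinations(I, r)]

def ring_of(order):
    """All S such that x in S and x >= y imply y in S (the ring R(rho) of the proof)."""
    return frozenset(S for S in SUBSETS if all(y in S for (x, y) in order if x in S))

def order_of(ring):
    """x >= y iff every set of the ring containing x contains y."""
    return frozenset((x, y) for x in I for y in I if all(y in S for S in ring if x in S))

def quasi_orderings():
    """All reflexive, transitive relations on I."""
    off_diagonal = [(x, y) for x in I for y in I if x != y]
    for bits in product((0, 1), repeat=len(off_diagonal)):
        rel = {(x, x) for x in I} | {p for p, b in zip(off_diagonal, bits) if b}
        if all((x, z) in rel for (x, y) in rel for (y2, z) in rel if y == y2):
            yield frozenset(rel)

def complete_rings():
    """All families containing O and I that are closed under sums and products."""
    empty, full = frozenset(), frozenset(I)
    middle = [S for S in SUBSETS if S not in (empty, full)]
    for bits in product((0, 1), repeat=len(middle)):
        fam = {empty, full} | {S for S, b in zip(middle, bits) if b}
        if all((S | T) in fam and (S & T) in fam for S in fam for T in fam):
            yield frozenset(fam)
```

Mapping every quasi-ordering through `ring_of` should produce each complete ring exactly once, and `order_of(ring_of(rho))` should return `rho`, which is the content of the third and fourth assertions of the proof in this finite setting.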


4. The case of fields of sets. Which quasi-orderings and which completely distributive topologies correspond to complete fields⁶ of sets? And what does this make Theorem 1 reduce to for fields of sets?

THEOREM 2. In Theorem 1, a quasi-ordering corresponds to a (complete) field of sets if and only if it is an equivalence relation; a topology does, if and only if the closures of its points are the subsets of a partition of $I$.

Explanation. By an “equivalence relation” is meant a quasi-ordering which satisfies

P3′: $x \geq y$ implies $y \geq x$.

By a “partition” of a class $I$ is meant a division of its elements into disjoint subsets, whose sum is $I$.

Proof. Let $\mathfrak{R}$ be a complete ring of sets, and let $S(x)$ be the product of the sets $S_\alpha \in \mathfrak{R}$ containing $x$. Then $x \geq y$ $(\mathfrak{R})$ means $y \in S(x)$. If $\mathfrak{R}$ is a field, and $y \in S(x)$, then the complement $S'(y)$ of $S(y)$ cannot contain $x$ (otherwise $x \in S'(y) \cap S(x) \leq S(x) - y < S(x)$), and so $y \in S(x)$ implies $x \in S(y)$. This proves P3′. Again, topologically, $S(x)$ is the closure of $x$. Hence if $\mathfrak{R}$ is a field, unless $S(x)$ and $S(y)$ are disjoint, $S(x) \cap S(y)$ contains some point $z$, and $x \geq z$ and $y \geq z$, whence by P2 and P3′ $x \geq y$ and $y \geq x$, and therefore $S(x) = S(y)$.

Conversely, if P3′ holds, and $\mathfrak{R}$ is the family of sets $S$ such that $x \in S$ and $y \leq x$ imply $y \in S$, then $S \in \mathfrak{R}$ implies that $x$ not in $S$ and $y \leq x$ imply $y$ not in $S$ (otherwise $y \in S$ and $x \leq y$ by P3′), whence $S' \in \mathfrak{R}$ and $\mathfrak{R}$ is a field. The fact that the sums of the parts of any partition of $I$ are a complete field of sets is obvious.

COROLLARY. The complete fields of subsets of $I$ which contain $O$ and $I$ can be identified with the different equivalences on $I$ or with the different partitions of $I$.
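The Corollary can be illustrated on a small class: every complete field of subsets is recovered from the partition formed by its minimal non-empty members, so the fields are as numerous as the partitions. The sketch below enumerates the fields of a three-element class by brute force; the function names are ours.

```python
from itertools import combinations, product

def complete_fields(points):
    """All families of subsets containing O and I, closed under sums and complements."""
    points = tuple(points)
    full = frozenset(points)
    subsets = [frozenset(c) for r in range(len(points) + 1) for c in combinations(points, r)]
    for bits in product((0, 1), repeat=len(subsets)):
        fam = {S for S, b in zip(subsets, bits) if b}
        if (frozenset() in fam and full in fam
                and all((S | T) in fam and (full - S) in fam for S in fam for T in fam)):
            yield frozenset(fam)

def blocks(field):
    """Minimal non-empty members of a field: the parts of the corresponding partition."""
    nonempty = [S for S in field if S]
    return frozenset(S for S in nonempty if not any(T < S for T in nonempty))
```

Closure under products is not tested separately, since it follows from closure under sums and complements. For three points the enumeration yields five fields, matching the five partitions of a three-element class.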


5. Rings of sets and distributive lattices. We shall deal below with rings of 
sets without assuming completeness. 

Suppose we consider rings of sets simply as collections of symbols (forgetting that the symbols denote sets of points) related by inclusion, addition and multiplication. Then any ring of sets appears as a “distributive lattice”, or system $\mathfrak{F}$ of elements $S, T, U$ satisfying⁸

6 A (complete) ring of sets is called a (complete) field if and only if it contains the complement of every one of its members.

7 Part of this result is proved by H. Hasse, Höhere Algebra, vol. I, 1933, p. 15, and B. L. van der Waerden, Moderne Algebra, p. 14.

8 Cf. the author’s On the structure of abstract algebras, Proc. Camb. Phil. Soc., vol. 31













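L1: $S \cap S = S$ and $S \cup S = S$.

L2: $S \cap T = T \cap S$ and $S \cup T = T \cup S$.

L3: $(S \cap T) \cap U = S \cap (T \cap U)$ and $(S \cup T) \cup U = S \cup (T \cup U)$.

L4: $S \cap (S \cup T) = S \cup (S \cap T) = S$.

L6: $S \cup (T \cap U) = (S \cup T) \cap (S \cup U)$ and $S \cap (T \cup U) = (S \cap T) \cup (S \cap U)$.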
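The laws L1-L4 and L6 can be checked mechanically on any finite system given by its two operations. The sketch below is illustrative: the function `lattice_laws_hold` and the use of the divisors of 12 under greatest common divisor and least common multiple as an abstractly given distributive lattice are our own choices, not part of the text.

```python
from itertools import product
from math import gcd

def lcm(a, b):
    return a * b // gcd(a, b)

def lattice_laws_hold(elements, meet, join):
    """Check L1-L4 and L6 over all triples; meet is the product, join the sum."""
    for s, t, u in product(elements, repeat=3):
        laws = [
            meet(s, s) == s and join(s, s) == s,                              # L1
            meet(s, t) == meet(t, s) and join(s, t) == join(t, s),            # L2
            meet(meet(s, t), u) == meet(s, meet(t, u))
            and join(join(s, t), u) == join(s, join(t, u)),                   # L3
            meet(s, join(s, t)) == s and join(s, meet(s, t)) == s,            # L4
            join(s, meet(t, u)) == meet(join(s, t), join(s, u))
            and meet(s, join(t, u)) == join(meet(s, t), meet(s, u)),          # L6
        ]
        if not all(laws):
            return False
    return True

# A ring of sets: all subsets of {0, 1}, with intersection and union.
sets_example = [frozenset(), frozenset({0}), frozenset({1}), frozenset({0, 1})]
# An abstractly given lattice: divisors of 12 with gcd as product and lcm as sum.
divisors_of_12 = [1, 2, 3, 4, 6, 12]
```

Both `lattice_laws_hold(sets_example, frozenset.__and__, frozenset.__or__)` and `lattice_laws_hold(divisors_of_12, gcd, lcm)` evaluate the same five laws, which is the sense in which a ring of sets and an abstract distributive lattice are treated alike in this section.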


Moreover, two rings of sets seem indistinguishable when and only when they are “isomorphic”, i.e., admit a one-one correspondence preserving inclusion, sums and products.⁹

Conversely, every abstractly given distributive lattice is known to be obtainable from at least one ring of sets.¹⁰


6. Representation theory for distributive lattices. It is generally true in 
representation theories for abstract algebras that one gets the simplest results 
by considering homomorphic (many-one) as well as isomorphic (one-one or 
“true’’) representations. 

A full representation theory for Boolean algebras by fields of sets has been developed by Stone,¹¹ and it is interesting to see the complications which arise in the more general case of distributive lattices. These show that the assumption that complements exist cannot be eliminated in Stone’s theory.

First, let $\mathfrak{R}$ be any distributive lattice, and let $\theta$ be any congruence relation¹² on $\mathfrak{R}$. Then the elements congruent to $O$ form an “ideal” $\mathfrak{D}$ in the sense that

I1: $X \in \mathfrak{D}$ and $A \in \mathfrak{R}$ imply $A \cap X \in \mathfrak{D}$.
I2: $X \in \mathfrak{D}$ and $Y \in \mathfrak{D}$ imply $X \cup Y \in \mathfrak{D}$.

In case $\mathfrak{R}$ is a Boolean algebra, $\mathfrak{D}$ determines $\theta$, but this is not generally true in distributive lattices.

Proof. With Boolean algebras, $S$ and $T$ are congruent mod $\theta$ if and only if $(S \cap T') \cup (S' \cap T) \in \mathfrak{D}$, whereas $O$ is an ideal in the chain of three elements $I > X > O$, determined by two distinct congruence relations.


(1935), pp. 433-454. O. Ore calls distributive lattices “arithmetic structures”. Considerable work has been done by Fritz Klein on the decomposition of distributive lattices important in number theory; M. Ward has also given categorical definitions of such systems.

9 Actually, any one-one correspondence preserving any one of these three preserves all; this is not true of many-one correspondences.

10 Cf. the author’s On the combination of subalgebras, Proc. Camb. Phil. Soc., vol. 29 (1933), pp. 441-464, Theorem 25.2.

11 M. H. Stone, The theory of representations of Boolean algebras, Trans. Amer. Math. Soc., vol. 40 (1936), pp. 37-111. By a “representation” of a distributive lattice $L$, we mean a homomorphism between $L$ and a ring of sets.

12 I.e., any partition of the elements of $\mathfrak{R}$ determining an abstract homomorphism. This is a basic notion of general abstract algebra, whose detailed definition we shall omit.













Again, let $\mathfrak{R}$ be any distributive lattice, and $A$ any element of $\mathfrak{R}$. The relation $X \equiv Y$ mod $A$ meaning $X \cup A = Y \cup A$ is a congruence relation.

Proof. That it is an equivalence relation is obvious. Moreover, by L1-L6,

$$(X \cup Y) \cup A = (X \cup A) \cup (Y \cup A),$$
$$(X \cap Y) \cup A = (X \cup A) \cap (Y \cup A);$$

hence the correspondence $X \to X \cup A$ defines a homomorphism of $\mathfrak{R}$ onto a subring of itself.
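The statement that $X \to X \cup A$ preserves sums and products can be checked directly on a small ring of sets. The sketch below is an illustration under our own choices: the ring is the family of all subsets of a three-element class, and the function names are hypothetical.

```python
from itertools import combinations

def power_set(points):
    pts = list(points)
    return [frozenset(c) for r in range(len(pts) + 1) for c in combinations(pts, r)]

def is_homomorphism_mod(ring, a):
    """True if X -> X | a preserves both sums and products on the given ring."""
    def f(x):
        return x | a
    return all(
        f(x | y) == (f(x) | f(y)) and f(x & y) == (f(x) & f(y))
        for x in ring for y in ring
    )

def congruence_classes(ring, a):
    """Group the members of the ring by their image X | a, i.e. by congruence mod a."""
    classes = {}
    for x in ring:
        classes.setdefault(x | a, []).append(x)
    return classes
```

With `ring = power_set(range(3))` and `a = frozenset({0})`, the eight subsets fall into four classes of two, each class consisting of a set and the same set with the point 0 adjoined.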

If $\mathfrak{R}$ is a finite Boolean algebra, there are no other congruence relations on $\mathfrak{R}$; this is not true for finite distributive lattices which are not Boolean algebras (proof omitted).


7. Prime ideals. Let us now suppose that $R$ is any distributive lattice, and let us attempt to give a full representation theory for $R$.

Let $\theta: R \to \mathfrak{R}$ be any homomorphism from $R$ to a ring of subsets of a class $I$. We may classify the points of $I$ into three categories: those contained in every set $X \in \mathfrak{R}$, those contained in no set $X \in \mathfrak{R}$, and the others. The first two categories of points are trivial, and so we can assume that $O \in \mathfrak{R}$ and $I \in \mathfrak{R}$.

Under these circumstances, every $p \in I$ divides the elements of $R$ into two categories: those corresponding to sets including $p$, and those corresponding to sets excluding $p$. The second set of elements is an “ideal”, while the first is a “dual ideal” $D$ in the sense that

D1: $x \in D$ and $a \in R$ imply $a \cup x \in D$.
D2: $x \in D$ and $y \in D$ imply $x \cap y \in D$.


Hence the representation of $R$ through $\mathfrak{R}$ is characterized to within equivalence¹³ by which divisions of $R$ into an ideal and complementary dual ideal occur, and how many times each occurs.

But conversely, by I1-I2 and D1-D2, if one is given any correspondence associating each division $\kappa$ of $R$ into an ideal $J$ and complementary dual ideal $D$ with a cardinal number $n(\kappa)$, then this belongs to a representation of $R$ by a ring of sets, and so if we define (with Stone, op. cit.) an ideal to be “prime” if and only if its complement is a dual ideal, we have

THEOREM 3. The inequivalent representations of a given distributive lattice $R$ as a ring of sets are the different functions whose arguments are the “prime ideals” of $R$, and whose values are cardinal numbers.

Remark 1. With Boolean algebras, the number of elements in any prime ideal and its dual are the same. Also, no prime ideal contains any other prime ideal. Neither of these properties is true in distributive lattices not Boolean algebras (e.g., the chain $I > X > O$).


13 I.e., to within differences between the various points $p \in I$. This is standard terminology.










Remark 2. It is natural to call a representation “irredundant”’ if and only 
if no prime ideal appears as a point more than once. 


8. The finite case. Only exceptionally are the prime ideals of infinite Boolean algebras known. But in each finite Boolean algebra of order $2^n$ they are known to be $n$ sublattices of order $2^{n-1}$.

We shall go further and determine the prime ideals of all finite distributive 
lattices. 

Accordingly, let $R$ be any finite distributive lattice, $P$ any prime ideal in $R$, and $D = R - P$ the dual of $P$. Form any connected chain¹⁴ $O < x_1 < x_2 < \cdots < x_n = I$ in $R$; it is clear that in such a chain there will be exactly one “link” $x_i < x_{i+1}$ such that $x_i \in P$ and $x_{i+1} \in D$, and that $x_k \in P$ for $k < i$, while $x_k \in D$ for $k > i$.

(8α) We have $y \in P$ or $y \in D$ according as $v = x_i \cup (y \cap x_{i+1}) = (x_i \cup y) \cap x_{i+1}$ is $x_i$ or $x_{i+1}$.

Proof. Since $x_i \leq x_{i+1}$, $x_i \cup (y \cap x_{i+1}) = (x_i \cup y) \cap x_{i+1}$ (by L6 and contraction). Again, for any $y$, obviously $x_i \leq v \leq x_{i+1}$; hence either $v = x_i$ or $v = x_{i+1}$ (no further interpolation being possible). But if $[x_i \cup (y \cap x_{i+1})] = x_i \in P$, then by I1 $(y \cap x_{i+1}) \in P$, and so by D2 (since $x_{i+1} \in D$), $y \in P$. Similarly, if $x_i \cup (y \cap x_{i+1}) = x_{i+1} \in D$, then by I2 (since $x_i \in P$), $y \cap x_{i+1} \in D$, and so by D1, $y \in D$.

Definition. By a "prime factor" of a distributive lattice is meant any symbol
x/y, where $y < x$ and no element can be interpolated between y and x. A
prime factor x/y will be called a "cleavage" for a given prime ideal P if and
only if $y \in P$ and $x \in (R - P)$.

(8β) Any prime factor a/b is a cleavage for some prime ideal.

Proof. Let $x \in P$ if and only if $(b \cup x) \cap a = b$; this makes $x \in (R - P)$ if and
only if $(b \cup x) \cap a = a$, since for all x, $b \le (b \cup x) \cap a = b \cup (x \cap a) \le a$,
and a/b is prime. Clearly $a \in (R - P)$ and $b \in P$ (by L4). It remains to prove
I1-I2 and D1-D2. But I1 and D1 are obvious, since $(b \cup x) \cap a$ is decreased
resp. increased by substituting $x \cap y$ resp. $x \cup y$ for x. Moreover, under the
hypotheses of I2,

$$b = [b \cup (x \cap a)] \cup [b \cup (y \cap a)] = b \cup [(x \cap a) \cup (y \cap a)]$$
$$= b \cup [(x \cup y) \cap (x \cup a) \cap (y \cup a) \cap a] = b \cup [(x \cup y) \cap a].$$

This proves I2. The proof of D2 is dual.

THEOREM 4. Let R be any finite distributive lattice, and let its prime ideals be
$P_1, \dots, P_r$. Then in every connected chain $O < x_1 < \cdots < x_n = I$, each
$x_{i+1}/x_i$ is a cleavage for just one $P_j$, whence r = n.

Proof. By (8α), if $P_i \ne P_j$, they can have no cleavage in common, and
by (8β), every prime factor is a cleavage for some $P_j$.

We have the Jordan-Dedekind theorem¹⁵ on the constancy of the number of




14 A chain is called "connected" (or dense by Ore) if no further terms can be interpolated
in it.
15 R. Dedekind, Werke, vol. II, p. 254.








450 GARRETT BIRKHOFF 


links in connected chains as one corollary, and using Theorem 3, we have the 
further 

COROLLARY. A finite distributive lattice has (to within equivalence) exactly one
irredundant isomorphic representation as a ring of sets, and the number of points
involved is the number of links in its connected chains.¹⁶


9. The finite case (continued). Let R denote again any finite distributive
lattice, let its prime ideals be $P_1, \dots, P_n$, and let their duals be $D_1, \dots, D_n$.

Let further $s_i = s(P_i)$ and $p_i = p(D_i)$ be the sum of the $x \in P_i$ resp. the
product of the $x \in D_i$. By I2, $s_i \in P_i$, and by D2, $p_i \in D_i$; hence (cf. I1-D1)
$x \in P_i$ means $x \le s_i$ and $x \in D_i$ means $x \ge p_i$.

Now let J denote the partially ordered set¹⁷ of the $s_i$. Call a subset S of J
"closed" if and only if $s_i \in S$ and $s_i \le s_j$ imply $s_j \in S$.


(9α) R is isomorphic with the ring of "closed" subsets of J under the
correspondence $S \to \bigwedge_{s_j \in S} s_j$.

Proof. Let S be a "closed" subset of J. Then by I1-I2 and D1-D2,
$\bigwedge_{s_j \in S} s_j$ is in the $P_j$ corresponding to these $s_j$, and no others. But given
$x \in R$, the subset of $s_j \ge x$ is closed, $y = \bigwedge_{s_j \ge x} s_j$ is in the same $P_j$
as x, and hence $y \cup x$ and $y \cap x$ are, and so by (8β) no prime factor can be
inserted between them, and $x = y = \bigwedge_{s_j \ge x} s_j$. Thus the correspondence
$x \to \bigwedge_{s_j \ge x} s_j$ is one-one. But it clearly preserves inclusion, while by
Theorem 1 the closed subsets of J are a ring of sets. This completes the proof.

Consequently two finite distributive lattices having isomorphic partially 
ordered sets of s; are isomorphic. But the converse is obvious, since the s; 
are intrinsically defined. Since, finally, if X is any (abstractly given) partially 
ordered set, the ring of its closed subsets is a distributive lattice having the
"closures" of points of X for $s_i$, we obtain

THEOREM 5.¹⁸ There is a one-one correspondence between the partially ordered
sets of n elements and the distributive lattices whose connected chains are of length n.

In the notation of a previous article (this volume of this Journal, p. 311), by
(9α) this is the correspondence $X \to B^X$.
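The correspondence of Theorem 5 can be tried out on a small case. The following is a minimal sketch, not the author's construction: it assumes that "closed" means upward-closed, represents a hypothetical three-point ordered set by an illustrative relation `leq`, and uses helper names (`closed_subsets`, `chain_length`) that are invented here. It lists the closed subsets, which should form a ring of sets whose longest chains have as many links as there are points.

```python
from itertools import combinations

def closed_subsets(points, leq):
    """Return all subsets S such that a in S and a <= b imply b in S."""
    family = []
    for size in range(len(points) + 1):
        for combo in combinations(points, size):
            subset = frozenset(combo)
            if all(b in subset for a in subset for b in points if leq(a, b)):
                family.append(subset)
    return family

def chain_length(family):
    """Number of links in a longest chain of the family under strict inclusion."""
    best = {}
    for subset in sorted(family, key=len):
        # Every proper subset has fewer elements, so it is already in `best`.
        best[subset] = max((best[t] + 1 for t in best if t < subset), default=0)
    return max(best.values())

# Hypothetical example: X = {1, 2, 3} with 1 <= 3 and 2 <= 3 (assumed data).
POINTS = (1, 2, 3)
RELATED = {(1, 3), (2, 3)}

def leq(a, b):
    return a == b or (a, b) in RELATED

FAMILY = closed_subsets(POINTS, leq)
```

For this three-point set the family has five members and is closed under union and intersection, so it is a distributive lattice of sets with chains of three links.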

Remark 1. The connected components $X_1, \dots, X_r$ of X correspond to the
indecomposable direct factors of

$$B^X = (B^{X_1}) \times \cdots \times (B^{X_r})$$

in the direct decompositions of $B^X$.


16 Cf. On the combination of subalgebras, Theorem 17.2.

17 A set is "partially ordered" (the terminology is Hausdorff's, Grundzüge der
Mengenlehre, 1914, Chap. VI) by a quasi-ordering satisfying P3: $x \ge y$ and $y \ge x$ imply $x = y$.
Any subset of a partially ordered set (such as a distributive lattice) is partially ordered
by the same relation.

18 Theorem 5 was announced without proof by the author in a note Sur les espaces
discrets, Comptes Rendus, vol. 201 (1935), p. 19.













Remark 2. The "Hasse diagram"¹⁹ for X gives an infinitely more compact
and intelligible way of writing down a general distributive lattice $B^X$ than the
multiplication table used by Dedekind, or than the "Hasse diagram" for $B^X$
itself used by recent authors.

Remark 3. Let $\mathfrak{R}$ be any finite ring of sets, and $L = B^X$ the distributive
lattice isomorphic with $\mathfrak{R}$. To find X, take the quasi-ordering determined by
$\mathfrak{R}$; identify points x and y satisfying both $x \ge y\ (\mathfrak{R})$ and
$y \ge x\ (\mathfrak{R})$; the partial ordering induced by the quasi-ordering on the sets
of "identified" points will yield X.


10. The indecomposable elements. Let us again suppose that R is a finite
distributive lattice with prime ideals $P_1, \dots, P_n$ having duals $D_1, \dots, D_n$.

(10α) Each $s(P_i) = s_i$ is product-indecomposable. Dually, each $p(D_i) = p_i$
is sum-indecomposable.

Explanation. An element a of a lattice L is called "product-indecomposable"
when no two elements $x > a$ and $y > a$ exist with $x \cap y = a$; it is called
sum-indecomposable when (dually) no two elements $x < a$ and $y < a$ exist with
$x \cup y = a$.

Proof. If $x > s_i$ and $y > s_i$, then $x \in D_i$ and $y \in D_i$, whence $(x \cap y) \in D_i$
and $x \cap y \ne s_i$.

(10β) If $x \in R$ is product-indecomposable, it is an $s_i$. If it is
sum-indecomposable, it is a $p_i$, dually.

Proof. If x is product-indecomposable, then it yields a unique prime factor
y/x: let $P_i$ be the corresponding prime ideal. Clearly $x = s(P_i)$, since if
$x < s(P_i)$, we would have $y \le s(P_i)$, whence $y \in P_i$.

COROLLARY. The number of sum-indecomposable resp. product-indecomposable
elements of a finite distributive lattice is the length of its connected chains.
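The corollary can be checked on a familiar finite distributive lattice. The sketch below is only an illustration under stated assumptions: it takes the divisors of an integer ordered by divisibility (so that product is `gcd` and sum is `lcm`), and it leaves I out of the product-indecomposable count and O out of the sum-indecomposable count, which the text appears to do implicitly. All function names are invented for the example.

```python
from math import gcd

def divisors(n):
    return [d for d in range(1, n + 1) if n % d == 0]

def lcm(a, b):
    return a * b // gcd(a, b)

def product_indecomposable(n):
    """Divisors a != n that are not gcd(x, y) for divisors x, y strictly above a."""
    ds = divisors(n)
    result = []
    for a in ds:
        if a == n:  # the unit I is excluded (assumption stated in the lead-in)
            continue
        above = [x for x in ds if x != a and x % a == 0]
        if not any(gcd(x, y) == a for x in above for y in above):
            result.append(a)
    return result

def sum_indecomposable(n):
    """Divisors a != 1 that are not lcm(x, y) for divisors x, y strictly below a."""
    ds = divisors(n)
    result = []
    for a in ds:
        if a == 1:  # the zero O is excluded (assumption stated in the lead-in)
            continue
        below = [x for x in ds if x != a and a % x == 0]
        if not any(lcm(x, y) == a for x in below for y in below):
            result.append(a)
    return result

def chain_length(n):
    """Number of links in a longest chain of divisors from 1 up to n."""
    best = {}
    for d in divisors(n):
        best[d] = max((best[c] + 1 for c in best if d % c == 0), default=0)
    return best[n]
```

For the divisors of 12 the connected chains have three links (for instance 1, 2, 4, 12), and the sketch finds three elements of each kind.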


10a. The free distributive lattices. Consider the "free" distributive lattice
generated by n symbols $x_1, \dots, x_n$; Theorem 1 inclines one to adjoin to it
elements O and I such that $O < x_i < I$ for all $x_i$.

If this is done, then the product-indecomposable elements form a Boolean
algebra $B^n$ of $2^n$ elements. More precisely, they are the elements O, $x_i$,
a, U2e;,27, U2,  Ur,m Ua, U2, Uam,---, my Ue Um U 
te U--- Ua, ,a, U--- Ux, . The corresponding prime factors are 


a A+++ 2x,/0, 
we U (ar ee) ra A ter ++ A ty) /ze , 


oe ee eee wee meme eee wee eee eee eeeeeeeeeeeeeeee 


(7, U--- Va, Ua, U---U2r,) Ux, 
/(ay U +++ U aga U ayy, U +--+ U o,), 
I/(a, U +++ U a,). 


19 Cf. H. Hasse, Höhere Algebra, vol. II, 1927, p. 103, p. 123.










It is a corollary that the free distributive lattices with O and I adjoined are
the $B^{B^n}$.

The proofs of the above statements are tedious; they depend on the knowledge
of canonical expressions for the elements of the free distributive lattice.²⁰


11. A general decomposition theorem. Let R be any (finite or infinite)
distributive lattice, and let

$$x = x_1 \cap \cdots \cap x_r = y_1 \cap \cdots \cap y_s$$

be any two representations of an element $x \in R$ as a product. Then irrespective
of i, $x_i = x_i \cup x = x_i \cup (\bigwedge_j y_j) = \bigwedge_j (x_i \cup y_j)$. Hence if $x_i$ is
product-indecomposable, some $x_i \cup y_j = x_i$. This means some $y_j \le x_i$.
Symmetrically, some $x_k \le y_j$. Hence either $x_k = y_j = x_i$, or $x_i$ is redundant
in the strong sense that some $x_k < x_i$, whence

$$x = x_1 \cap \cdots \cap x_{i-1} \cap x_{i+1} \cap \cdots \cap x_r.$$

Thus if the decompositions are irredundant, the $x_i$ and $y_j$ are equal in pairs,
r = s, and so

(11α) In a distributive lattice, no element has more than one irredundant
product-decomposition (sum-decomposition) into elements not themselves further
decomposable.

But conversely, any modular lattice which is not distributive is known²¹ to
contain a sublattice of five elements $a, b, x_1, x_2, x_3$ satisfying $a < x_i < b$,
$x_i \cap x_j = a$, and $x_i \cup x_j = b$ $[i \ne j]$. Now starting with the two
product-decompositions $a = x_1 \cap x_2$ and $a = x_2 \cap x_3$ of a, making further
decompositions as long as possible, and eliminating redundant components, we see
that any factor for the second decomposition which contains $x_1$ must contain $x_2$
or $x_3$ and hence b, whereas the first decomposition and those derived from it must
possess at least one factor containing $x_1$ but not b. Hence if the above process
is terminating, we will surely get two distinct product-decompositions of a.
But in the presence of the chain-condition, the process is terminating. This
completes the proof of

THEOREM 6.²² A modular lattice satisfying the ascending chain condition is
distributive if and only if each of its elements has a unique irredundant
product-decomposition.

20 The latter are given by Th. Skolem, Über gewisse "Verbände" oder "Lattices", Avh.
Norske Videnskaps Akademi i Oslo, Mat.-Naturv. Klasse, 1936, no. 7, pp. 7, 8. From
them it is immediately obvious that the elements specified above are the only
product-indecomposable elements; but there are just enough such elements to give the
lattice $2^n$ dimensions; hence they are all product-indecomposable.

21 Cf. Theorem 4 of the author's On the lattice theory of ideals, Bull. Amer. Math. Soc.,
vol. 40 (1934), p. 617.

22 This result was announced by the author in Abstract 41-1-75 of the Bull. Amer. Math.
Soc., vol. 41 (1935), p. 32.













This result is especially interesting in the light of recent proofs by Kurosch
and Ore that, in any modular lattice, the number of factors in any two
irredundant product-decompositions of the same element into indecomposable
factors is the same.²³


12. Some enumeration problems. One very impartial test of one's ability to
classify finite systems is one's ability to enumerate them. This suggests the
problem of determining the following combinatorial functions.

(12.1) The number $F_1(n)$ of different rings of subsets of n elements. (This
is the number of sublattices of the Boolean algebra $B^n$ of $2^n$ elements.)

(12.2) The number $F_2(n)$ of non-equivalent rings of such subsets. (This is
the number of such sublattices non-conjugate under the group of automorphisms
of $B^n$.)

(12.3) The number $F_3(n)$ of non-isomorphic rings of such subsets. (This is
the number of non-isomorphic distributive lattices of "dimensions" n.)

(12.4) The number $F_4(n)$ of non-isomorphic partial orderings of n elements.

Remark 1. If we replace "ring" by "field" in the above, $F_1(n)$ becomes a
known combinatorial function defined by the recurrence

$$H^*(n+1) = \sum_{h=0}^{n} \binom{n}{h} H^*(n-h).$$

This has been studied by Aitken (Edin. Math. Notes, vol. 28 (1933), pp. xviii-xxiii).
Again, $F_2(n)$ becomes the partition function, a celebrated asymptotic
formula for which has been given by Hardy and Ramanujan. And lastly,
$F_3(n)$ becomes n.
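The recurrence of Remark 1 is easy to iterate. The sketch below assumes the starting value $H^*(0) = 1$, which the text does not state, and reads the garbled binomial coefficient as "n choose h"; the function name `h_star` is invented for the illustration.

```python
from math import comb

def h_star(count):
    """First `count` values H*(0), H*(1), ... of the recurrence, with H*(0) = 1 assumed."""
    values = [1]
    for n in range(count - 1):
        # H*(n+1) = sum over h of C(n, h) * H*(n - h)
        values.append(sum(comb(n, h) * values[n - h] for h in range(n + 1)))
    return values
```

Under that assumption the values begin 1, 1, 2, 5, 15, 52, which is the count of partitions of a set of n elements into classes.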

Remark 2. In virtue of Theorems 3 and 5, $F_3(n) = \sum_{k=1}^{n} F_4(k)$. Also,
$F_2(n)$ is by Theorem 1 the number of non-equivalent quasi-orderings of n
elements, and $F_1(n)$ is the number of different quasi-orderings of n elements.
A table for these functions for small n follows.


I 2 3 4 5 6 
F,(n) | 3 29 
F(n) I 3 9 30 
ry n) I 2 5 15 51 250 
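The number of different quasi-orderings of n labelled elements, which Remark 2 connects with the rings of subsets, can be found by brute force for very small n. The following is a hedged sketch, not the method used to compile the table: it simply tests every reflexive relation for transitivity, and the function name is an illustrative choice.

```python
from itertools import product

def count_quasi_orderings(n):
    """Count reflexive, transitive relations on the labelled set {0, ..., n-1}."""
    off_diagonal = [(i, j) for i in range(n) for j in range(n) if i != j]
    count = 0
    for bits in product([False, True], repeat=len(off_diagonal)):
        relation = {pair for pair, keep in zip(off_diagonal, bits) if keep}
        relation |= {(i, i) for i in range(n)}  # reflexivity is imposed
        transitive = all(
            (a, d) in relation
            for (a, b) in relation
            for (c, d) in relation
            if b == c
        )
        if transitive:
            count += 1
    return count
```

The search space has $2^{n(n-1)}$ relations, so this is practical only for the first few n.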


23 A. Kurosch, Durchschnittsdarstellungen mit irreduziblen Komponenten in Ringen und
in sog. Dualgruppen, Rec. Math. (Moscow), vol. 42 (1935), pp. 613-16. O. Ore, On the
foundations of abstract algebra. II, Annals of Math., vol. 37 (1936), p. 270, Theorem 11.
The result of Kurosch-Ore contains a decomposition theorem of E. Noether for ideals,
and a less well-known result of Remak's on finite groups, as special corollaries.










In calculating these values, assume that $F_2(n)$ is the number of functions from
the different partially ordered sets of $k \le n$ elements to cardinal numbers whose
sum is n. Also $F_1(n)$ can be calculated combinatorially from $F_2(n)$ by summing
the occurrences of each type of ring of subsets, over the types existing. To
find $F_4(n)$, separate each partially ordered set into its connected components.

It would be very interesting to know more about the $F_i(n)$, numerically or
asymptotically. $F_4(n)$ resembles the function describing the number of groups
of order $2^n$, whose first values are 1, 2, 5, 14, 51, 266, .... It appears to
increase more rapidly than the function describing the number of non-isomorphic
symmetric relations between n objects (or alternatively, the number of
non-homomorphic graphs with n vertices), whose first values are 1, 2, 4, 11, 27.
But as almost nothing is known about the rate of growth of these functions,
these comparisons are not very reliable.


13. Homomorphic images and sublattices. Let us try to determine the
homomorphisms and sublattices of a given finite distributive lattice, guided by
the previous results.

Some authors,²⁴ inspired by the numerous analogies between lattices and rings,
have correlated the congruence relations on lattices with "ideals" and "normal
sublattices". But except in the "complemented" case in which each element x
possesses a complement x' with $x \cap x' = O$ and $x \cup x' = I$, this correlation is
incomplete.

Actually, in the case of finite distributive lattices, and more generally with 
arbitrary modular lattices of finite dimensions (the author will publish proofs 
elsewhere, in an article on modular lattices), congruence relations correspond 
one-one to subsets of the set of prime factors. 

It follows that, if $L = B^X$ is any finite distributive lattice, the congruence
relations on it are obtained by setting $x \equiv y\ (\theta)$ if and only if x and y contain
the same $P_i$. Hence, to obtain the homomorphic images $B^Y$ of $B^X$, set Y equal
to any subset of X having on that subset the same inclusion relation as X.

The determination of the sublattices of $B^X$ is even easier. First, recall that $B^X$
is isomorphic with the ring of subsets of X which are "closed" with respect to
the partial ordering of X, by (9α). But a sublattice is clearly just a subring,
and by Theorem 1 these subrings are the families of sets "closed" under
quasi-orderings $\rho$ of X such that $x \ge y\ (\rho)$ whenever $x \ge y$ in X. Hence, to
obtain the sublattices of $B^X$, strengthen the inclusion relation in X to any
quasi-ordering and consider the partially ordered set Y obtained from this after
elements x and y such that $x \ge y$ and $y \ge x$ have been identified; $B^Y$ will be the
general sublattice of $B^X$.


HARVARD UNIVERSITY


24 Cf. for instance Gr. C. Moisil, Recherches sur l'algèbre de la logique, Annales Sci. de
l'Univ. de Jassy, vol. 22 (1936), pp. 1-118.














A REPRESENTATION OF GENERALIZED BOOLEAN RINGS 
By N. H. McCOY AND DEANE MONTGOMERY


1. Introduction. Stone has recently shown¹ that every Boolean ring is isomorphic
to a ring of subclasses of some class. As Stone himself remarks, there
is a close relation between the representation of Boolean rings and the theory
of direct sums of rings. The theorem just stated is clearly equivalent to the
theorem that every Boolean ring is isomorphic to a subring of a direct sum of
rings $F_2$.² We present here a simple direct proof of this theorem in a somewhat
more general case.

A commutative ring $R_p$ is said to be a generalized Boolean ring of index p
(often abbreviated p-ring) if p is a prime and if for every a in $R_p$ it is true that
$a^p = a$ and $pa = 0$. A Boolean ring as defined by Stone is therefore a 2-ring.³
We show here that a p-ring is isomorphic to a subring of a direct sum of rings $F_p$.
The interest of this theorem lies partly in its generality and partly in the
simplicity of the proof, which is based on an exploitation of a device used by
Alexander and by Alexander and Zippin.⁴ Inasmuch as our proof, like Stone's,
demands the existence of certain homomorphisms and inasmuch as we prove
the existence of these homomorphisms by a method analogous to Stone's,⁵ our
proof makes use of transfinite induction.


2. Subrings of direct sums. For the theorem given here on subrings of direct 
sums neither of the rings considered need be commutative. 

THEOREM 1. A necessary and sufficient condition that a ring R be isomorphic
to a subring of a direct sum of rings K is that for every $a \ne 0$ in R there is a
homomorphism h of R into a subring of K such that $h(a) \ne 0$.

Consider first the necessity of the condition. Assume then that the elements
of R are functions f defined on a certain set M with values in K.⁶ If $f_1$ in R
is not zero, there is some element m such that $f_1(m) \ne 0$. We obtain a
homomorphism of R into a subring of K by making correspond to any f in R the
value f(m). This homomorphism is not zero on $f_1$ and therefore satisfies the
condition of the theorem.

Received January 26, 1937. 

1 M. H. Stone, The theory of representations for Boolean algebras, Transactions of the
American Mathematical Society, vol. 40 (1936), pp. 37-111. See also Garrett Birkhoff,
On the combination of subalgebras, Proc. Camb. Phil. Soc., vol. 29 (1933), pp. 441-464.

2 In general, for any prime p, $F_p$ denotes the field of integers reduced modulo p.

3 When p = 2, the commutativity and the fact that pa = 0 follow from the assumption
$a^2 = a$.

4 Annals of Mathematics, vol. 35 (1934), pp. 389-395; vol. 36 (1935), pp. 71-85.

5 Loc. cit., pp. 102-104.

6 A direct sum consists of the set of all such functions.












Turning now to the sufficiency of the condition, let H denote the set of all
homomorphisms h of R into any subring of K. Corresponding to each element a
of R we define on H the function $y_a$, with values in K, as follows:

(1) $y_a(h) = h(a)$.

Since h is a homomorphism, it follows at once that

(2) $y_{a+b}(h) = h(a + b) = h(a) + h(b) = y_a(h) + y_b(h)$,

and

(3) $y_{ab}(h) = h(ab) = h(a)h(b) = y_a(h) y_b(h)$.

Thus the correspondence $a \to y_a$ is a homomorphism of R into the ring S of
functions $y_a$, $a \in R$. To prove that this is actually an isomorphism, we need
to show that the function $y_a$ vanishes identically on H only if a = 0. This
follows almost at once, for we have assumed that if $a \ne 0$ there is an $h_1$ in H
such that $h_1(a) = y_a(h_1) \ne 0$. Thus R is isomorphic to S, and the proof is
completed by noting that the ring S is a subring of the ring of all functions on
H to K, and is therefore a subring of a direct sum of rings K.
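The construction in the sufficiency argument can be followed on a small case. The sketch below is an illustration only: it takes for R the Boolean ring of all subsets of a hypothetical three-point set (sum is symmetric difference, product is intersection), takes K to be the integers modulo 2, finds every homomorphism h by exhaustive search, and forms the functions $y_a(h) = h(a)$. The names `ELEMENTS`, `homomorphisms` and `REPRESENTATION` are invented for the example.

```python
from itertools import combinations, product

POINTS = (0, 1, 2)
ELEMENTS = [frozenset(c) for k in range(len(POINTS) + 1) for c in combinations(POINTS, k)]

def add(a, b):
    return a ^ b  # symmetric difference

def mul(a, b):
    return a & b  # intersection

def is_homomorphism(h):
    """Check h(a + b) = h(a) + h(b) and h(ab) = h(a)h(b) in the integers mod 2."""
    return all(
        h[add(a, b)] == (h[a] + h[b]) % 2 and h[mul(a, b)] == (h[a] * h[b]) % 2
        for a in ELEMENTS
        for b in ELEMENTS
    )

def homomorphisms():
    """All homomorphisms of R into a subring of K, the zero map included."""
    found = []
    for values in product((0, 1), repeat=len(ELEMENTS)):
        h = dict(zip(ELEMENTS, values))
        if is_homomorphism(h):
            found.append(h)
    return found

H = homomorphisms()
# y_a is recorded as the tuple of its values h(a), one entry per h in H.
REPRESENTATION = {a: tuple(h[a] for h in H) for a in ELEMENTS}
```

In this example H has four members (the zero map and one evaluation per point), and distinct elements a receive distinct functions $y_a$, as the proof requires.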

A theorem entirely analogous to Theorem 1 holds for groups or, more generally, 
for abstract algebras as defined by Garrett Birkhoff. 


3. Imbedding theorem. The following theorem is demonstrated for any
prime in the same way in which Stone demonstrates it for the case p = 2.

THEOREM 2. A p-ring $R_p$ may be imbedded in a p-ring $R_p^*$ which contains a
unit element.

Let us denote the elements of $F_p$ by $0, 1, \dots, p - 1$. The elements of $R_p^*$
will be the pairs (r, n), where r is in $R_p$ and n is in $F_p$. If $(r_1, n_1)$ and $(r_2, n_2)$
are two pairs of the kind described, their sum is defined to be $(r_1 + r_2, n_1 + n_2)$
and their product is defined to be $(r_1 r_2 + n_2 r_1 + n_1 r_2, n_1 n_2)$. These elements
form a commutative ring under this definition. Clearly p times any element
is zero and it can also be verified that any element raised to the p-th power is
itself. The ring thus formed is the desired ring, for the elements of the form
(r, 0) form a subring isomorphic to $R_p$ and the element (0, 1) is a unit element.
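The pair construction can be verified mechanically in a small case. The sketch below is only a check under stated assumptions: it takes p = 3 and, for $R_p$, the integers modulo 3 themselves (a finite example, chosen purely for convenience), and confirms the properties claimed for the ring of pairs. The names `P`, `add`, `mul`, `power` and `PAIRS` are illustrative.

```python
P = 3  # illustrative prime; R_p is taken to be the integers mod 3 (assumption)

def add(x, y):
    """Sum of pairs: (r1 + r2, n1 + n2)."""
    return ((x[0] + y[0]) % P, (x[1] + y[1]) % P)

def mul(x, y):
    """Product of pairs: (r1 r2 + n2 r1 + n1 r2, n1 n2)."""
    (r1, n1), (r2, n2) = x, y
    return ((r1 * r2 + n2 * r1 + n1 * r2) % P, (n1 * n2) % P)

def power(x, k):
    """k-th power of a pair for k >= 1, by repeated multiplication."""
    result = x
    for _ in range(k - 1):
        result = mul(result, x)
    return result

def times(x, k):
    """k-fold sum of a pair with itself, for k >= 1."""
    result = x
    for _ in range(k - 1):
        result = add(result, x)
    return result

PAIRS = [(r, n) for r in range(P) for n in range(P)]
UNIT = (0, 1)
```

With these definitions every pair x satisfies $x^p = x$ and $px = 0$, the pair (0, 1) acts as a unit, and the pairs (r, 0) multiply exactly as the elements r do.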


4. Finite p-rings. Let F be a p-ring containing a finite number of elements.
Since every element a of F satisfies the equation $a^p = a$, it follows at once that
if $a^n = 0$ for some positive integer n, then a = 0. Thus F does not contain a
radical and is therefore known to have a unit element and to be a direct sum of
fields.⁷ These fields, being subrings of F, are clearly also p-rings.

It will now be shown that $F_p$ is the only field which is a p-ring. Suppose S
is such a field, the unit element of S being denoted by e. Then S contains a
field $F'_p$ isomorphic to $F_p$ and consisting of the integral multiples of e. Since


7 B. L. van der Waerden, Moderne Algebra, vol. 2, p. 163.













$$x^p - x \equiv x(x - 1) \cdots [x - (p - 1)] \pmod{p},$$

it follows that each element a of S satisfies the equation

$$x^p - x = x(x - e) \cdots [x - (p - 1)e] = 0.$$

Now it is easy to show that a satisfies a unique equation f(x) = 0 of minimum
degree, with coefficients in $F'_p$ and with leading coefficient unity. Further, f(x)
must be irreducible in $F'_p$, as otherwise S would contain divisors of zero. But
f(x) divides $x^p - x$ and therefore is merely one of its linear factors, and thus a
is an element of $F'_p$. These results yield the following theorem:
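The congruence used in this argument, that $x^p - x$ splits into the linear factors x, x − 1, ..., x − (p − 1) modulo p, can be confirmed numerically for small primes. The sketch below multiplies the factors out with coefficients reduced modulo p; the coefficient-list representation (lowest degree first) and the function names are choices made for the illustration.

```python
def poly_mul_mod(f, g, p):
    """Multiply two polynomials given as coefficient lists (lowest degree first), mod p."""
    out = [0] * (len(f) + len(g) - 1)
    for i, a in enumerate(f):
        for j, b in enumerate(g):
            out[i + j] = (out[i + j] + a * b) % p
    return out

def product_of_linear_factors(p):
    """Coefficients of x (x - 1) ... (x - (p - 1)) reduced mod p."""
    result = [1]
    for k in range(p):
        result = poly_mul_mod(result, [(-k) % p, 1], p)
    return result

def x_to_the_p_minus_x(p):
    """Coefficients of x^p - x reduced mod p."""
    coeffs = [0] * (p + 1)
    coeffs[1] = (-1) % p
    coeffs[p] = 1
    return coeffs
```

The two coefficient lists agree for each prime tried, which is the statement of the congruence.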

THEOREM 3. Every finite p-ring contains a unit element and is a direct sum
of fields $F_p$.

The following remarks will be useful in proving the existence of homomorphisms
in the next section. Let $R_p$ be any p-ring containing a unit element e
and let a be any element in $R_p$. The ring {a, e} generated by a and e consists
of all polynomials in a and e. Since $a^p = a$ and since $pa = 0$, this ring is
finite and since it is a p-ring, it is expressible as a direct sum of fields $F_p$. Thus
there exists a set of non-zero elements $e_1, e_2, \dots, e_r$ of {a, e} with the following
properties:

(4) $e = e_1 + e_2 + \cdots + e_r$, $e_i^2 = e_i$, $e_i e_j = 0$ $(i \ne j)$.

Every element of {a, e} is expressible as a linear combination of the elements $e_i$
with coefficients in $F_p$. Furthermore the elements $e_i$ are linearly independent
over $F_p$. We shall call this set a basis of {a, e}.


5. Existence of homomorphisms. Let $R_p$ be an arbitrary p-ring containing
a unit element e and let S be a subring of $R_p$ which contains e. If a is an
element of $R_p$ not in S, denote by S(a) the subring of $R_p$ generated by S and a.
The elements of the ring S(a) are expressible as polynomials in a having
coefficients in S with degree at most p − 1. Now let $e_i$ $(i = 1, 2, \dots, r)$ be a basis
of {a, e} as in the preceding section. Each integral power of a is a linear
combination of the $e_i$'s with coefficients in $F_p$, and since e is also such a linear
combination each element b of S(a) may be written in the form

(5) $b = b_1 e_1 + b_2 e_2 + \cdots + b_r e_r$,

the coefficients $b_i$ being elements of S. If $c = c_1 e_1 + \cdots + c_r e_r$ is another
element of S(a), it follows from (4) that

(6) $b + c = (b_1 + c_1)e_1 + \cdots + (b_r + c_r)e_r$,
    $bc = (b_1 c_1)e_1 + \cdots + (b_r c_r)e_r$.

If b = 0, it follows from (5) and (4) that $0 = e_i b = b_i e_i$, and thus

(7) $b_1 b_2 \cdots b_r = b_1 b_2 \cdots b_r (e_1 + \cdots + e_r) = 0$.


We shall now prove the following lemma. 










LEMMA. Let S be a subring of $R_p$ containing the unit e of $R_p$, and let h be a
given homomorphism $S \to F_p$. Then there exists a homomorphism h', $S(a) \to F_p$,
such that under h' the images of elements of S are identical with their images
under h.

The symbol $P_r$ will be used to represent the direct sum of r rings S, the
elements of $P_r$ being denoted by $(b_1, b_2, \dots, b_r)$, where each $b_i$ is an element
of S. In like manner $C_r$ will be used to represent the direct sum of r rings $F_p$.
Let K denote the ideal in $P_r$ consisting of those elements $(b_1, b_2, \dots, b_r)$ such
that $b_1 e_1 + \cdots + b_r e_r = 0$. That K is an ideal follows from (6).

Now h induces a homomorphism $(b_1, b_2, \dots, b_r) \to (b_1^*, b_2^*, \dots, b_r^*)$ from
$P_r$ to $C_r$, where $b_i \to b_i^*$ by h. Denote by L the ideal in $C_r$ which is the image
of K under the induced homomorphism. The ideal L can not contain
$(1, 1, \dots, 1)$, for if $(b_1, b_2, \dots, b_r) \to (1, 1, \dots, 1)$, then $b_1 b_2 \cdots b_r \ne 0$,
and from (7), $(b_1, b_2, \dots, b_r)$ can not be in K. Therefore L does not include
all of $C_r$. Any ideal in $C_r$ is made up of elements $(x_1, x_2, \dots, x_r)$, where for a
certain fixed set of i's, $x_i = 0$, and for the remaining i's, $x_i$ may take any value
in $F_p$. Since L is not identical with $C_r$, we may assume that L consists of all
elements $(0, \dots, 0, x_k, \dots, x_r)$, where k > 1, and $x_k, \dots, x_r$ are arbitrary
elements of $F_p$.
We now set up the correspondence

(8) $b = b_1 e_1 + \cdots + b_r e_r \to b_1^*$

and proceed to show that this is the required homomorphism h'. By (6) this
will be a homomorphism $S(a) \to F_p$, provided the indicated correspondence is
independent of the representation (5) of any given element b of S(a). If b
can also be expressed as $c_1 e_1 + \cdots + c_r e_r$, it follows that
$(b_1 - c_1)e_1 + \cdots + (b_r - c_r)e_r = 0$ and therefore
$[(b_1 - c_1)^*, \dots, (b_r - c_r)^*] = (b_1^* - c_1^*, \dots, b_r^* - c_r^*)$ is an element of L.
From the form we have assumed L to have, it follows that $b_1^* - c_1^* = 0$ and hence
$b_1^* = c_1^*$. Thus (8) defines a homomorphism $S(a) \to F_p$. If x is any element
of S, then from (8) we find

$$x = xe = x(e_1 + \cdots + e_r) \to x^*$$

and the homomorphism h' coincides with h on S. This completes the proof.

THEOREM 4. If $R_p$ is any p-ring containing a unit element e and if a is any
non-zero element of $R_p$, there exists a homomorphism h of $R_p$ into $F_p$ such that
$h(a) \ne 0$.

Consider first the ring {a, e}. This ring is a direct sum of rings $F_p$ and
therefore a homomorphism h may be defined on it to $F_p$ in such a way that
$h(a) \ne 0$. It remains only to extend h to the remainder of $R_p$. If the elements
of $R_p$ not in {a, e} are well ordered, the desired extension may be made, in
view of the lemma just demonstrated, by transfinite induction.⁸


8 Cf. Stone, loc. cit., pp. 102-104.
















6. Representation of p-rings. We are now in a position to prove our principal
theorem.

THEOREM 5. If $R_p$ is any p-ring, it is isomorphic to a subring of a direct sum
of rings $F_p$.

By Theorem 2, $R_p$ may be imbedded in a p-ring $R_p^*$ with a unit element, and
by Theorems 1 and 4, $R_p^*$ is isomorphic to a subring of a direct sum of rings $F_p$.
Therefore $R_p$ itself is isomorphic to a subring of a direct sum of rings $F_p$.


SMITH COLLEGE








A STRUCTURAL CHARACTERIZATION OF PLANAR 
COMBINATORIAL GRAPHS 


By SAUNDERS MAC LANE


1. Introduction. There are several known necessary and sufficient conditions
that a combinatorial graph be planar.¹ This paper aims to establish another
such condition which has a more intrinsic character, in that it is obtained directly
from an analysis of the structure of the graph. More explicitly, the new
condition depends on a unique decomposition of the graph into certain maximal
triply connected subgraphs. This decomposition can be viewed on its own
merits as a generalization of the Whyburn cyclic element theory.

The first combinatorial criterion for a planar graph is due to Kuratowski,²
who showed that a graph is planar if and only if it contains no subgraph
homeomorphic to one of the two following graphs: the graph composed of five vertices
and ten edges, in which each pair of vertices is joined by an edge; the graph
composed of six vertices, arranged in two sets of three vertices each, and of nine
edges, such that each vertex of the first set is joined to each vertex of the second
set by an edge. Subsequently, Whitney defined combinatorially the relation
between a graph and its planar dual and showed that a graph is planar if and
only if it has a planar dual.³ A third condition states that a combinatorial
graph is planar if and only if it contains a complete independent set of circuits,
modulo 2, such that no edge appears in more than two of these circuits.⁴

The application of any of these theorems to a particular case has a haphazard
character because one must investigate any possible smallest non-planar
subgraph or any possible dual or any possible complete set of circuits. We seek
an intrinsic condition;⁵ that is, a condition expressible in terms of configurations
which are associated in a unique manner with a given graph. An example of
such a condition is the result of Whitney, that any graph G can be broken up
uniquely into non-separable components⁶ and that the graph is planar if and
only if each of its non-separable components is planar.⁷

Received January 28, 1937; presented to the American Mathematical Society, December 
30, 1936. 

1 For definitions of terms see §2.

2 K. Kuratowski, Sur le problème des courbes gauches en topologie, Fundamenta
Mathematicae, vol. 15 (1930), pp. 271-283.

3 H. Whitney, Non-separable and planar graphs, Transactions of the American
Mathematical Society, vol. 34 (1932), pp. 339-362.

4 S. Mac Lane, A combinatorial condition for planar graphs, Fundamenta Mathematicae,
vol. 28 (1937), pp. 22-32.

5 It is to be hoped that such a condition may throw light on the question of when a graph
is mappable on the torus.

6 These components of G are precisely the true cyclic elements of G. See G. T. Whyburn,
Concerning the structure of a continuous curve, American Journal of Mathematics,
vol. 50 (1928), pp. 167-194. For the combinatorial decomposition into components see D.
König, Theorie der endlichen und unendlichen Graphen, Leipzig, 1936, Ch. 14.

7 H. Whitney, loc. cit., Theorems 12 and 27.














These non-separable components are cyclically connected, since they are not
disconnected by the removal of any one vertex. Similarly, a graph G is triply
connected if the removal of any two of its vertices either does not disconnect G,
or else disconnects G into two parts, one of which is only a chain. Some results
of Adkisson and Whitney⁸ indicate that such a triply connected graph G can
have but one map on the sphere. In this topologically unique map, the circuits
which bound the connected regions of the complementary set are the only
circuits which do not "cut" the graph; that is, they are the only circuits whose
removal does not disconnect the graph. The condition that these circuits really
give possible region boundaries for a map yields (§4) an intrinsic condition that
such a triply connected graph be planar.

It is then natural to try to reduce a graph G which is not triply connected to
triply connected constituents. This might be done by choosing two vertices
p and q whose removal disconnects the graph G into two subgraphs H and H'.
However, planar maps of H and H' cannot always be combined to form a map
of G. This would be the case if H is modified by adding one new arc joining
p to q. The so-modified graph we call a "block" of G. Any graph can be
broken up successively ("split") into blocks so that the final blocks, called
atoms, are triply connected (cf. §2). These atoms are uniquely determined (see
§3) except for a homeomorphism. In terms of this combinatorial decomposition,
we obtain our fundamental result that a non-separable graph is planar if and
only if each of its triply connected atoms is planar (see §5). In the last section
we further illustrate the applicability of these atoms by showing that the number
of topologically distinct maps of a planar graph can be directly computed from
the number of atoms and the number of "multiple" splits of the original graph.


2. The splitting process. We first state some preliminary definitions. A
combinatorial graph G consists of a finite number of elements α, β, ... called
"edges" and a finite set of "vertices", p, q, ..., where each edge β is "on"
exactly two distinct vertices p and q, which may be called the "ends" of β.
Any set of edges in G together with all the vertices on these edges form themselves
a subgraph. We write $H \subset G$ for "H is a subgraph of G". If $H_1$ and $H_2$ are
subgraphs of G, then $H_1 \cap H_2$ is the subgraph containing those edges in both
$H_1$ and $H_2$, while $H_1 + H_2$ is the subgraph containing those edges in either
$H_1$ or $H_2$, and $G - H_1$ is the subgraph containing all edges of G not in $H_1$.
"Circuits" and "chains" are defined as usual. A hanging chain in G is a chain
none of whose vertices, except perhaps its two ends, are on more than two edges
of G. Two graphs G and G' are called homeomorphic if and only if G can be
changed into G' by one or more of the following operations:


8 V. W. Adkisson, Cyclicly connected continuous curves whose complementary domain 
boundaries are homeomorphic, preserving branch points, Comptes Rendus des séances de la 
Société des Sciences et des Lettres de Varsovie, Classe III, vol. 23, pp. 164-193. 

H. Whitney, Congruent graphs and the connectivity of graphs, American Journal of Mathe- 
matics, vol. 54 (1932), pp. 150-168. 








12 SAUNDERS MAC LANE 


(i) The replacement of a hanging chain by a new hanging chain having the 
same ends; 

(ii) The renaming of edges or vertices or both.⁹ 

If no vertices are renamed in the homeomorphism, each branch vertex of G is a 
branch vertex of G’, and so we say that the branch points are preserved. Two 
homeomorphic graphs are topologically equivalent, in that any map of the one 
is topologically homeomorphic to any map of the other. 

A graph B consisting only of $t \geq 3$ edges all having the same two ends p and 
q, or any graph obtained from B by replacing these edges by hanging chains, 
will be called a branch graph with t branches and with the termini p and q. 

Because of Whitney’s results on separable graphs, stated in the introduction, 
we shall consider henceforth only non-separable graphs G. A graph G, not a 
single edge, is non-separable (cyclically connected) if and only if each pair of 
vertices of G is contained in a circuit of G.¹⁰ 
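
The alternative criterion of footnote 10 (no single vertex disconnects the graph) lends itself to a direct check. The following is only a minimal sketch under stated assumptions: a graph is given as a list of edges `(u, v)`, and the helper names `connected_without` and `is_non_separable` are hypothetical, not taken from the paper.

```python
from collections import defaultdict


def connected_without(vertices, edges, removed=None):
    """True if the graph stays connected after deleting the vertex `removed`."""
    keep = [v for v in vertices if v != removed]
    if not keep:
        return True
    adj = defaultdict(set)
    for u, v in edges:
        if removed not in (u, v):
            adj[u].add(v)
            adj[v].add(u)
    seen, stack = {keep[0]}, [keep[0]]
    while stack:  # depth-first search from an arbitrary surviving vertex
        x = stack.pop()
        for y in adj[x]:
            if y not in seen:
                seen.add(y)
                stack.append(y)
    return len(seen) == len(keep)


def is_non_separable(edges):
    """Assumed test: connected, and no single vertex disconnects the graph."""
    if len(edges) < 2:  # the definition excludes a single edge
        return False
    vertices = {v for e in edges for v in e}
    return connected_without(vertices, edges) and all(
        connected_without(vertices, edges, removed=v) for v in vertices
    )


triangle = [(1, 2), (2, 3), (3, 1)]
path = [(1, 2), (2, 3)]
print(is_non_separable(triangle), is_non_separable(path))
```

A circuit such as the triangle passes the test, while the two-edge chain fails because removing its middle vertex disconnects it.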

A semi-split of G at the vertices $h_1$ and $h_2$ is a representation of G as a sum 


(1) $G = H + H'$, 


where H and H’ are two non-void subgraphs, having in common no arcs and 
no vertices except the vertices $h_1$ and $h_2$. A split of G at $h_1$ and $h_2$ is a 
representation (1) which is a semi-split and where neither H nor H’ is a chain. Since G is 
non-separable, H must contain both $h_1$ and $h_2$, and must be connected. Hence 
there is a chain in H with ends $h_1$ and $h_2$, and in like manner, a chain in H’ with 
the same ends. 

Take any chains X and X’ contained in H’ and H respectively, and having 
the ends $h_1$ and $h_2$. The two subgraphs 


(2) $H + X$, $H' + X'$; $X \subset H'$, $X' \subset H$ 


will be called blocks¹¹ of G corresponding to the split (1). We say that G has 
been split into these two blocks. These blocks are not uniquely determined, 
for X may be replaced by other chains from H’ with the same ends. However, 
any such X is a hanging chain in the block H + X, so that this block is uniquely 
determined up to a homeomorphism. 
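
The construction (2) can be illustrated by a short sketch. It assumes that the two components H and H’ of a split are already given as edge lists sharing only $h_1$ and $h_2$; the names `chain_between` and `blocks_of_split` are hypothetical, and a breadth-first search is merely one convenient way to pick a chain.

```python
from collections import deque


def chain_between(edges, a, b):
    """Return a chain (list of edges) from a to b inside `edges`, or None."""
    parent = {a: None}
    queue = deque([a])
    while queue:
        x = queue.popleft()
        if x == b:
            break
        for e in edges:
            if x in e:
                y = e[1] if e[0] == x else e[0]
                if y not in parent:
                    parent[y] = (x, e)
                    queue.append(y)
    if b not in parent:
        return None
    chain = []
    while parent[b] is not None:  # walk back from b to a
        b, e = parent[b]
        chain.append(e)
    return chain[::-1]


def blocks_of_split(H, H_prime, h1, h2):
    """Blocks H + X and H' + X' of (2), with X taken in H' and X' taken in H."""
    X = chain_between(H_prime, h1, h2)
    X_prime = chain_between(H, h1, h2)
    return H + X, H_prime + X_prime


H = [(1, 3), (3, 2), (1, 4), (4, 2), (3, 4)]
H_prime = [(1, 5), (5, 2), (1, 6), (6, 2), (5, 6)]
block, block_prime = blocks_of_split(H, H_prime, 1, 2)
print(block)
print(block_prime)
```

Choosing a different chain X from H’ changes the block only by a homeomorphism, in agreement with the remark above.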

A graph G will be called triply connected¹² if it is cyclically connected, if it 


9 H. Whitney, On the classification of graphs, American Journal of Mathematics, vol. 55 
(1933), pp. 236-244. 

10 Whitney, Non-separable graphs, loc. cit., Theorem 7. Alternatively, a graph is 
non-separable if it is not disconnected by the removal of any one vertex. 

11 Since H + X is obtained by replacing all arcs of H’ by a single chain X, we may 
consider this block H + X as a sort of “factor-graph” of G modulo H’. This analogy with 
factor-groups is suggested because the group of cycles of H + X (mod 2) is isomorphic to 
the factor-group of the group of cycles of G modulo the group of cycles of H’, both taken 
mod 2. 

12 This notion of triply connected (call it TCM) does not always agree with the notion 
of triply connected (TCW) introduced by Whitney, in Congruent graphs, loc. cit., p. 158. 
The two definitions agree for graphs G containing at least four vertices and containing 




















STRUCTURE OF PLANAR COMBINATORIAL GRAPHS 463 


has no split, and if G is neither a circuit nor a branch graph. A cyclically connected 
graph G split into two blocks may be further decomposed by splitting 
one of these blocks which may happen not to be triply connected. Such suc- 
cessive splits finally yield a set of unsplittable blocks. By the definition of triple 
connectivity, each unsplittable block is either a branch graph or a triply con- 
nected graph. The branch graphs are unimportant. The triply connected 
blocks we call atoms, and all the triply connected blocks in a final set of un- 
splittable blocks will be said to constitute a complete set of atoms of G. 

We now list two useful consequences of our definitions. 

Lemma 1. If in the graph G there is a chain L which does not pass through¹³ 
either vertex $h_1$ or $h_2$, then L is contained in one of the blocks of any split at the 
vertices $h_1$ and $h_2$. 

The proof follows at once from the definition of a split. 

Lemma 2. A block of a cyclically connected graph is always cyclically connected. 

Proof. By hypothesis any two vertices p and q in the block H + X of (2) 
lie together on a circuit C in G. If C does not lie entirely within the block 
H + X, then one piece of C between $h_1$ and $h_2$ can be replaced by the chain X, 
giving a new circuit $C^*$ in the block and connecting the vertices p and q. 


3. A unique characterization of atoms. This section will give a combinatorial 
proof that the atoms of a graph are unique, up to a homeomorphism. This 
will be done by giving an invariant characterization of these atoms as maximal 
triply connected subgraphs of G. Here a subgraph T of G is maximal triply 
connected (max. trip. conn.) if T is contained in no other triply connected 
subgraph $W \neq T$. 

THEOREM 1. Every atom of G is a maximal triply connected subgraph. If 


(3) $A_1, A_2, \dots, A_a$ 


is a complete set of atoms of G, then every maximal triply connected subgraph of G 
is homeomorphic, preserving branch points, to one and only one of the atoms (3). 

Proof. If T is any trip. conn. subgraph of G, then in the split (1), one of the 
blocks contains a trip. conn. subgraph $T^*$ homeomorphic, preserving branch 
points, to T. To show this, use the equation 


(4) $T = (T \cap H) + (T \cap H')$, 


where $T \cap H$ and $T \cap H'$ have in common no arcs and only the vertices $h_1$ and 
$h_2$. As T is trip. conn., this cannot be a split, and one of the subgraphs, say 
$T \cap H'$, is void or a single chain. If it is void, then $T \subset H$, so that T itself is 


neither a circuit consisting of only two edges nor a vertex on only two edges. A 
graph containing such a circuit is never TCM, but may be TCW. A graph containing such a 
vertex is never TCW, but may be TCM. The definition TCM used above has the 
advantage that it is invariant under any homeomorphism of the graph. 

13 A chain L does not pass through h if h is not in L or is only an endpoint of L. 








in one of the blocks. On the other hand, if $T \cap H'$ is a single chain Y, this 
chain must have the ends $h_1$ and $h_2$, so that 
(5) $T^* = (T - Y) + X$ ($Y = T \cap H'$) 
is a new subgraph homeomorphic to T, preserving branch points, and $T^*$ is 
contained in the block H + X, as required. 

To show that any trip. conn. atom A is max. trip. conn., suppose instead that 
$A \subset T$, $A \neq T$ holds for a trip. conn. T with more edges than A. If we make 
one of the splits (1), leading up to the construction of the atom A, then A belongs 
to one of the blocks (2), say to the block H + X. But $T \supset A$, so that T 
contains at least a circuit of H, and the $T^*$ constructed above from T must certainly 
be in the same block H + X with A. Because $T \supset A$, and because $T^*$ is 
obtained from T by changing at most one chain not in A, we must have $T^* \supset A$. 
Therefore, the atom A is contained in a larger trip. conn. subgraph $T^*$ of the 
block H + X. We repeat this argument, getting A in successively smaller 
blocks until A is itself a block contained in a larger trip. conn. subgraph $T_1$, 
which in turn is contained in the block A. This is a manifest impossibility. 
Hence every atom is max. trip. conn. 

Consider any max. trip. conn. subgraph T. We know that it is homeomorphic, 
preserving branch points, to a $T^*$ in the block H + X. This $T^*$ is 
max. trip. conn. in this block. This is obvious if $T^* = T$. Otherwise $T \neq T^*$, 
and we have $Y \neq X$ in the construction (5) of $T^*$. Were $T^*$ contained in a 
larger trip. conn. subgraph W in this block, then W, perhaps modified by 
replacing the chain X by the chain Y from (5), would be a trip. conn. subgraph 
properly containing T, contrary to the assumption that T is maximal. As a 
result $T^*$ is a max. trip. conn. subgraph homeomorphic to T and contained in 
the block H + X. Continuing this, we finally obtain a max. trip. conn. 
subgraph $T_1$ homeomorphic, preserving branch points, to the original T and 
contained in a smallest block A. As this A contains the triply connected $T_1$, it 
cannot be a branch graph, and so must itself be trip. conn. The maximal $T_1$ 
must be all of A, so that the original T is homeomorphic to the atom $A = T_1$. 
It can be homeomorphic, preserving branch points, to no other atom, for no 
two blocks¹⁴ and hence no two atoms can contain the same branch points. The 
theorem is thus established. It states that a set (3) of atoms contains all max. 
trip. conn. subgraphs of G, except for certain homeomorphisms. Hence we deduce 

THEOREM 2. If a cyclically connected graph G can be split up in two ways to 
give two complete sets of atoms, then there exists a one-to-one correspondence between 
these two sets of atoms, so that corresponding atoms are homeomorphic, preserving 
branch points. 

This theorem could also be proved by a direct consideration of two given splits 
by a method similar to that of the Jordan-Hölder theorem; that is, by first 
showing that any two given original splits of G can be subdivided to give homeo- 
morphic results. 


14 Except in the trivial case when one block is a branch graph. 











The characterization of atoms given above can be made independent of the 
notion of a split by means of an independent description of triply connected 
graphs, based on the result of Whitney that the chief characteristic of a triply 
connected graph is the fact that any two of its branch points are the termini 
of three independent arcs. These independent arcs form a $\theta$-subgraph, where 
a “$\theta$-graph” means a branch graph with three branches. 

THEOREM 3. A cyclically connected graph G is triply connected if and only if 

(i) for each pair of branch vertices p and q of G there is a $\theta$-subgraph W of G 
with termini p and q, and 

(ii) G contains no circuits passing through less than three branch points. 

Proof. Suppose first that conditions (i) and (ii) hold while G still has a split 
(1). Since H is not a single chain, and since by (ii) H can not be a branch 
graph, H must contain a branch vertex p different from both $h_1$ and $h_2$. Similarly, 
H’ contains a branch vertex q distinct from $h_1$ and $h_2$. By (i) there is a $\theta$-graph 
with termini p and q, and each of the three independent arcs of this $\theta$-graph 
must pass through one of the two vertices $h_1$ or $h_2$ to go from p to q. Thus two 
of these arcs intersect, a contradiction. 

Conversely, suppose that G is trip. conn. and suppose, contrary to (ii), that 
G has a circuit C with only two branch vertices $h_1$ and $h_2$. Then 
$G = C + (G - C)$ would be a semi-split of G at $h_1$ and $h_2$. This semi-split must be a split, 
because were $G - C$ a single chain, G would be merely a $\theta$-graph, contrary to 
the definition of triple connectivity. 

To prove (i) for a trip. conn. G, first modify the graph by replacing each 
maximal hanging chain of G by a single edge. The resulting graph contains 
no vertices not branch vertices, is homeomorphic to G, and so is still trip. conn. 
One argues readily that it is also trip. conn. in the slightly different sense of this 
term used by Whitney.¹⁵ Then by Whitney’s result¹⁶ it follows that any two 
vertices of G are joined by three distinct chains, as asserted. 


4. Maps of triply connected graphs. A map $\sigma$ of a combinatorial graph G 
is a correspondence which assigns to each vertex p of G a point $\sigma p$ on the sphere 
and to each edge $\alpha$ of G a Jordan arc $\sigma(\alpha)$ on the sphere, such that the ends of 
$\sigma(\alpha)$ are the maps of the ends of $\alpha$, while $\sigma p \neq \sigma q$ if $p \neq q$ and two arcs $\sigma(\alpha)$ 
and $\sigma(\beta)$ with $\alpha \neq \beta$ do not intersect, except perhaps at their end points. Any 
subgraph E of G has an image $\sigma(E)$ composed of all those arcs $\sigma(\alpha)$ with $\alpha$ in E. 
Two maps $\sigma(G)$ and $\tau(G)$ of G on the sphere will be considered as identical if 
there exists a topological transformation of the sphere carrying $\sigma(G)$ into $\tau(G)$ 
and each $\sigma(\alpha)$ and $\sigma(p)$ into the corresponding $\tau(\alpha)$ or $\tau(p)$. 

If G is non-separable, any map o(G) cuts the sphere into a number of con- 
nected domains whose boundaries are the maps of certain circuits of G. These 
circuits we call the complementary domain boundaries (c. d. boundaries) of the 
map. We state without proof the following 


15 H. Whitney, Congruent graphs, loc. cit., p. 158. 
16 H. Whitney, ibid., Theorem 7, p. 160. 








THEOREM 4. Two maps of a cyclically connected graph on the sphere are 
topologically equivalent if and only if they have the same set of complementary domain 
boundaries.¹⁷ 

The possible c. d. boundaries of a non-separable graph can be characterized 
in the following fashion: 

THEOREM 5. A set of circuits $C_1, \dots, C_m$ in a non-separable graph G is the 
set of complementary domain boundaries of a planar map of G if and only if each 
edge of G is contained in exactly two of the circuits $C_i$, while the circuits $C_1, \dots, C_m$ 
form a complete independent set¹⁹ of circuits in G, mod 2. 
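
The condition of Theorem 5 can be tested mechanically. The sketch below is an illustration under assumptions, not the paper's procedure: edges are indexed 0, 1, ..., each circuit is supplied as a set of edge indices (and is assumed to be a genuine circuit), G is assumed connected so that its nullity is E - V + 1, and the names `rank_mod2` and `satisfies_theorem5` are hypothetical. The independence requirement is read, following footnote 19, as "the rank mod 2 equals m - 1, which equals the nullity".

```python
def rank_mod2(vectors):
    """Rank over GF(2) of integers viewed as bit vectors (Gaussian elimination)."""
    pivots = {}  # leading bit position -> reduced vector with that leading bit
    for v in vectors:
        while v:
            top = v.bit_length() - 1
            if top not in pivots:
                pivots[top] = v
                break
            v ^= pivots[top]
    return len(pivots)


def satisfies_theorem5(edges, circuits):
    """Check: each edge on exactly two circuits, and rank = m - 1 = nullity."""
    m = len(circuits)
    for i in range(len(edges)):
        if sum(i in c for c in circuits) != 2:
            return False
    vertices = {v for e in edges for v in e}
    nullity = len(edges) - len(vertices) + 1  # assumes G is connected
    masks = [sum(1 << i for i in c) for c in circuits]
    return rank_mod2(masks) == m - 1 == nullity


# Complete graph on four vertices with its four triangles as candidate boundaries.
k4 = [(1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4)]
triangles = [{0, 1, 3}, {0, 2, 4}, {1, 2, 5}, {3, 4, 5}]
print(satisfies_theorem5(k4, triangles))
```

Since every edge lies on exactly two of the circuits, their sum mod 2 vanishes, so the rank can never exceed m - 1; the check therefore asks that this bound be attained and equal the nullity.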

Adkisson has shown²⁰ that if a cyclically connected graph G has a map in 
which each pair of c. d. boundaries has a connected or void intersection, then 
every homeomorphism of G to itself can be extended to the sphere. This 
suggests that any two maps of such a G on the sphere are identical. But the 
intersection of any pair of c. d. boundaries will be connected or void if and only if 
the graph G can not be split. This agrees with a theorem of Whitney,²¹ which 
states that a triply connected graph has at most one dual (and hence at most one 
map on the sphere). We shall now find an intrinsic characterization of the 
unique map on the sphere of a triply connected graph. 

A circuit C is a cut circuit in the triply connected graph G if there are two 
non-void subgraphs H and H’ of G such that $G - C = H + H'$, while H and H’ 
have in common only vertices on C. 

THEOREM 6. If G is a triply connected graph with a planar map $\sigma(G)$, then a 
circuit $C \subset G$ is a complementary domain boundary of $\sigma(G)$ if and only if C is 
not a cut circuit of G. 

Proof. Any C not a c. d. boundary of $\sigma$ certainly cuts G into the two 
non-void pieces located respectively within and without the closed curve $\sigma(C)$. 
Conversely, let C be a c. d. boundary of $\sigma(G)$ and suppose, contrary to the 
theorem, that C is a cut circuit of G. Then $\sigma G - \sigma C$ is not connected, and 
so has $m \geq 2$ connected pieces $E_1, E_2, \dots, E_m$. Each closure $\bar{E}_i$ consists of 
$E_i$ plus end points of arcs, and so is the map of a corresponding subgraph $G_i$ of G. 


17 As the number of domain boundaries is finite, this theorem may be readily proved by 
mathematical induction from the fact that a topological correspondence between any two 
closed Jordan curves can be extended to the interiors of these curves. The induction 
could depend on Mac Lane, loc. cit., Theorem 3.1. The theorem also follows by the method 
used by V. W. Adkisson, loc. cit., Theorem 2; it is based on a theorem due to H. M. Gehman, 
On extending a continuous one-to-one correspondence of two plane continuous curves to a 
correspondence of their planes, Transactions of the American Mathematical Society, vol. 28 
(1926), pp. 252-265. 

18 This is a re-formulation of the condition quoted in the introduction, using Mac Lane, 
loc. cit., Lemma 4.1, Theorem 5.1, and Theorem 5.3. 

19 The second condition is equivalent to requiring that $C_1, \dots, C_{m-1}$ are independent 
modulo 2 and that $m - 1$ is the nullity of G. The nullity of G is $E(G) - V(G) + P(G)$, 
where E(G), V(G), and P(G) are respectively the number of edges, the number of vertices, 
and the number of connected pieces of G. 

20 V. W. Adkisson, loc. cit., Theorem 3, p. 168. 

21 H. Whitney, Congruent graphs, Theorem 11. 











These $G_i$ are the smallest sets into which C cuts G. Those vertices of each $G_i$ 
which lie on C we call the feet of $G_i$. 

Each $G_i$ has at least three feet. For if $G_i$ had no foot on C, G would be 
disconnected; if $G_i$ had one foot on C, G would be separable at this foot, while if 
$G_i$ had two distinct feet, G could be split at these two feet into $G_i$ plus the 
remaining part of G, unless $G_i$ were a single chain. If $G_i$ is a single chain with 
two feet on C, this chain divides the exterior of C into two regions. The two 
subgraphs of G contained in the closures of these two regions, respectively, 
intersect only in the two feet of $G_i$, so that G is again split. Each of these 
results contradicts the hypothesis that G is triply connected. 

Let $p_1$ and $p_2$ be two feet of $G_1$. Because $E_1$ is connected, $p_1$ and $p_2$ can be 
joined by a chain L in $G_1$. Any set $E_i \neq E_1$ contains no points of $\sigma L$ or of the 
c. d. boundary $\sigma C$, so that $E_i$ must lie entirely within one of the two regions 
bounded by $\sigma L$ and an arc of $\sigma C$. Therefore, the feet of each $G_i \neq G_1$ all lie 
on one of the two arcs into which $p_1$ and $p_2$ divide C. 

We can choose the feet $p_1$ and $p_2$ of $G_1$ so that one of the arcs $C_1$ into which 
they divide C contains all the feet of $G_2$, but no feet of $G_1$ except for $p_1$ and $p_2$. 
Then G can be split. For let H denote the subgraph of G composed of $C_1$ and 
all those subgraphs $G_i$ whose feet lie only on $C_1$, while H’ consists of $C - C_1$ 
and all those $G_i$ whose feet all lie on $C - C_1$. By the previous result, every 
$G_i$ belongs to exactly one of these subgraphs, so that $G = H + H'$, where H 
and H’ have only $p_1$ and $p_2$ in common. As $G_1$ has at least three feet, $G_1$ is in 
H’, while $G_2$ is in H, so that neither H nor H’ is a single chain. Therefore this 
is a split, contrary to the triple connectivity of G. Hence the c. d. boundaries 
can not cut G, as asserted in the theorem. 

THEOREM 7. A triply connected graph can not have two topologically distinct 
maps on the sphere. 

Proof. If there is a map, its c. d. boundaries are exactly the circuits which 
do not cut G. This property is independent of the particular map, so that there 
can be only one set of c. d. boundaries and hence by Theorem 4 at most one map 
on the sphere. Another immediate result is the following criterion for 
mappability: 

THEOREM 8. A triply connected graph G can be mapped on the sphere or on the 
plane if and only if the circuits of G which do not cut G form a set of circuits satisfy- 
ing the condition of Theorem 5 for a set of c. d. boundaries. 
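
The notion of a cut circuit used in Theorems 6 to 8 can be made concrete as follows. This is a hedged sketch: edges are indexed, the circuit C is supplied as a set of edge indices and is assumed to be a genuine circuit, and `is_cut_circuit` is a hypothetical name. The edges of G - C are grouped into pieces that hang together through vertices not on C; C cuts G exactly when there are at least two such pieces.

```python
def is_cut_circuit(edges, circuit):
    """True if G - C falls into two or more pieces meeting only on C."""
    on_c = {v for i in circuit for v in edges[i]}
    rest = [i for i in range(len(edges)) if i not in circuit]
    parent = {i: i for i in rest}  # union-find over the edges of G - C

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    owner = {}  # vertex off C -> first edge of G - C seen at that vertex
    for i in rest:
        for v in edges[i]:
            if v in on_c:
                continue  # vertices on C do not join pieces
            if v in owner:
                parent[find(i)] = find(owner[v])
            else:
                owner[v] = i
    return len({find(i) for i in rest}) >= 2


k4 = [(1, 2), (1, 3), (1, 4), (2, 3), (2, 4), (3, 4)]
print(is_cut_circuit(k4, {0, 1, 3}))     # triangle 1-2-3
print(is_cut_circuit(k4, {0, 2, 3, 5}))  # quadrilateral 1-2-3-4
```

In the complete graph on four vertices the triangles do not cut the graph while the quadrilaterals do, so the triangles are the candidates for c. d. boundaries in the sense of Theorem 8.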


5. Maps of cyclically connected graphs. The final theorem on mappability is 

THEOREM 9. A cyclically connected graph G can be mapped on the plane if 
and only if all its atoms can be mapped on the plane; that is, if and only if all its 
triply connected atoms satisfy the mappability criterion of Theorem 8. 

This theorem will follow by induction on the number of atoms once we show 
that a graph G split into two blocks has a map whenever both blocks have 
maps. This fact we state in the following more explicit form. 

Lemma 3. If G is split as in (2), and if $\tau$ and $\tau'$ are maps on the sphere of the 
blocks H + X and H’ + X’ respectively, where in $\tau$ the chain X appears in two c. d. 
boundaries C and D, while X’ appears in $\tau'$ in the boundaries C’ and D’, then there 
is a map of G in which the c. d. boundaries are the c. d. boundaries of $\tau$ and $\tau'$, 
except that the circuits C, D, C’, and D’ are replaced by 


(6) $(C - X) + (C' - X')$, $(D - X) + (D' - X')$. 


Proof. Because the chains $C - X$ and $C' - X'$ belong to H and H’ 
respectively, they have no arcs in common, while by construction they both have 
the two vertices $h_1$ and $h_2$ as ends. Hence $(C - X) + (C' - X')$ and likewise 
the other graph in (6) is actually a circuit in G. Once the lemma is established, 
a simple change of notation will give a similar result with (6) replaced by 


(7) $(C - X) + (D' - X')$, $(D - X) + (C' - X')$. 


To prove the lemma we need only squeeze the map $\tau'(H')$ into the region 
of the map $\tau(H)$ originally occupied by $\tau(X)$. This may be done as follows. 

Draw an arc $\tau(Z)$ with ends $\tau(h_1)$ and $\tau(h_2)$ in the region bounded by C and 
draw a similar arc $\tau'(Z')$ inside C’. In these altered maps the c. d. boundaries 
are as before except that C and C’ are replaced by 


(8) $Z + (C - X)$, $Z + X$, $Z' + (C' - X')$, $Z' + X'$, 


respectively. One of the two regions of the sphere bounded by X + Z contains 
the rest of $\tau(H)$. Call this region the “outside” of X + Z, with a similar 
convention as to X’ + Z’. Then map the outside and boundary of X’ + Z’ 
topologically on the inside and boundary of X + Z in such a fashion that $\tau'(h_1)$ and 
$\tau'(h_2)$ go into $\tau(h_1)$ and $\tau(h_2)$ respectively, while $\tau'(X')$ goes into $\tau(X)$ and 
$\tau'(Z')$ into $\tau(Z)$. The two maps so combined have c. d. boundaries which 
are the original c. d. boundaries of $\tau$ and $\tau'$ except that C, C’, D, and D’ are 
replaced by 

(9) $Z + (C - X)$, $Z + (C' - X')$, $X + (D - X)$, $X + (D' - X')$. 


If we remove the maps of X and Z, a map of G with the required c. d. boundaries 
will remain. The lemma being established, the theorem follows. 
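
The replacement rule (6) of Lemma 3 is a purely combinatorial operation on sets of edges, which the following sketch illustrates. The representation is an assumption: each c. d. boundary is a `frozenset` of edge labels, the chains X and X’ are likewise frozensets, each is assumed to lie on exactly two boundaries of its block, and `combine_boundaries` is a hypothetical name.

```python
def combine_boundaries(tau, tau_prime, X, X_prime):
    """Boundaries of a map of G built from maps of the two blocks, as in (6)."""
    C, D = [c for c in tau if X <= c]                  # the two boundaries through X
    C_p, D_p = [c for c in tau_prime if X_prime <= c]  # the two boundaries through X'
    untouched = [c for c in tau + tau_prime if c not in (C, D, C_p, D_p)]
    return untouched + [(C - X) | (C_p - X_prime), (D - X) | (D_p - X_prime)]


# Two blocks, each homeomorphic to the complete graph on four vertices.
X = frozenset({"x1", "x2"})
tau = [frozenset({"a", "c", "e"}), frozenset({"b", "d", "e"}),
       frozenset({"a", "b"}) | X, frozenset({"c", "d"}) | X]
Xp = frozenset({"y1", "y2"})
tau_prime = [frozenset({"a2", "c2", "e2"}), frozenset({"b2", "d2", "e2"}),
             frozenset({"a2", "b2"}) | Xp, frozenset({"c2", "d2"}) | Xp]

for boundary in combine_boundaries(tau, tau_prime, X, Xp):
    print(sorted(boundary))
```

Exchanging the roles of C’ and D’ in the last line of the function gives the alternative (7) instead of (6).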


6. The number of maps of a graph. In this section we denote by $\mu(G)$ the 
number of topologically distinct maps of a graph G on a sphere. If one map of 
G is given, other maps may be found by “rotating” one of the components of a 
split of G or by permuting several components which are connected in “parallel”. 
To show that all distinct maps of G can be obtained by sequences of such 
operations, we first discuss such components in “parallel”. Given two vertices p and 
q of G, we say that two edges $\alpha$ and $\beta$ of G are connected outside of p and q if there 
is a chain L of G containing $\alpha$ and $\beta$ but passing through neither p nor q. The 
relation, “$\alpha$ is connected to $\beta$ outside p and q”, is reflexive, symmetric, and 
transitive. Therefore G is subdivided into disjoint subgraphs $M_1, M_2, \dots, M_t$ 
such that two edges of G belong to the same subgraph if and only if they are 
connected outside p and q. Consequently, 


(10) $G = M_1 + M_2 + \cdots + M_t$ 


is a representation of G in which any two of the subgraphs $M_i$ and $M_j$ have 
only the vertices p and q in common. Furthermore, no subgraph $M_i$ has a 
semi-split at p and q, so that the representation (10) can not be further subdivided. 
If t > 2 in (10), we say that G has a multiple split of order t at p and q. 
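
The decomposition (10) can be computed by merging edges that share a vertex other than p and q. The sketch below assumes an edge-list representation with edges identified by their index; `classes_outside` is a hypothetical name, and the union-find bookkeeping is one convenient implementation choice rather than anything prescribed by the text.

```python
def classes_outside(edges, p, q):
    """Group edge indices into the subgraphs M_1, ..., M_t of (10)."""
    parent = list(range(len(edges)))

    def find(i):
        while parent[i] != i:
            parent[i] = parent[parent[i]]
            i = parent[i]
        return i

    first_edge_at = {}  # vertex other than p, q -> first edge seen there
    for i, e in enumerate(edges):
        for v in e:
            if v in (p, q):
                continue  # chains may end at p or q but not pass through them
            if v in first_edge_at:
                parent[find(i)] = find(first_edge_at[v])
            else:
                first_edge_at[v] = i
    groups = {}
    for i in range(len(edges)):
        groups.setdefault(find(i), []).append(i)
    return sorted(groups.values())


# Three "parallel" components between vertices 1 and 2.
g = [(1, 3), (3, 2), (1, 4), (4, 2), (1, 2)]
print(classes_outside(g, 1, 2))
```

For this example there are three classes, so the graph has a multiple split of order 3 at the vertices 1 and 2.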

THEOREM 10. If G can be mapped on the sphere, the number $\mu(G)$ of topologically 
distinct maps of G on the sphere is 


(11) $\mu(G) = 2^{a-1} \prod_{i=1}^{k} (t_i - 1)!$, 

where $a = a(G)$ is the number of atoms in a complete set of atoms of G, where G has 
k distinct multiple splits, and where the i-th multiple split has the order $t_i$. 
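
Formula (11) is easily evaluated once the number of atoms and the orders of the multiple splits are known. The following is a small sketch; the function name `sphere_map_count` is hypothetical, and the restriction to at least one atom is an assumption made here so that the power of two is an integer.

```python
from math import factorial, prod


def sphere_map_count(num_atoms, split_orders):
    """Evaluate formula (11): 2^(a-1) times the product of (t_i - 1)!."""
    if num_atoms < 1:
        raise ValueError("this sketch assumes at least one atom (a >= 1)")
    if any(t < 3 for t in split_orders):
        raise ValueError("a multiple split has order t > 2")
    return 2 ** (num_atoms - 1) * prod(factorial(t - 1) for t in split_orders)


print(sphere_map_count(1, []))   # a single atom, no multiple split
print(sphere_map_count(2, [3]))  # two atoms, one multiple split of order 3
```

A triply connected graph is its own single atom and has no multiple split, so the formula returns 1, in agreement with Theorem 7.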

We shall establish this theorem by constructing the atoms from “least” 
splits of G. A component H in a split (1) of G at $h_1$ and $h_2$ will be called least 
if the component H has no semi-split at $h_1$ and $h_2$; that is, if any two arcs of H 
are connected outside $h_1$ and $h_2$. A split (1) of G will itself be called least if 
either of the components H or H’ of the split is least. 

Lemma 4. Let (1) be a least split of G into H and H’. Then in any map $\sigma$ of G 
there are two c. d. boundaries which have edges in both H and H’, while any other 
c. d. boundary is entirely in H or entirely in H’. 

Proof. Edges of both H and H’ abut on the split point $h_1$. The cyclic order 
of the complementary domains about $h_1$ indicates that through this point there 
must pass at least two c. d. boundaries which have edges in both H and H’. 

Suppose there were three (or more) such boundaries with edges in both H 
and H’. If the three complementary domains with these boundaries are 
removed, the remaining part of the sphere falls into three parts which touch only 
at $h_1$ and $h_2$. Since H has edges in all three c. d. boundaries, it follows readily 
that H must have edges in at least two of the three parts of the sphere. As 
these two parts of H touch only in $h_1$ and $h_2$, H has a semi-split at these vertices. 
H’ likewise has a semi-split, contrary to the assumption that one of H and H’ 
is least. Therefore there are but two c. d. boundaries of the sort considered. 

Lemma 5. If G has a least split (2), there exists a 2-1 correspondence between 
the maps $\sigma$ of G on the sphere and the pairs of maps $(\tau, \tau')$, where $\tau$ and $\tau'$ are maps 
on the sphere of the blocks H + X and H’ + X’ respectively. 

Proof. A given map $\sigma$ of the whole graph G is also a map of each of the 
subgraphs (or blocks) H + X and H’ + X’, and hence does correspond to a pair of 
maps of these blocks. Specifically, let $\sigma$ be determined (Theorem 4) by its c. d. 
boundaries 


(12) $C_1, C_2, \dots, C_r$, $C'_1, \dots, C'_s$, $L + L'$, $M + M'$, 


where each $C_i$ is a circuit contained in H, and each $C'_i$ is contained in H’, while, 
as in the last lemma, the two circuits with edges in both H and H’ consist of 
chains L and M in H and L’ and M’ in H’. The sub-map of H + X arises by 
dropping from the map (12) all edges of $H' - X$. The first circuits $C_1$ to $C_r$ 
must thus remain as c. d. boundaries in the map of H + X, while by Theorem 5 
there must be two other c. d. boundaries containing all of L and all of M 
respectively. Furthermore, X must be in two boundaries, so that the additional 
boundaries are just X + L and X + M. This gives a map of H + X with the 
c. d. boundaries 


(13) $C_1, C_2, \dots, C_r$, $L + X$, $M + X$, 
and similarly a map of the other block H’ + X’ with the boundaries²² 
(14) $C'_1, \dots, C'_s$, $L' + X'$, $M' + X'$. 


Therewith we have a correspondence between the map (12) and the “pair of 
maps” (13) and (14). Conversely, given any two maps (13) and (14) of the 
two blocks, there can be at most two maps of G corresponding to (13) and (14) 
in this fashion; namely, the maps given by the c. d. boundaries in (12) or in 


(15) $C_1, \dots, C_r$, $C'_1, \dots, C'_s$, $L + M'$, $M + L'$. 


Both of these maps (12) and (15) are geometrically possible by Lemma 3. 
Hence the correspondence is a 2-1 correspondence, as asserted in Lemma 5. 

Lemma 6. If G has a least split (1), then, for any multiple split of G of order 
t at p and q, one of the blocks (2) of the least split has a multiple split of order t at 
p and q, while the other block has no multiple split at p and q. The splits so 
obtained are the only multiple splits of the blocks. 

The second statement is simple, for if one of the blocks has a multiple split 
at the vertices r and s, then, by the definition of a split in (10), the original 
graph G also has a multiple split at r and s. The proof of the first part depends 
essentially on the fact that one of the components, say H, of the given split (1) 
is least. 

Case 1. $p = h_1$ and $q = h_2$. As H is least, it must be one of the $M_i$ in the 
given multiple split (10) of G, say $M_1$. Then 


$H' = M_2 + \cdots + M_t$, 


so that the other block H’ + X’ has a multiple split like (10) with $M_1$ replaced 
by X’, and this multiple split has the order t. 

Case 2. Neither H nor H’ contains both p and q, so that one vertex, say p, 
is in H but not H’, while q is in H’, but not in H. But in the multiple split (10) 
each $M_i$ must, because G is cyclically connected, contain a chain with ends 
p and q, and this chain must pass through one of the split vertices $h_1$ or $h_2$ of the 
split (1). Since each of these split vertices is in but one of the M’s, there can 


22 This could also be proved purely combinatorially by showing that if the set of circuits 
(12) satisfies Theorem 5, the two sets (13) and (14) also satisfy Theorem 5. 

















thus be only two subgraphs $M_i$, and the presumed decomposition (10) is no 
honest multiple split. 

There remains the case when p and q are both in one component, say the 
component H of (1), while one of the split vertices, say $h_1$, is at neither p nor q. 
Then any arc of H’ is connected to this $h_1$ by a chain not passing through either 
p or q, so that all of H’ is contained in one of the pieces $M_i$ of the given multiple 
split (10) of G. Thus, when all these edges of H’ are replaced simply by X, 
(10) gives a multiple split of the block H + X with the same order t, while 
the other block H’ + X’ can not contain both p and q as branch vertices and so 
certainly has no multiple split at p and q. 

In all cases, the given multiple split yields a similar multiple split of just one 
of the blocks (2), so that the lemma holds. 

Lemma 7. If a graph G has no least split, then G is triply connected or is a branch 
graph or a circuit. 

For if G is not triply connected, it must have a split at some pair of vertices 
(p, q), and hence a representation (10). There must be $t \geq 3$ terms, because 
the split is not least. If one of the subgraphs $M_i$ were not a single arc, then 
$G = M_i + (G - M_i)$ would be a split in which $M_i$ is least. Such a least split 
is impossible by hypothesis, so each $M_i$ is a single arc, and G is a branch graph. 

To prove Theorem 10, decompose G by successive least splits until none of the 
resulting blocks have least splits. By Lemma 7 these blocks are either branch 
graphs or else are triply connected and hence atoms of G, so that we have a set 
of blocks 


(16) $A_1, A_2, \dots, A_a$, $B_1, B_2, \dots, B_\beta$, 


where the a graphs $A_i$ form a complete set of atoms, while the $B_i$ are branch 
graphs. By Lemma 6 each multiple split of G corresponds to one and just one 
multiple split with the same order in one of the blocks (16). Of these blocks, 
only the branch graphs $B_i$ have multiple splits, while G has multiple splits of 
order $t_i$, $i = 1, \dots, k$. Hence the branch graphs in (16) can be so arranged 
that $B_i$ is a branch graph with $t_i$ branches. There are $\beta = k$ such graphs. 

In the set of subgraphs (16) each triply connected atom $A_i$ has but one map 
on the sphere, while each branch graph $B_i$ with $t_i$ branches has $(t_i - 1)!/2$ 
different maps (this is the number of ways of arranging the $t_i$ branches in cyclic order). 
We combine the maps of (16) to get maps of G. Each step in the combination 
will by Lemma 5 yield two alternative maps of G, while there are $a + k - 1$ 
combinations in all. Hence there are 


$2^{a+k-1} \prod_{i=1}^{k} \frac{(t_i - 1)!}{2} = 2^{a-1} \prod_{i=1}^{k} (t_i - 1)!$ 


different maps of G, as asserted in the theorem. 
Any planar map of G can be obtained by stereographically projecting a map 
from the sphere onto the plane. Distinct maps are obtained when the north 
pole is chosen in distinct regions, and the number of such regions is simply 
N(G) + 1; therefore we have the 

COROLLARY. The number of topologically distinct maps of a cyclically connected 
planar graph on the plane is 


$\sigma(G) = [N(G) + 1]\,\mu(G) = [N(G) + 1]\, 2^{a-1} \prod_{i=1}^{k} (t_i - 1)!$, 


where N(G) is the nullity of G and the other constants are given as in the theorem. 
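
As a small worked instance of the corollary, the count of plane maps follows from the nullity and the count of sphere maps. The sketch below uses hypothetical helper names (`nullity`, `plane_map_count`) and takes the number of sphere maps as an input rather than recomputing it; the nullity is E - V + P as recalled in footnote 19.

```python
def nullity(num_edges, num_vertices, num_pieces=1):
    """Nullity N(G) = E(G) - V(G) + P(G)."""
    return num_edges - num_vertices + num_pieces


def plane_map_count(graph_nullity, sphere_maps):
    """Corollary: [N(G) + 1] times the number of maps on the sphere."""
    return (graph_nullity + 1) * sphere_maps


# Complete graph on four vertices: 6 edges, 4 vertices, one map on the sphere.
n = nullity(6, 4)
print(n, plane_map_count(n, 1))
```

Here N(G) + 1 = 4 is the number of regions of the map on the sphere, each of which may contain the north pole of the projection.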


CORNELL UNIVERSITY. 

















CRITICAL CURVATURES IN RIEMANNIAN SPACES 
By Arthur B. Brown 


1. Introduction. A well known theorem in differential geometry concerns the 
normal curvatures of curves through a point on a 2-dimensional surface in 
3-space. It states that either the curvature is constant, independent of the 
direction of the curve, or else there is one direction giving a maximum to the 
curvature, and another (perpendicular) direction giving a minimum. We gen- 
eralize this result to the case of an n-surface in a Riemannian (n + 1)-space. 
In place of merely a maximum and a minimum, there is in general a 
non-degenerate critical point of each type or index¹ $0, 1, \dots, n - 1$. A similar 
result is obtained for an arbitrary subspace of a Riemannian space, the theorem 
being stated in terms of projections on any direction orthogonal to the sub- 
space. A final theorem, with a similar statement regarding critical values, con- 
cerns the Ricci mean curvature in a Riemannian space. 


2. The principal directions for a real quadratic form. Our results regarding 
critical values will be based on the following theorem. 

THEOREM 2.1. Given the real quadratic form³
(2.1) $z = a_{ij} x_i x_j$,
on the locus
(2.2) $x_i x_i = 1$,
z has at most, and in general exactly, n distinct critical values. When the number
is n, the critical values are taken on at n pairs of diametrically opposite points of
(2.2), determining n mutually perpendicular lines through the origin in the number
space of the x's. If the pairs are ordered according to the algebraic values of z,
at either point of the i-th pair z has a non-degenerate critical point of index i − 1.

Proof. We begin by making an orthogonal transformation⁴ with fixed origin
in (x)-space so that the given form becomes (using the same letters $x_1, \dots, x_n$)
(2.3) $z = b_1 x_1^2 + \cdots + b_n x_n^2 = b_i x_i^2$
with the b's real. Now if we consider the function

Received February 24, 1937; presented to the American Mathematical Society, March 
26, 1937. 

1 Marston Morse, The Calculus of Variations in the Large, Amer. Math. Soc. Colloquium
Publications, vol. 18, p. 143.

2 Cf. L. P. Eisenhart, Riemannian Geometry, Princeton, 1926, p. 110. We shall refer to
this volume as Eisenhart.

3 Repetition of an index indicates summation from 1 to n.

4 Cf. M. Bôcher, Introduction to Higher Algebra, p. 170, Theorem 1 and p. 171, Theorem 2;
or L. E. Dickson, Modern Algebraic Theories, p. 74, Theorem 10.




(2.4) $w = b_i x_i^2 / x_j x_j$,


we see that w = z on locus (2.2). Using the fact that w is constant along 
each line directed towards the origin, omitting the origin itself, we easily see 
that those of the critical points of w, as a function of n independent variables, 
which are located on (2.2), are the same points as the critical points of z on (2.2) 
as a function of n — 1 independent variables. 

Then to find the critical points, using (2.4) we set 


$0 = \frac{\partial w}{\partial x_s} = \frac{(x_j x_j)(2 b_s x_s) - (b_i x_i^2)(2 x_s)}{(x_j x_j)^2}$ (do not sum s).


Hence the critical points of z on (2.2) are the points on (2.2) where 

(2.5) $b_s x_s = (b_i x_i^2)\, x_s$ (s = 1, ..., n; do not sum s).
Obviously

(2.6) $x_j = \delta_{kj}$⁵ (j = 1, ..., n)

satisfy (2.2) and (2.5) for any fixed k. Taking k = 1, 2, ..., n, we have n
solutions of (2.5) and (2.2).

We shall now show that (α) the number of distinct critical values of z on (2.2)
equals the number of distinct b's in (2.3), and (β) if that number is n, then
(2.6) gives the only critical points other than those with $x_j = -\delta_{kj}$.
Suppose
(2.7) $b_1 = b_2 = \cdots = b_r$,
but no other b = $b_1$. If we take a solution of (2.2) and (2.5) with one or more
of $x_1, x_2, \dots, x_r$ different from zero, say $x_1 \neq 0$, then if, say, $x_{r+1} \neq 0$, from
(2.5) with s = 1 and s = r + 1 we would have

$b_i x_i^2 = b_1 = b_{r+1}$,
contrary to hypothesis. Hence, for the solution in question,
(2.8) $0 = x_{r+1} = \cdots = x_n$
and hence
(2.9) $1 = x_1^2 + \cdots + x_r^2$.
From (2.7), (2.8), (2.9) and (2.3) we now see that the critical value in question
is $b_1 = b_2 = \cdots = b_r$. Since a similar argument holds for each set of equal
b's, our assertion (α) follows at once.

If we have n distinct critical values, then by (α) the number of distinct b's
must be n, and hence for each critical point only one of the x's can be different
from zero (cf. (2.8)). Thus (β) is established. The perpendicularity of the
directions also follows.


5 $\delta_{kj} = 0$ if k ≠ j, = 1 if k = j.










Consider the solution (2.6) with k = 1. It is (1, 0, ..., 0). At this point
$x_2, \dots, x_n$ can be taken as the independent variables for (2.2).⁶ Substituting
for $x_1^2$ from (2.2) into (2.3), we have
(2.10) $z = b_1(1 - x_2^2 - \cdots - x_n^2) + b_2 x_2^2 + \cdots + b_n x_n^2$
$\quad = b_1 + (b_2 - b_1)x_2^2 + (b_3 - b_1)x_3^2 + \cdots + (b_n - b_1)x_n^2$.

Hence the index of the critical point (the number of negative coefficients) is
the number of b's less than $b_1$.

Since a similar result can be obtained by using (2.6) with each of the values
k = 2, 3, ..., n, we infer that if, say, $b_1 < b_2 < \cdots < b_n$, then, for each k,
$x_j = \delta_{kj}$ gives us a critical point of index k − 1, at which $z = b_k$. We would
obtain the same result by taking $x_j = -\delta_{kj}$. The truth of the theorem is now
established.
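
The index count read off from (2.10) can be illustrated numerically. The following is a minimal sketch and not part of the paper: the function name `critical_points_of_diagonal_form` is hypothetical, and the form is assumed to be already reduced to the diagonal shape (2.3) with distinct b's.

```python
def critical_points_of_diagonal_form(b):
    """For z = sum b_i x_i^2 on the unit sphere, with distinct b's (assumed),
    return the pairs (critical value, index), ordered by critical value.

    At the axis point x_j = delta_kj the local expansion (2.10) has the
    coefficients (b_j - b_k), j != k, so the index is the number of b's
    smaller than b_k.
    """
    if len(set(b)) != len(b):
        raise ValueError("the b's must be distinct for this sketch")
    result = []
    for k, b_k in enumerate(b):
        coefficients = [b_j - b_k for j, b_j in enumerate(b) if j != k]
        index = sum(1 for c in coefficients if c < 0)
        result.append((b_k, index))
    return sorted(result)


print(critical_points_of_diagonal_form([3.0, 1.0, 2.0]))
```

With the b's ordered by size, the k-th critical value is paired with index k − 1, in agreement with the statement of Theorem 2.1.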


3. Riemannian coördinates. In this section we establish Riemannian coördinates⁷
for a Riemannian space without assuming that the coefficients of the
fundamental form are analytic. If one is satisfied with the case that the
coefficients are analytic, this section may be omitted.

THEOREM 3.1. If the fundamental form of a Riemannian space has coefficients
of class $C^k$,⁸ k ≥ 4, neighboring a point P with coördinates (a), a Riemannian
coördinate system can be introduced, for a neighborhood of P, with origin at P,
by a non-singular transformation of coördinates in terms of functions of class $C^{k-1}$.⁹

Proof. If we take $x_1, \dots, x_n$ as the coördinates, and $g_{ij}\,dx_i\,dx_j$ as the
fundamental form, the geodesics are the solutions of


6 The index of the critical point is independent of the particular parameters chosen,
providing z is a function of class $C^2$ of those parameters.

7 A system of coördinates $y_1, \dots, y_n$ in a Riemannian n-space is called Riemannian
if the geodesics through the origin are the curves given by the equations $y_i = \lambda^i t$
(i = 1, ..., n; $\lambda^1, \dots, \lambda^n$ any real constants not all zero). The theorem of this section
has been proved by J. H. C. Whitehead, On the covering of a complete space by geodesics
through a point, Annals of Math., vol. 36 (1935), pp. 679-704. Cf. footnote 5 of T. Y.
Thomas, On normal coördinates, Proc. Nat. Acad. Sci., vol. 22 (1936), pp. 309-312, where
a proof of the theorem is based on results in an earlier paper by W. Mayer and T. Y. Thomas,
Math. Zeitschrift, vol. 40 (1936), pp. 658-661. The proof was also given in some
mimeographed notes of W. Mayer at Princeton in 1936, for a more general variational problem.
We are indebted to G. Comenetz for the above information. Since a simple proof has
not yet been published all in the same paper, we think it advantageous to give one here. In
the papers cited above, the hypotheses are equivalent to taking k = 2 in Theorem 3.1
instead of k ≥ 4. We assume k ≥ 4 to insure that the equations of the geodesics in the
new coördinate system have coefficients of class $C^1$.

8 A function is of class $C^k$ if it is continuous and has all its partial derivatives, up to
and including those of order k, continuous.

9 While we shall be dealing with the case that the fundamental form is positive definite,
we do not make this assumption in this theorem.













(3.1) $\frac{d^2 x_i}{dt^2} + \left\{ {jk \atop i} \right\} \frac{dx_j}{dt}\frac{dx_k}{dt} = 0$ (i = 1, ..., n).

Since $\left\{ {jk \atop i} \right\}$¹⁰ is of class $C^{k-1}$, the solution with $x_i = x_i^0$ and $(dx_i/dt) = (dx_i/dt)_0$
at $t = t_0$ is given, according to a theorem of differential equations, by
(3.2) $x_i = \phi_i[t, t_0, x_1^0, \dots, x_n^0, (dx_1/dt)_0, \dots, (dx_n/dt)_0]$
$\quad = \phi_i[t, t_0, x^0, (dx/dt)_0]$ (i = 1, ..., n),
with $\phi_i$ of class $C^{k-1}$ in all the arguments,¹¹ say for $|t| < \epsilon$, $|t_0| < \epsilon$,
$|x_i^0 - a_i| < \epsilon$, $|(dx_i/dt)_0| < \epsilon$, $\epsilon > 0$. Taking $x_i^0 = a_i$ and $t_0 = 0$, we have
the geodesics through (a) represented by

(3.3) $x_i = \phi_i[t, 0, a, (dx/dt)_0]$ (i = 1, ..., n),

with $\phi_i$ of class $C^{k-1}$ for $|t| < \epsilon$ and $|(dx_i/dt)_0| < \epsilon$. Note that, since $x_i = a_i$
is a solution of (3.1), from the uniqueness of the solution we have

(3.4) $\phi_i(t, 0, a, 0) = a_i$ (i = 1, ..., n).


Now let us make a change of independent variable, $\bar t = ct$, $c \neq 0$, constant,
replacing (3.1) by

(3.5) $\frac{d^2 x_i}{d\bar t^2} + \left\{ {jk \atop i} \right\} \frac{dx_j}{d\bar t}\frac{dx_k}{d\bar t} = 0$ (i = 1, ..., n).

Since (3.5) is of the same form as (3.1), the solutions of (3.5) through (a) and
with $\bar t_0 = 0$ are given by

(3.6) $x_i = \phi_i[\bar t, 0, a, (dx/d\bar t)_0]$
$\quad = \phi_i[ct, 0, a, (dx/dt)_0/c]$, $|ct| < \epsilon$, $|(dx/dt)_0/c| < \epsilon$.
Since (3.5) is equivalent to (3.1), we have, from (3.3) and (3.6),
(3.7) $\phi_i[t, 0, a, (dx/dt)_0] = \phi_i[ct, 0, a, (dx/dt)_0/c]$ (i = 1, ..., n),
if every argument except the a's is less than $\epsilon$ in absolute value.¹²
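
The homogeneity relation (3.7) can be checked numerically on a concrete example. The sketch below is an illustration only and is not taken from the paper: it assumes the unit sphere with coördinates (θ, φ) and fundamental form dθ² + sin²θ dφ², integrates the geodesic equations with a classical Runge-Kutta scheme, and compares the two members of (3.7). All function names are hypothetical.

```python
import math


def geodesic_rhs(state):
    """Geodesic equations (3.1) on the unit sphere as a first-order system.

    state = (theta, phi, dtheta/dt, dphi/dt); the metric
    dtheta^2 + sin(theta)^2 dphi^2 is an assumption of this sketch.
    """
    theta, phi, dtheta, dphi = state
    ddtheta = math.sin(theta) * math.cos(theta) * dphi * dphi
    ddphi = -2.0 * math.cos(theta) / math.sin(theta) * dtheta * dphi
    return (dtheta, dphi, ddtheta, ddphi)


def rk4_step(state, h):
    """One classical fourth-order Runge-Kutta step of size h."""
    def shifted(k, factor):
        return tuple(s + factor * ki for s, ki in zip(state, k))

    k1 = geodesic_rhs(state)
    k2 = geodesic_rhs(shifted(k1, 0.5 * h))
    k3 = geodesic_rhs(shifted(k2, 0.5 * h))
    k4 = geodesic_rhs(shifted(k3, h))
    return tuple(s + h / 6.0 * (a + 2.0 * b + 2.0 * c + d)
                 for s, a, b, c, d in zip(state, k1, k2, k3, k4))


def geodesic_point(start, velocity, t, steps=400):
    """Point reached at parameter value t on the geodesic from `start`
    with initial derivative `velocity` (the role of phi_i in (3.3))."""
    state = (start[0], start[1], velocity[0], velocity[1])
    h = t / steps
    for _ in range(steps):
        state = rk4_step(state, h)
    return state[0], state[1]


start = (1.0, 0.3)        # plays the role of the point (a)
velocity = (0.4, -0.7)    # plays the role of (dx/dt)_0
t, c = 0.5, 2.5

left = geodesic_point(start, velocity, t)
right = geodesic_point(start, (velocity[0] / c, velocity[1] / c), c * t)
print(left, right)
```

The two printed points agree up to rounding error, which is what (3.7) asserts: multiplying the parameter by c and dividing the initial derivative by c leaves the point unchanged.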


2. 1 ih fa 9 ) a 
10 jit . | 3 | = T (jae + a of), if we use the customary notation. Cf. 


Eisenhart, p. 50, equation (17.8) and the proof following that equation. In the proof, in
place of covariant derivatives with respect to $x^k$ multiplied by $dx^k/ds$ and summed for k,
absolute derivatives with respect to s should be used, as the functions differentiated are
not functions of the x's. Cf. our footnote 17.

11 See G. A. Bliss, Solutions of differential equations of the first order as functions of their
initial values, Annals of Math., (2), vol. 6 (1905), pp. 49-68 (theorem on p. 67); or Bolza,
Variationsrechnung, 1909, p. 178.

12 The work is carried this far in Duschek-Mayer, Lehrbuch der Differentialgeometrie,
vol. 2, pp. 93-9, but it is not completed there. Cf. footnote 2 on page 95 there. We
shall refer to this volume as Mayer. (Vol. 2 is by W. Mayer.) See also G. A. Bliss and
Max Mason, Fields of extremals in space, Trans. Amer. Math. Soc., vol. 11 (1910), pp.
325-340.



















Now place the restriction $|t| < \eta = \epsilon/2$. For any such $t \neq 0$, if $c = \epsilon/2t$,
all the arguments in (3.7) except the a's will be numerically less than $\epsilon$ provided
$|(dx_i/dt)_0| < \epsilon$. Hence
(3.8) $\phi_i[t, 0, a, (dx/dt)_0] = \phi_i[\epsilon/2, 0, a, 2t(dx/dt)_0/\epsilon]$
$\quad \equiv \psi_i[t(dx_1/dt)_0, \dots, t(dx_n/dt)_0]$
for $0 < |t| < \eta$, $|(dx_i/dt)_0| < \eta$. When t = 0 the left-hand member of (3.8)
is $a_i$. The second member is also $a_i$, by (3.4), when t = 0. Hence (3.8)
holds also when t = 0, and thus the solutions of (3.1) are given by

(3.9) $x_i = \psi_i[t(dx_1/dt)_0, \dots, t(dx_n/dt)_0]$ (i = 1, ..., n)

with $\psi_i$ of class $C^{k-1}$ for $|t| < \eta$, $|(dx_i/dt)_0| < \eta$.
Now differentiate each side of (3.9) with respect to t and set t = 0. Denoting
by $y_1, \dots, y_n$ the arguments of $\psi_i$, we have

(3.10) $(dx_i/dt)_0 = [\partial\psi_i(0)/\partial y_j]\,(dx_j/dt)_0$ (i = 1, ..., n).

Since (3.10) are identities in the $(dx_j/dt)_0$'s, we infer

(3.11) $\partial\psi_i(0)/\partial y_j = \delta_{ij}$ (i, j = 1, ..., n).
Now consider the transformation

(3.12) $x_i = \psi_i(y_1, \dots, y_n)$ (i = 1, ..., n).

From (3.11) we see that, for some $\zeta < \eta\cdot\eta = \eta^2$, $\zeta > 0$, this transformation is
one-to-one for $|y_i| < \zeta$. Since the $\psi_i$ are of class $C^{k-1}$, the new fundamental
form $\bar g_{ij}\,dy_i\,dy_j$ will have coefficients of class $C^{k-2}$; and since k − 2 ≥ 2, the
geodesics are uniquely determined in the new coördinate system.

Now consider the equations

(3.13) $y_i = \lambda^i t$ (i = 1, ..., n),

where $\lambda^1, \dots, \lambda^n$ are (real) constants with $|\lambda^i| < \eta$ and $0 < \lambda^i\lambda^i$. The curve
(3.13) is given in the (x)-system by

(3.14) $x_i = \psi_i(\lambda^1 t, \dots, \lambda^n t)$, $|t| < \eta$.

Hence, comparing with (3.8), we see that it is the geodesic through (a) in
direction $(\lambda^1, \dots, \lambda^n)$. We now see that the restriction $|\lambda^i| < \eta$ can be dropped,
provided that for each set (λ) we restrict t sufficiently. Thus all the geodesics
through P are the curves given in the new coördinate system by equations of
the form (3.13) with $0 < \lambda^i\lambda^i$. It follows that the new system is Riemannian,
with origin at P, and the proof is complete.


4. Hypersurfaces in a Riemannian space. Here we take the case of a surface 
of dimension one less than that of the space. 










DEFINITION. The locus of a set of simultaneous equations
$\phi_i(x) \equiv \phi_i(x_1, \dots, x_m) = 0$ (i = 1, ..., k; k < m)

is called a regular (m − k)-spread of class $C^r$ neighboring a point $(x_1^0, \dots, x_m^0)$
in a Euclidean or Riemannian space with coördinates (x), if $(x^0)$ lies on the
locus, the functions $\phi_i$ are of class $C^r$ near $(x^0)$, r ≥ 1, and the matrix of the
first partial derivatives is of rank k at $(x^0)$.

THEOREM 4.1. If P is a point on a regular n-spread S, n > 1, of class $C^2$
in a Riemannian (n + 1)-space R with positive definite fundamental form having
coefficients of class $C^4$, the normal curvatures¹³ of curves on S through P, as functions
of parameters determining their directions, have at most n critical values. When,
as is in general the case, the number is n, the values are taken on in n mutually
perpendicular directions;¹⁴ if the critical values are ordered according to their
algebraic values, then the i-th is at a critical point of index i − 1.

Proof. Using Theorem 3.1, we introduce Riemannian normal coördinates¹⁵
with origin at P in such a way that the surface $x_{n+1} = 0$ is tangent at P to S.
Then, if we denote $x_{n+1} = z$, S is given by

(4.1) $z = f(x_1, \dots, x_n)$, f of class $C^2$,
with
(4.2) $0 = \frac{\partial f}{\partial x_1}(0, \dots, 0) = \cdots = \frac{\partial f}{\partial x_n}(0, \dots, 0)$.

Now let any unit vector (λ) be given at P, so that

(4.3) $\lambda^i\lambda^i = 1$.

Let

(4.4) $x_i = \phi_i(s)$ (i = 1, ..., n), $z = f[\phi_1(s), \dots, \phi_n(s)]$

be a curve, with s the arc length, s = 0 at P, such that


13 For a curve on S through P with non-zero first curvature, the vector in the direction
of the principal normal and of length equal to the absolute value of the curvature will
be called the curvature vector. If the curve has first curvature zero at P, the curvature
vector is the vector with all components zero. The normal curvature of a curve through
P on S is the projection on the normal direction to S at P, with a sense assigned to the
latter, of the curvature vector. All regular curves of class $C^2$ on S tangent to a given
curve on S at P have the same curvature vector. Cf. Eisenhart, p. 151. See also our
Theorem 5.1 and the definition preceding it. For a curve whose curvature vector is
orthogonal to S, the normal curvature is plus or minus the absolute value of the actual
first curvature.

14 This is well known. Cf. Eisenhart, p. 153.

15 Riemannian normal coördinates are Riemannian coördinates such that $g_{ij} = \delta_{ij}$ at
the origin (Eisenhart, p. 55). They are easily obtained from any Riemannian coördinates,
when the fundamental form is positive definite, by a linear transformation. The
notation in Mayer is slightly different.



















(4.5) $\phi_i'(0) = \lambda^i$ (i = 1, ..., n).¹⁶
Since we have normal coördinates, for curve (4.4) at P we have

$\frac{\delta}{\delta s}\frac{dx_\alpha}{ds} = \frac{d}{ds}\frac{dx_\alpha}{ds} = \frac{d^2 x_\alpha}{ds^2}$ (α = 1, ..., n + 1).¹⁷

Hence the curvature vector of the curve at P has its α-th component¹⁸ $d^2x_\alpha/ds^2$
(α = 1, ..., n + 1). Hence the normal curvature, projection of the curvature
vector on the direction of the positive z-axis, is $d^2x_{n+1}/ds^2 = d^2z/ds^2$. Now

$\frac{dz}{ds} = \frac{\partial f}{\partial x_i}\frac{d\phi_i}{ds}$,

and therefore at P the normal curvature

(4.6) $K = \frac{d^2 z}{ds^2} = \frac{\partial^2 f(0)}{\partial x_i\,\partial x_j}\frac{d\phi_i}{ds}\frac{d\phi_j}{ds} = \frac{\partial^2 f(0)}{\partial x_i\,\partial x_j}\lambda^i\lambda^j$

because of (4.2). Hence

(4.7) $K = a_{ij}\lambda^i\lambda^j$, $\quad a_{ij} = \frac{\partial^2 f(0)}{\partial x_i\,\partial x_j}$.

Theorem 4.1 now follows from Theorem 2.1.
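
For n = 2 the content of (4.7) is the classical statement about the two principal curvatures, and it can be illustrated numerically. The sketch below is not from the paper: the function names are hypothetical, and the surface z = x² + xy + 2y² (so that a₁₁ = 2, a₁₂ = 1, a₂₂ = 4) is an assumed example.

```python
import math


def normal_curvature(a11, a12, a22, angle):
    """K = a_ij lambda^i lambda^j of (4.7) for the unit vector
    (cos(angle), sin(angle)), in the case n = 2."""
    l1, l2 = math.cos(angle), math.sin(angle)
    return a11 * l1 * l1 + 2.0 * a12 * l1 * l2 + a22 * l2 * l2


def critical_curvatures(a11, a12, a22):
    """The two critical values of K: the characteristic roots of the
    symmetric matrix (a_ij), in closed form for the 2 x 2 case."""
    mean = 0.5 * (a11 + a22)
    radius = math.hypot(0.5 * (a11 - a22), a12)
    return mean - radius, mean + radius


# Assumed example: f(x, y) = x^2 + x*y + 2*y^2, second derivatives at the origin.
a11, a12, a22 = 2.0, 1.0, 4.0
k_min, k_max = critical_curvatures(a11, a12, a22)
samples = [normal_curvature(a11, a12, a22, 2.0 * math.pi * i / 360)
           for i in range(360)]
print(k_min, k_max, min(samples), max(samples))
```

Every sampled normal curvature lies between the two critical values, the smaller being a minimum (index 0) and the larger a maximum (index 1).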

We observe that, as a special case, the (n + 1)-space may be Euclidean. 


5. Subspaces of a Riemannian space. In this section we take the case of an
n-dimensional surface S in a Riemannian m-space, m > n, dropping the
restriction that m = n + 1. The idea, used here, of projecting the curvature vector
on a direction orthogonal to S is found in Mayer,¹⁹ used in another connection.

Let a Riemannian m-space R be given, with positive definite fundamental
form having coefficients of class $C^4$, the variables being $y_1, \dots, y_m$. Let S
be a regular n-spread of class $C^2$ neighboring a point P of R. Then S is itself
a Riemannian n-space, whose fundamental form has coefficients of class $C^1$.²⁰


16 For example, we could take the curve $x_i = \lambda^i t$ (i = 1, ..., n), $z = f(\lambda^1 t, \dots, \lambda^n t)$.
Then at P the quantities $dx_\alpha/dt$ (α = 1, ..., n + 1) are $(\lambda^1, \dots, \lambda^n, 0)$. If now we change
the parameter to arc length s, measured in the direction in which t increases, the quantities
$dx_\alpha/ds$ are proportional to $dx_\alpha/dt$, and also have the sum of their squares equal to 1 at P;
hence they are also $(\lambda^1, \dots, \lambda^n, 0)$. Hence $dx_i/ds = \lambda^i$ (i = 1, ..., n), as was to be proved.

17 The letter δ indicates absolute (covariant) differentiation. Cf. Mayer, p. 31 ff. At
the origin in Riemannian coördinates, absolute first derivatives equal the usual first
derivatives. Cf. Eisenhart, p. 56, for derivatives with respect to the space coördinates.
For derivatives to any parameter (e.g., as s above), the result follows immediately from
the fact that at the origin the first partial derivatives of the coefficients of the fundamental
form are all zero (Eisenhart, p. 55; Mayer, p. 117).

18 Cf. Eisenhart, p. 61; Mayer, pp. 59-62.

19 Pp. 159-160.

20 If we want S to be sufficiently regular so that the geodesics on S exist and are unique,
we can demand that S be of class $C^3$. Its fundamental form will then have coefficients
of class $C^2$.










We begin by establishing a property of curvature, following the next defi- 
nition. 

DEFINITION. The projection of the curvature vector of a curve in a direction
normal to the curve at a point on it will be called the normal curvature of the
curve for the given direction.

THEOREM 5.1. All regular curves of class $C^2$ on S tangent to a given curve of S
at a fixed point P have the same normal curvature for any direction normal to S.

Proof. Introduce Riemannian normal coördinates in R so that the locus
$0 = y_{n+1} = \cdots = y_m$ becomes tangent to S at the origin P. As the new
coördinates are obtained by a transformation using functions of class $C^3$, the
locus S is now given by

(5.1) $y_k = f_k(y_1, \dots, y_n)$, $f_k$ of class $C^2$, (k = n + 1, ..., m),
where all the first partial derivatives are zero at (0, ..., 0). Let

(5.2) $y_i = p_i(s)$ (i = 1, ..., n),
$\quad y_k = f_k[p_1(s), \dots, p_n(s)]$ (k = n + 1, ..., m)

be one of the curves in question, with s the arc length. It is easily verified that
the p's must be of class $C^2$.

Since we have normal coördinates, as in §4 the curvature vector has α-th
component $d^2y_\alpha/ds^2$ (α = 1, ..., m). Hence its projection on the direction
(0, ..., 0, 1) is $d^2y_m/ds^2$. Now

$\frac{dy_m}{ds} = \frac{\partial f_m}{\partial y_i}\frac{dp_i}{ds}$

and therefore, at P,
(5.3) $\frac{d^2 y_m}{ds^2} = \sum_{i,j=1}^{n} \frac{\partial^2 f_m}{\partial y_i\,\partial y_j}\frac{dp_i}{ds}\frac{dp_j}{ds}$,

since $\partial f_m/\partial y_i = 0$ at P. As this answer depends only on the direction of the
curve (5.2), and since any direction normal to S can be made the direction
(0, ..., 0, 1), we infer the validity of the theorem.

We now return to the principal question. 

THEOREM 5.2. Let P be a point on a regular n-spread S of class $C^2$ in a
Riemannian m-space, 1 < n < m, with positive definite fundamental form having
coefficients of class $C^4$. If normal curvatures are taken at P for any given normal
direction to S at P, the conclusion of Theorem 4.1 holds.

Proof. Choose coördinates as above, making the given direction the direction
(0, ..., 0, 1). Since (5.3) is the same kind of result as (4.6), the brief proof
is the same as that following (4.6). Hence the theorem is true.

Remark 1. All regular curves on S tangent to a given curve on S and with
curvature vectors orthogonal to S or having all components zero have the
same curvature vector.



















This is a known property,²¹ which we state for convenience in reference. It
also follows easily from Theorem 5.1.²²

Remark 2. If, under the hypotheses of Theorem 5.2, S being supposed
of class $C^3$, all the geodesics of S through P have their curvature vectors
confined to two opposite directions when non-null, then, in Theorem 5.2, the
normal curvatures, for either of those directions, of all curves whose curvature
vectors are orthogonal to S are plus or minus the absolute values of the
curvatures.

This follows from Remark 1 and the fact that when a geodesic has a non-null
curvature vector, the latter is orthogonal to S.²³


6. The Ricci mean curvature. The sum of the n − 1 Riemannian curvatures
determined by a direction $(\lambda^1, \dots, \lambda^n)$ in a Riemannian n-space R and each
of n − 1 other vectors such that all n are mutually orthogonal is called the
mean curvature, ρ, of the space for the given direction, and is given by

(6.1) $\rho = -\frac{R_{ij}\lambda^i\lambda^j}{g_{ij}\lambda^i\lambda^j}$,

where $R_{ij}$ and $g_{ij}$ are components of the Ricci tensor and the fundamental
tensor respectively.²⁴ If, as in §4, we introduce Riemannian normal coördinates
with origin at a given point P where we are considering (6.1), and take (λ)
as a unit vector, at P (6.1) reduces to

(6.2) $\rho = -R_{ij}\lambda^i\lambda^j$.

We can now apply Theorem 2.1. This gives us the following

THEOREM 6.1. If P is a point in a Riemannian n-space with positive definite
fundamental form having coefficients of class $C^4$, the mean curvatures at P, as
functions of parameters determining the direction, satisfy the conclusion of
Theorem 4.1.


Appendix: The critical diameters of central quadrics²⁵


7. Central quadrics. The locus S of a second degree equation in Euclidean
$(x_1, \dots, x_n)$-space will be called a central quadric if it is not vacuous and is
symmetric in a point P not on it, called a center. For example, if n = 2 the
central quadrics (now conics) are of the following types: circle, ellipse,
hyperbola, pair of parallel straight lines. In general a quadric surface is symmetric
and has only one center. In any case, a particular center will be chosen and
called the center.

21 Cf. Mayer, pp. 158-159.

22 By Theorem 5.1, the projections on the $y_{n+1}, \dots, y_m$ directions are the same for all
the curves, and as the projections on the $y_1, \dots, y_n$ directions are zero, the vectors are
all the same.

23 Eisenhart, p. 75; Mayer, p. 159.

24 Cf. Eisenhart, p. 113.

25 This part was originally presented to the Society under its own title.










In the earlier part of this paper we had occasion to consider the value of an
arbitrary real quadratic form $a_{ij}x_ix_j$ on the locus $x_ix_i = 1$. This suggests
considering the value of $x_ix_i$ on the locus $a_{ij}x_ix_j = 1$. Our principal result is that
if for a central quadric in n-space the lengths of the diameters, as functions of
parameters determining the direction, have critical points in only a finite
number of directions, the i-th in magnitude is at a non-degenerate critical
point of index i − 1.


8. Preliminaries. If the center is at the origin, $x_ix_i$ is one-fourth the square
of the diameter with end points (x) and (−x). As we prefer to consider the
diameter itself, we begin with the following lemma.

LEMMA. Suppose $f(x_1, \dots, x_n)$ is of class $C^2$ neighboring $(x^0)$ in real n-space,
$g(x) \equiv [f(x)]^2$, and $f(x^0) > 0$. Then if either f or g has a non-degenerate critical
point at $(x^0)$, the other has one of the same index.

Proof. Since $\partial g/\partial x_i = (2f)(\partial f/\partial x_i)$, if either has a critical point so has the
other. Differentiating again and setting $x_i = x_i^0$ (i = 1, ..., n) we have

$\frac{\partial^2 g(x^0)}{\partial x_i\,\partial x_j} = 2f(x^0)\,\frac{\partial^2 f(x^0)}{\partial x_i\,\partial x_j}$,

since $\partial f(x^0)/\partial x_i = 0$. The conclusion now follows easily from the definition of
index.²⁶ We shall apply the lemma with n replaced by n − 1.
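
The relation between the second derivatives of f and of g = f² at a critical point can be checked by finite differences. The sketch below is an illustration under assumptions of our own: the function f(x, y) = 2 + x² − 3y², the step size, and the helper name `hessian` are not taken from the paper.

```python
def hessian(func, point, h=1e-4):
    """Central-difference approximation to the matrix of second partial
    derivatives of `func` at `point` (a list of floats)."""
    n = len(point)
    result = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            def at(di, dj):
                p = list(point)
                p[i] += di * h
                p[j] += dj * h
                return func(p)
            result[i][j] = (at(1, 1) - at(1, -1) - at(-1, 1) + at(-1, -1)) / (4.0 * h * h)
    return result


def f(p):
    # Assumed example: critical point at the origin, f(0) = 2 > 0.
    return 2.0 + p[0] ** 2 - 3.0 * p[1] ** 2


def g(p):
    return f(p) ** 2


origin = [0.0, 0.0]
hess_f = hessian(f, origin)
hess_g = hessian(g, origin)
scale = 2.0 * f(origin)
print(hess_f, hess_g, scale)
```

Each entry of the second matrix is, up to discretization error, the positive number 2f(x⁰) times the corresponding entry of the first, so the two quadratic forms have the same number of negative squares and hence the same index.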

Now let us consider the principal question. Suppose a central quadric S is
given in n-space, n > 1, and we make a translation of axes so that the center
(or a center, which will be designated hereafter as the center) becomes the
origin. The new equation has no first degree terms and as in §2 we can make
an orthogonal transformation,²⁷ giving us the new equation $c_ix_i^2 = d$. Since
the origin is not on S, d ≠ 0 and can be made unity. We can easily arrange
that S has equation
(8.1) $a_1x_1^2 + \cdots + a_kx_k^2 - b_{k+1}x_{k+1}^2 - \cdots - b_nx_n^2 = 1$
with the a's all positive and the b's positive or zero. Here k ≥ 1 since S is not
vacuous. Note that $c_1 = a_1, \dots, c_k = a_k$, $c_{k+1} = -b_{k+1}, \dots, c_n = -b_n$.

We now seek the critical points of

(8.2) $z = x_ix_i$

on the locus (8.1), as a function of n − 1 parameters. A line through any
point on (8.1) directed towards the origin is easily seen not to be orthogonal
to (8.1). Using this fact we easily prove that, as in a similar situation in §2,
it is sufficient to find the critical points located on (8.1) of the function

(8.3) $w = (x_ix_i)/(c_jx_j^2)$


26 If $[\partial^2 f(x^0)/\partial x_i\,\partial x_j]\,x_ix_j = -y_1^2 - \cdots - y_k^2 + y_{k+1}^2 + \cdots + y_n^2$ under a non-singular
linear transformation, the index is k.

27 The determinant of the coefficients can be made +1 if we like, so that the
transformation is a "rigid motion".



















of n independent variables. Upon differentiation we find that they are the
points satisfying (8.1) and

(8.4) $x_s = (x_ix_i)\,c_sx_s$ (s = 1, ..., n; do not sum s).

Since $x_ix_i \neq 0$ on (8.1), at any critical point at least one $x_s \neq 0$, hence by
(8.2) and (8.4) $z = 1/c_s$. Since $c_{k+1}, \dots, c_n$ are not positive, the number of
distinct critical values of z is seen to be at most k, hence finite.


9. The theorem. A diameter in a direction where the length has a critical 
point will be called a critical diameter. 

THEOREM. If a central quadric in n-space, n > 1, has only a finite number of
critical diameters, they are mutually orthogonal and the i-th in length is at a
non-degenerate critical point of index i − 1.

Proof. Suppose, in (8.1), $a_1 = a_2$. Then any point $(x_1, x_2, 0, \dots, 0)$ such
that $x_1^2 + x_2^2 = 1/a_1 = 1/a_2 = 1/c_1 = 1/c_2$ would satisfy (8.1) and (8.4), hence
determine a critical diameter. The number of critical diameters would then
be infinite, contrary to hypothesis. We infer that the a's are all distinct.

In §8 we saw that if $x_s \neq 0$ at a critical point, $z = 1/c_s$ at the point. Hence
s ≤ k, and since $c_1, \dots, c_k$ are all distinct, at most one x is different from zero.
Hence the critical diameters are mutually orthogonal. Since the x's cannot
be all zero, we infer that there are just k critical diameters, one along each of
the first k coördinate axes.

Consider the solutions of (8.1), (8.4) with $x_1 \neq 0$. They are

$(\pm 1/(a_1)^{1/2}, 0, \dots, 0)$.

We choose either sign. Neighboring the point chosen, $x_2, \dots, x_n$ can be taken
as independent variables for (8.1). Substituting from (8.1) into (8.2) we obtain

$z = \frac{1}{a_1} + \left(1 - \frac{a_2}{a_1}\right)x_2^2 + \cdots + \left(1 - \frac{a_k}{a_1}\right)x_k^2 + \left(1 + \frac{b_{k+1}}{a_1}\right)x_{k+1}^2 + \cdots + \left(1 + \frac{b_n}{a_1}\right)x_n^2$.

Since none of the coefficients is zero, the critical point is non-degenerate and of
index equal to the number of a's larger than $a_1$, hence to the number of (1/a)'s
smaller than $(1/a_1)$. The critical value is $1/a_1$. Since a similar result is
obtained with $x_1$ replaced by each of $x_2, \dots, x_k$, we infer the truth of the
theorem.
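
The count of indices in the proof can be sketched in a few lines. The following is an illustration only: the function name `critical_diameters` is hypothetical, the quadric is assumed to be already in the form (8.1), and only the positive coefficients a₁, ..., a_k are needed, since the b's contribute positive coefficients in the expansion above.

```python
import math


def critical_diameters(a):
    """For a quadric (8.1) with distinct positive coefficients a_1..a_k,
    return the pairs (diameter length, index), ordered by length.

    The critical value of z = x_i x_i along the s-th axis is 1/a_s, so the
    diameter has length 2/sqrt(a_s); the index is the number of a's larger
    than a_s.
    """
    if min(a) <= 0 or len(set(a)) != len(a):
        raise ValueError("the a's must be positive and distinct")
    pairs = []
    for a_s in a:
        length = 2.0 / math.sqrt(a_s)
        index = sum(1 for a_j in a if a_j > a_s)
        pairs.append((length, index))
    return sorted(pairs)


print(critical_diameters([1.0, 4.0, 0.25]))
```

Ordered by length, the i-th critical diameter carries index i − 1, as the theorem states.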

Remark. If S is a central quadric not a surface of revolution, the hypotheses
of the theorem are satisfied. For we have seen that the distinctness of the a's
implies that the critical diameters are mutually orthogonal, hence finite in
number.

On the other hand, a surface of revolution may satisfy the hypotheses of the
theorem, for example, any surface for which the a's are distinct but two of the
b's are equal.


COLUMBIA UNIVERSITY.











THE CHARACTERISTIC ROOTS OF A MATRIX 
By W. V. PARKER


1. Introduction. If A is a square matrix of order n and I is the unit matrix,
the equation obtained by equating to zero the determinant |A − λI| is called
the characteristic equation of A. The roots of this equation are called the
characteristic roots of A.

If A is a matrix of a particular type, certain definite statements may be made 
concerning the nature of its characteristic roots. For example, if A is Hermitian 
its characteristic roots are all real. While it is not possible to make any definite 
statement about the nature of the characteristic roots for the general matrix, 
several authors have given upper limits to the roots. The first of these limits 
seems to have been given by Bendixson¹ in 1900. He obtained upper limits
for the real and imaginary parts of the characteristic roots of a real matrix.
In a letter to Bendixson in 1902, Hirsch² extended these results to include the
case when the elements of A may be complex numbers. Hirsch obtained an
upper limit for the characteristic roots as well as for their real and imaginary
parts. A limit was also given by Bromwich³ in 1904. In 1930, Browne⁴
obtained limits which do not exceed those previously found and are in general less.

In the present note it is shown that the limit for the characteristic roots can 
generally be determined to be less than the one given by Browne. A lower 
limit for the characteristic root of greatest absolute value and an upper limit 
for the characteristic root of least absolute value for an Hermitian matrix are 
also found. 

Let A' and $\bar A$ denote the transpose and conjugate, respectively, of the square
matrix A and write

$B = \frac{A + \bar A'}{2}, \quad C = \frac{A - \bar A'}{2i}$.

It is evident that B and C are Hermitian; that is, $B = \bar B'$ and $C = \bar C'$. A
theorem given by Browne may be stated as follows:

BROWNE'S THEOREM. If $R_i$, $R_i'$, and $R_i''$ are the sums of the absolute values
of the elements in the i-th row of the matrices A, B, and C, respectively, and if $T_i$

Received March 10, 1937. 

1 Bendixson, Sur les racines d'une équation fondamentale, Acta Mathematica, vol. 25
(1902), pp. 359-365.

2 Hirsch, Acta Mathematica, vol. 25 (1902), pp. 367-370.

3 Bromwich, On the roots of the characteristic equation of a linear substitution, Acta
Mathematica, vol. 30 (1906), pp. 295-304.

4 Browne, The characteristic roots of a matrix, Bulletin of the American Mathematical
Society, vol. 36 (1930), pp. 705-710.

















is the sum of the absolute values of the elements in the i-th column of A, and if
R, R', R'', and T are the greatest of the $R_i$, $R_i'$, $R_i''$, and $T_i$, respectively, then for
any characteristic root, λ = α + iβ, of A we have

$|\lambda| \le \frac{R + T}{2}, \quad |\alpha| \le R', \quad |\beta| \le R''$.


2. An upper limit to the characteristic roots of A. If λ = α + iβ is a
characteristic root of a matrix $A = (a_{rs})$ of order n, there exists a set of numbers
$(x_1, x_2, \dots, x_n)$ such that $\sum_{r=1}^{n} x_r\bar x_r = 1$, which satisfy the relations

(1) $\lambda x_r = \sum_{s=1}^{n} a_{rs}x_s$ (r = 1, 2, ..., n).

If we multiply the r-th equation in (1) by $\bar x_r$ and sum as to r, we get

(2) $\lambda = \sum_{r,s=1}^{n} a_{rs}\bar x_r x_s$.

If in (2) we take conjugates on both sides and interchange the subscripts, we get

(3) $\bar\lambda = \sum_{r,s=1}^{n} \bar a_{sr}\bar x_r x_s$.

From (2) and (3) by addition and subtraction we get

(4) $\alpha = \sum_{r,s=1}^{n} b_{rs}\bar x_r x_s$,

(5) $\beta = \sum_{r,s=1}^{n} c_{rs}\bar x_r x_s$.

From the relations (2), (4) and (5) we determine upper limits for |λ|, |α| and
|β|. Since these relations are identical in form, it is sufficient to carry the
computation through for one of them. We shall write

$R_r = \sum_{s=1}^{n} |a_{rs}|, \quad T_s = \sum_{r=1}^{n} |a_{rs}|, \quad 2S_r = R_r + T_r$,

and let $\xi_r = |x_r|$, so that $\sum_{r=1}^{n} \xi_r^2 = 1$ and $\xi_r\xi_s \le \tfrac{1}{2}(\xi_r^2 + \xi_s^2)$. If we take absolute
values in (2), we get

$|\lambda| \le \sum_{r,s=1}^{n} |a_{rs}|\,\xi_r\xi_s \le \tfrac{1}{2}\sum_{r,s=1}^{n} |a_{rs}|(\xi_r^2 + \xi_s^2) = \tfrac{1}{2}\sum_{r=1}^{n} R_r\xi_r^2 + \tfrac{1}{2}\sum_{s=1}^{n} T_s\xi_s^2$
$\quad = \sum_{r=1}^{n} S_r\xi_r^2 \le S\sum_{r=1}^{n} \xi_r^2 = S$,

where S is the greatest of the $S_r$. Similar inequalities hold for |α| and |β|
and we have the following theorem.










THEOREM 1. If A is any square matrix, and if $2S_r$, $2S_r'$, $2S_r''$ are the sums of
the absolute values of the elements in the r-th row and the absolute values of the
elements in the r-th column of A, B, and C, respectively, and if S, S', S'' are the
greatest of the $S_r$, $S_r'$, $S_r''$, respectively, then for any characteristic root, λ = α + iβ,
of A we have

$|\lambda| \le S, \quad |\alpha| \le S', \quad |\beta| \le S''$.

The latter two limits are those given by Browne but the limit for |λ| is
generally less than his.
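
The comparison between the limit S of Theorem 1 and Browne's limit (R + T)/2 can be illustrated on a small matrix. The sketch below is not from the paper: the function names and the 2 × 2 example are our own assumptions, and the closed-form characteristic roots are valid only for 2 × 2 matrices.

```python
import cmath


def row_sums(matrix):
    """R_r: sum of absolute values in each row."""
    return [sum(abs(x) for x in row) for row in matrix]


def column_sums(matrix):
    """T_s: sum of absolute values in each column."""
    n = len(matrix)
    return [sum(abs(matrix[r][s]) for r in range(n)) for s in range(n)]


def browne_bound(matrix):
    """(R + T)/2 with R, T the greatest row and column sums."""
    return 0.5 * (max(row_sums(matrix)) + max(column_sums(matrix)))


def parker_bound(matrix):
    """S = greatest of S_r = (R_r + T_r)/2, as in Theorem 1."""
    return max(0.5 * (r + t) for r, t in zip(row_sums(matrix), column_sums(matrix)))


def characteristic_roots_2x2(matrix):
    """Roots of the characteristic equation of a 2 x 2 matrix."""
    trace = matrix[0][0] + matrix[1][1]
    det = matrix[0][0] * matrix[1][1] - matrix[0][1] * matrix[1][0]
    disc = cmath.sqrt(trace * trace - 4.0 * det)
    return (trace + disc) / 2.0, (trace - disc) / 2.0


A = [[1.0, 3.0], [0.0, 2.0]]   # assumed example
roots = characteristic_roots_2x2(A)
print(parker_bound(A), browne_bound(A), [abs(root) for root in roots])
```

For this matrix the greatest row sum and the greatest column sum occur at different indices, so S is strictly smaller than (R + T)/2, while both limits still dominate the absolute values of the roots.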


3. The characteristic roots of $A\bar A'$. Browne⁵ has shown that if λ is a
characteristic root of a square matrix A and if G is the greatest and s the smallest of
the (real and not negative) characteristic roots of $A\bar A'$ then $0 \le s \le \lambda\bar\lambda \le G$.

Let $U_r$ be a square matrix of order n whose elements are determined by the
following conditions. The elements of the r-th column (r a definite number) are

$\bar a_{r1}/\sigma_r^{1/2},\ \bar a_{r2}/\sigma_r^{1/2},\ \dots,\ \bar a_{rn}/\sigma_r^{1/2}$,

where
(6) $\sigma_r = \sum_{t=1}^{n} a_{rt}\bar a_{rt} > 0$.⁶

The elements of the j-th column (j ≠ r) are

$x_{1j}, x_{2j}, \dots, x_{nj}$,
where

(7) $\sum_{t=1}^{n} a_{rt}x_{tj} = 0$ (j ≠ r), $\quad \sum_{t=1}^{n} x_{ti}\bar x_{tj} = \delta_{ij}$ (i, j ≠ r).

The matrix $U_r$ thus determined is unitary, that is, $\bar U_r' U_r = I$. If $P_r = AU_r$,
the elements of the r-th row of $P_r$ are

$p_{rj} = \sum_{t=1}^{n} a_{rt}x_{tj} = 0$ (j ≠ r),

$p_{rr} = \sum_{t=1}^{n} a_{rt}\bar a_{rt}/\sigma_r^{1/2} = \sigma_r^{1/2}$.

It is evident, therefore, that $\sigma_r^{1/2}$ is a characteristic root of the matrix $P_r = AU_r$.


5 Browne, The characteristic equation of a matrix, Bulletin of the American Mathematical
Society, vol. 34 (1928), pp. 363-368.

6 It is assumed here that not all elements of the r-th row of A are zero. If all elements
of any one row are zero, then zero is a characteristic root of A and the theorems as stated
here are still true.



















CHARACTERISTIC ROOTS OF A MATRIX 487 


In a similar way we may construct a unitary matrix $V_r$ such that $Q_r = V_rA$ has the characteristic root $\tau_r^{1/2}$, where

$$\tau_r = \sum_{t=1}^{n} a_{tr}\bar{a}_{tr}. \tag{8}$$

By taking products we see that

$$P_r\bar{P}_r' = AU_r\bar{U}_r'\bar{A}' = A\bar{A}', \qquad \bar{Q}_r'Q_r = \bar{A}'\bar{V}_r'V_rA = \bar{A}'A.$$

Hence the characteristic roots of $P_r\bar{P}_r'$ and $\bar{Q}_r'Q_r$ are identical with those of $A\bar{A}'$. These roots are real and not negative. If $G$ is the greatest and $s$ the smallest of these roots, we have, as shown by Browne,

$$s \le \sigma_r \le G \quad \text{and} \quad s \le \tau_r \le G.$$

Hence we have the following theorem.

THEOREM 2. If $\sigma_r$ ($\tau_r$) is the sum of the squares of the absolute values of the elements of the $r$-th row (column) of a square matrix $A$, and if $\sigma$ ($\tau$) is the greatest and $\sigma'$ ($\tau'$) the smallest of the $\sigma_r$ ($\tau_r$), then $A\bar{A}'$ has a real positive (or zero) characteristic root not less than the greater of $\sigma$ and $\tau$ and a real positive (or zero) characteristic root not greater than the smaller of $\sigma'$ and $\tau'$.

In particular, if $A$ is Hermitian, its characteristic roots are all real and the characteristic roots of $A\bar{A}'$ are the squares of the characteristic roots of $A$. If the characteristic roots of $A\bar{A}'$ are $\lambda_1^2 \ge \lambda_2^2 \ge \cdots \ge \lambda_n^2$, we have

$$\lambda_1^2 \ge \sigma \ge \sigma' \ge \lambda_n^2,$$

and hence

$$|\lambda_1| \ge \sigma^{1/2} \ge (\sigma')^{1/2} \ge |\lambda_n|.$$

Therefore we have the following theorem.

THEOREM 3. If $\sigma_r$ is the sum of the squares of the absolute values of the elements of the $r$-th row of a Hermitian matrix $A$, and if $\sigma$ is the greatest and $\sigma'$ the least of the $\sigma_r$, then $A$ has at least one characteristic root whose absolute value does not exceed $(\sigma')^{1/2}$ and at least one characteristic root whose absolute value is not less than $\sigma^{1/2}$.
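Theorem 3 can be checked by hand on a $2 \times 2$ Hermitian matrix, whose characteristic roots are available in closed form. The Python sketch below does this for one matrix; the entries and the helper name `hermitian_roots` are assumptions chosen only for illustration.

```python
import math

def hermitian_roots(a, b, d):
    """Real characteristic roots of [[a, b], [conj(b), d]] with a, d real."""
    mid = (a + d) / 2
    rad = math.sqrt(((a - d) / 2) ** 2 + abs(b) ** 2)
    return mid - rad, mid + rad

# Hypothetical Hermitian matrix [[2, 1-i], [1+i, -3]].
a, b, d = 2.0, 1 - 1j, -3.0
roots = hermitian_roots(a, b, d)

# sigma_r: sums of squared absolute values along the rows.
row_sums = [a * a + abs(b) ** 2, abs(b) ** 2 + d * d]
sigma, sigma_prime = max(row_sums), min(row_sums)

smallest = min(abs(z) for z in roots)
largest = max(abs(z) for z in roots)
print(smallest, math.sqrt(sigma_prime), math.sqrt(sigma), largest)
```

Here $\sigma' = 6$ and $\sigma = 11$, and the two roots have absolute values lying just inside $6^{1/2}$ and just outside $11^{1/2}$, as the theorem requires.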


4. A lower limit for the characteristic roots of $A$. A matrix has the characteristic root zero if and only if it is singular. It is well known that if $\lambda$ is a characteristic root of a non-singular matrix $A$, then $1/\lambda$ is a characteristic root of $A^{-1}$, the inverse of $A$. If $|1/\lambda| \le L$, then $|\lambda| \ge 1/L$ and hence an upper limit for the characteristic roots of $A^{-1}$ determines a lower limit for the characteristic roots of $A$.
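The remark of this section can be sketched numerically: take the upper limit $L$ to be the quantity $S$ of Theorem 1 computed for $A^{-1}$, and compare $1/L$ with the smallest modulus of a characteristic root of $A$. The matrix and all function names below are illustrative assumptions.

```python
import cmath

def half_row_column_bound(rows):
    """S = max_r (R_r + T_r)/2 for the absolute row sums R_r and column sums T_r."""
    n = len(rows)
    return max(
        (sum(abs(v) for v in rows[r]) + sum(abs(rows[s][r]) for s in range(n))) / 2
        for r in range(n)
    )

def inverse2(m):
    """Inverse of a non-singular 2x2 matrix."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def roots2(m):
    """Characteristic roots of a 2x2 matrix via the quadratic formula."""
    (a, b), (c, d) = m
    disc = cmath.sqrt((a + d) ** 2 - 4 * (a * d - b * c))
    return ((a + d) + disc) / 2, ((a + d) - disc) / 2

A = [[4.0, 1.0], [2.0, 3.0]]              # hypothetical example; roots are 5 and 2
L = half_row_column_bound(inverse2(A))    # upper limit for the roots of the inverse
lower_limit = 1 / L                       # lower limit for the roots of A
smallest = min(abs(z) for z in roots2(A))
print(lower_limit, smallest)
```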


LOUISIANA STATE UNIVERSITY.








COMPLETELY MONOTONE FUNCTIONS OF THE LAPLACE 
OPERATOR FOR TORUS AND SPHERE 


By 8S. BocHNEeErR 


In the present note we shall discuss some properties of the Laplace operator

$$\Delta = -\frac{1}{(2\pi)^2}\left(\frac{\partial^2}{\partial x_1^2} + \cdots + \frac{\partial^2}{\partial x_k^2}\right) \tag{1}$$

on the torus

$$-\tfrac12 \le x_\kappa < \tfrac12 \qquad (\kappa = 1, \dots, k), \tag{2}$$

that is, for functions having the period 1 in each variable, and corresponding properties for the Laplace-Beltrami operator on the sphere of positive constant curvature.

Part I. The torus


1. We introduce on the torus (2) the Hilbert space of functions of integrable square. It is the family of functions

$$f(x) \sim \sum_{n_1} \cdots \sum_{n_k} a_{n_1 \cdots n_k} \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)] \tag{3}$$

for which

$$\sum_{n_1} \cdots \sum_{n_k} |a_{n_1 \cdots n_k}|^2 < \infty. \tag{4}$$

At the outset, the operator (1) is defined only for functions which are twice differentiable. As such it is a positive semi-definite Hermitian operator with characteristic functions

$$\exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)] \tag{5}$$

belonging to the characteristic values

$$n_1^2 + \cdots + n_k^2. \tag{6}$$

In accordance with a general theorem referring to the nature of the Laplace-Beltrami operator on a compact Riemann space, our initial operator has a unique self-adjoint (hyper-maximal) closure with the same spectrum.¹ In what follows we shall be interested in this closure only and we shall denote it by $\Delta f$. The spectrum of $\Delta f$ is non-negative. Let $\varphi(\rho)$ be an arbitrary real continuous


Received May 4, 1937. 
1 Cf. S. Bochner, Analytic mapping of compact Riemann spaces into Euclidean space,
this Journal, vol. 3 (1937), pp. 339-354.


function in the half-line $0 \le \rho < \infty$. According to a general theorem on functions of operators² we can form the operator $\varphi(\Delta)f$. It is again self-adjoint and, in our case, it is uniquely determined by the fact that it again has the characteristic functions (5), the corresponding characteristic values being

$$\varphi(n_1^2 + \cdots + n_k^2).$$

If $\varphi(\Delta)f$ is defined for an element (3), then its value is

$$\sum \varphi(n_1^2 + \cdots + n_k^2)\,a_{n_1 \cdots n_k} \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)], \tag{7}$$

and it is defined for an element (3) if and only if (7) is again an element of the Hilbert space, that is, if

$$\sum |\varphi(n_1^2 + \cdots + n_k^2)|^2\,|a_{n_1 \cdots n_k}|^2 < \infty.$$
For example, if

$$\varphi(\rho) = \frac{1}{\rho + c}, \qquad c > 0, \tag{8}$$

then (7) assumes the form

$$\sum \frac{a_{n_1 \cdots n_k}}{n_1^2 + \cdots + n_k^2 + c} \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)],$$

and in this case the operator $\varphi(\Delta)f$ is the inverse of the operator $(\Delta + c)g$; that is, it represents the solution of the equation

$$(\Delta + c)g = f. \tag{9}$$
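The example (8)-(9) can be illustrated for $k = 1$ with a trigonometric polynomial. In the Python sketch below the coefficients of $f$, the value of $c$, and the function names are assumptions made for the demonstration; the sketch divides each Fourier coefficient by $n^2 + c$ and then checks, by a second central difference, that $-g''/(2\pi)^2 + cg$ reproduces $f$.

```python
import cmath
import math

def evaluate(coeffs, x):
    """Value at x of sum_n a_n exp(2 pi i n x) for a finite coefficient table."""
    return sum(a * cmath.exp(2j * math.pi * n * x) for n, a in coeffs.items())

def apply_phi(coeffs, c):
    """Multiply the n-th coefficient by phi(n^2) = 1/(n^2 + c), as in (7)-(8)."""
    return {n: a / (n * n + c) for n, a in coeffs.items()}

c = 0.5                                          # assumed constant c > 0
f = {0: 3.0, 1: 1.0, -1: 1.0, 3: 0.5j, -3: -0.5j}  # f(x) = 3 + 2cos(2 pi x) - sin(6 pi x)
g = apply_phi(f, c)

def residual(x, h=1e-4):
    """(Delta + c) g - f at x, with Delta = -(2 pi)^-2 d^2/dx^2 by central differences."""
    second = (evaluate(g, x + h) - 2 * evaluate(g, x) + evaluate(g, x - h)) / h ** 2
    return -second / (2 * math.pi) ** 2 + c * evaluate(g, x) - evaluate(f, x)

worst = max(abs(residual(x)) for x in (-0.4, -0.1, 0.0, 0.25, 0.45))
print(worst)
```

The residual is limited only by the finite-difference step, so a small but non-zero value is expected.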

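Purely formally the expression (7) can be written as an integral operator. Introduce the Green's function

$$G(x) = \sum \varphi(n_1^2 + \cdots + n_k^2) \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)], \tag{10}$$

then

$$\varphi(\Delta)f = \int_{-1/2}^{1/2} \cdots \int_{-1/2}^{1/2} G(x_1 - \xi_1, \dots, x_k - \xi_k)\,f(\xi_1, \dots, \xi_k)\,d\xi_1 \cdots d\xi_k. \tag{11}$$

Our aim is to discuss a class of functions $\varphi(\rho)$ for which this representation is valid.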

2. We assume that $\varphi(\rho)$ is completely monotone in $0 \le \rho < \infty$. This means that $\varphi(\rho)$ has derivatives of all orders in $0 < \rho < \infty$ and that

$$(-1)^n \frac{d^n\varphi(\rho)}{d\rho^n} \ge 0 \qquad (n = 0, 1, 2, \dots).$$


2 M. H. Stone, Linear Operations in Hilbert Space, 1932, Chapters VI, VII.








By an important theorem of S. Bernstein and Widder³ any function which is completely monotone in a half-line $\rho_0 < \rho < \infty$ can be represented uniquely as a Laplace integral

$$\varphi(\rho) = \int_0^\infty e^{-\rho t}\,d\alpha(t) \tag{12}$$

in which the function $\alpha(t)$ is monotonely non-decreasing. Since, by our assumption, $\varphi(\rho)$ is bounded to the right of $\rho = 0$, we have

$$\int_0^\infty d\alpha(t) < \infty; \tag{13}$$

this assumption will be somewhat relaxed later on. Another restriction which will be required throughout is the condition

$$\alpha(+0) = \alpha(0); \tag{14}$$

thus the function $\alpha(t)$ shall have no discontinuity for $t = 0$. The rôle of this restriction is obvious; it excludes the function $\varphi(\rho) = 1$ for which no Green's function exists. As a consequence of this restriction we have the limit relation

$$\varphi(\rho) = \lim_{a \to 0} \varphi_a(\rho), \tag{15}$$

where

$$\varphi_a(\rho) = \int_a^\infty e^{-\rho t}\,d\alpha(t), \qquad a > 0. \tag{16}$$

Obviously

$$\varphi_a(\rho) = O(e^{-a\rho}), \qquad \rho \to \infty. \tag{17}$$
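Postponing questions of convergence, we form the integral

$$H(x_1, \dots, x_k) = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} \varphi(\eta_1^2 + \cdots + \eta_k^2) \exp\,[2\pi i(\eta_1x_1 + \cdots + \eta_kx_k)]\,d\eta_1 \cdots d\eta_k, \tag{18}$$

and we observe that, by Poisson's summation formula,⁴ it is connected with the function (10) by the relation

$$G(x_1, \dots, x_k) = \sum_{m_1 = -\infty}^{\infty} \cdots \sum_{m_k = -\infty}^{\infty} H(x_1 + m_1, \dots, x_k + m_k). \tag{19}$$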
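Relation (19) can be tested numerically in the simplest case $k = 1$, $\varphi(\rho) = 1/(\rho + c)$. The Python sketch below rests on one fact not stated in the text, namely the standard Fourier integral $\int_{-\infty}^{\infty} e^{2\pi i\eta x}(\eta^2 + c)^{-1}\,d\eta = \pi c^{-1/2} e^{-2\pi c^{1/2}|x|}$; the truncation lengths and function names are assumptions.

```python
import math

def G_series(x, c, terms=20000):
    """Truncated series (10) for k = 1: sum over n of exp(2 pi i n x)/(n^2 + c)."""
    total = 1.0 / c
    for n in range(1, terms + 1):
        total += 2.0 * math.cos(2 * math.pi * n * x) / (n * n + c)
    return total

def H(x, c):
    """Whole-line transform of 1/(eta^2 + c), i.e. (18) for k = 1 (assumed closed form)."""
    root = math.sqrt(c)
    return math.pi / root * math.exp(-2 * math.pi * root * abs(x))

def G_images(x, c, images=50):
    """Right side of (19): sum of H over the integer translates of x."""
    return sum(H(x + m, c) for m in range(-images, images + 1))

x, c = 0.2, 1.0
left, right = G_series(x, c), G_images(x, c)
print(left, right)
```

The image sum converges geometrically while the Fourier series converges slowly, which is why the comparison is made only to a modest tolerance.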


Substituting in (18)

$$\varphi(\eta_1^2 + \cdots + \eta_k^2) = \int_0^\infty \exp\,[-(\eta_1^2 + \cdots + \eta_k^2)t]\,d\alpha(t),$$
3 D. V. Widder, Necessary and sufficient conditions for the representation of a function as a
Laplace integral, Trans. Amer. Math. Soc., vol. 33 (1931), pp. 851-892.
4 Cf. S. Bochner, Vorlesungen über Fouriersche Integrale, 1932, pp. 33-38, 203-205.














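inverting the integrations and making use of the formula

$$\int_{-\infty}^{\infty} \exp\,[-(\eta^2t - 2\pi i\eta x)]\,d\eta = \left(\frac{\pi}{t}\right)^{1/2} \exp\left(-\frac{\pi^2x^2}{t}\right),$$

we obtain

$$H(x_1, \dots, x_k) = \pi^{k/2} \int_0^\infty \exp\left[-\frac{\pi^2(x_1^2 + \cdots + x_k^2)}{t}\right] t^{-k/2}\,d\alpha(t). \tag{20}$$

Hence, introducing the function of one variable

$$H(r) = \pi^{k/2} \int_0^\infty \exp\left[-\frac{\pi^2r^2}{t}\right] t^{-k/2}\,d\alpha(t), \tag{21}$$

we have

$$H(x_1, \dots, x_k) = H\big((x_1^2 + \cdots + x_k^2)^{1/2}\big).$$

The function $H(r)$ which we consider for $0 < r < \infty$ is positive and non-increasing, by (20), and the function $G(x)$ is positive, by (19).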

In order to justify these relations we first replace the function $\varphi(\rho)$ by the truncated function (16) and we denote the corresponding functions (10), (18), (21) by $G_a(x)$, $H_a(x)$, $H_a(r)$, and the corresponding operator $\varphi(\Delta)f$ by $\varphi_a(\Delta)f$. Because of (17), the expressions (10), (18), (21) converge absolutely, relation (19) is valid, and, the function $G_a(x)$ being bounded, the right side of (11) has a meaning, and its value is (7), for any square-integrable function $f(x)$. Thus relation (11) holds.

We now assume that $a$ tends decreasingly to 0. Initially, the integral (18) and the sum (10) have no meaning for the function $\varphi(\rho)$ itself, but we define them by the limit-relations

$$H(x) = \lim_{a \to 0} H_a(x), \qquad G(x) = \lim_{a \to 0} G_a(x). \tag{22}$$

Since the functions $H_a(x)$, $G_a(x)$ increase when $a$ decreases, the limit relations have a meaning (although the resulting functions might have values $+\infty$), and relation (19) holds. We shall now use relation (15). In the first place it justifies relation (20), thus proving that $H(r)$ is everywhere finite. But we have still to show the finiteness of (10) and the validity of (11). As long as (18) is absolutely convergent it admits the inversion formula,

$$\varphi(\eta_1^2 + \cdots + \eta_k^2) = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} H(x_1, \dots, x_k) \exp\,[-2\pi i(x_1\eta_1 + \cdots + x_k\eta_k)]\,dx_1 \cdots dx_k. \tag{23}$$


In particular, 








$$\varphi_a(0) = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} H_a(x_1, \dots, x_k)\,dx_1 \cdots dx_k.$$

Since $H_a(x_1, \dots, x_k)$ and $\varphi_a(0)$ tend increasingly to their limit-values, we obtain

$$\varphi(0) = \int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} H(x_1, \dots, x_k)\,dx_1 \cdots dx_k.{}^{5}$$

Thus, the integral on the right is bounded. Since $H(r)$ decreases as $r$ increases, we readily conclude that the sum (19), but for the term $H(x_1, \dots, x_k)$, converges uniformly in (2). Thus, in (2), $G(x_1, \dots, x_k)$ differs from $H(x_1, \dots, x_k)$ by a bounded function only and, in particular, is finite. Since, by (10),

$$\varphi_a(0) = \int_{-1/2}^{1/2} \cdots \int_{-1/2}^{1/2} G_a(\xi_1, \dots, \xi_k)\,d\xi_1 \cdots d\xi_k,$$

the integral on the right is finite also for the limit-function $G(\xi_1, \dots, \xi_k)$. It is now easy to give a meaning to the integral (11) and to show that it has the value (7). As a consequence of (14) the function $\varphi(\rho)$ tends to 0 as $\rho \to \infty$ and the value (7) of $\varphi(\Delta)f$ shows that all our functions $\varphi(\Delta)$ of $\Delta$ are completely continuous operators.


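3. We shall next discuss the order of infinity of $G(x_1, \dots, x_k)$ in the neighborhood of the origin in terms of the magnitude of $\varphi(\rho)$ as $\rho \to \infty$. In the statements the function $G(x_1, \dots, x_k)$ will be replaced by the function $H(r)$.

I. In order that $H(r)$ be bounded (in the neighborhood of $r = 0$) it is necessary and sufficient that for some (and therefore every) $a > 0$

$$\int_a^\infty \varphi(\rho)\,\rho^{k/2 - 1}\,d\rho < \infty. \tag{24}$$

In fact, all occurring functions being non-negative, we may conclude

$$\int_0^\infty \varphi(\rho)\,\rho^{k/2 - 1}\,d\rho = \int_0^\infty \int_0^\infty e^{-\rho t}\rho^{k/2 - 1}\,d\rho\,d\alpha(t) = \Gamma\!\left(\frac{k}{2}\right) \int_0^\infty t^{-k/2}\,d\alpha(t) = \Gamma\!\left(\frac{k}{2}\right) \pi^{-k/2} \lim_{r \to 0} H(r). \tag{25}$$

To simplify writing we shall put

$$K(r) = \int_0^\infty e^{-r/t}\,t^{-k/2}\,d\alpha(t) = \pi^{-k/2} H\!\left(\frac{r^{1/2}}{\pi}\right).$$

Similar to (25) we obtain, for

$$0 < \lambda < \tfrac12 k,$$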

5 See also S. Bochner, Monotone Funktionen, Stieltjesche Integrale und harmonische 
Analyse, Math. Annalen, vol. 108 (1933), pp. 399-408. 







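putting $\mu = \tfrac12 k - \lambda$, the relation

$$\frac{1}{\Gamma(\lambda)} \int_0^\infty \varphi(\rho)\,\rho^{\lambda - 1}\,d\rho = \frac{1}{\Gamma(\mu)} \int_0^\infty K(r)\,r^{\mu - 1}\,dr.$$

Hence, we have

II. In order that for some (and therefore every) $a > 0$

$$\int_0^a H(r)\,r^{2\mu - 1}\,dr < \infty, \tag{26}$$

it is necessary and sufficient that for some (and therefore every) $b > 0$

$$\int_b^\infty \varphi(\rho)\,\rho^{\lambda - 1}\,d\rho < \infty. \tag{27}$$

Finally, we have

III. If $L(\xi)$ is any function in $0 < \xi < \infty$ such that, for every $c > 0$,

$$\lim \frac{L(c\xi)}{L(\xi)} = 1 \quad \text{as } \xi \to 0 \text{ and } \xi \to \infty, \tag{28}$$

then the asymptotic relation

$$\varphi(\rho) \sim \frac{A}{\rho^\lambda}\,L\!\left(\frac{1}{\rho}\right), \qquad \rho \to \infty, \tag{29}$$

implies an asymptotic relation

$$H(r) \sim \frac{B}{r^{2\mu}}\,L(r^2), \qquad r \to 0, \tag{30}$$

and vice versa.

This follows from the following theorem.⁶

If $T(t)$ is non-decreasing, $T(+0) = T(0)$, and $\int_0^\infty e^{-st}\,dT(t)$ is finite for $s > 0$, then, for $\sigma > 0$, the relations

$$\int_0^\infty e^{-st}\,dT(t) \sim \frac{1}{s^\sigma}\,L\!\left(\frac{1}{s}\right) \qquad \begin{cases} \text{as } s \to 0 \\ \text{as } s \to \infty \end{cases} \tag{31}$$

and

$$T(x) \sim \frac{x^\sigma L(x)}{\Gamma(\sigma + 1)} \qquad \begin{cases} \text{as } x \to \infty \\ \text{as } x \to 0 \end{cases} \tag{32}$$

are equivalent.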

We shall verify, for instance, that (29) implies (30) or, what is the same, that (29) implies

$$K(r) \sim \frac{C}{r^\mu}\,L(r), \qquad r \to 0.$$
6 J. Karamata, Neuer Beweis und Verallgemeinerung der Tauberschen Sätze, welche die
Laplacesche und Stieltjessche Transformation betreffen, Crelle's Journal, vol. 146 (1931),
pp. 27-39.











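By assumption

$$\int_0^\infty e^{-st}\,d\alpha(t) \sim \frac{A}{s^\lambda}\,L\!\left(\frac{1}{s}\right), \qquad s \to \infty.$$

Consequently, by the quoted theorem,

$$\alpha(t) \sim \frac{A\,t^\lambda L(t)}{\Gamma(\lambda + 1)}, \qquad t \to 0,$$

or

$$\alpha\!\left(\frac{1}{t}\right) \sim \frac{A}{\Gamma(\lambda + 1)}\,t^{-\lambda} L\!\left(\frac{1}{t}\right), \qquad t \to \infty.$$

Defining the function $T(t)$ by the relation

$$dT(t) = t^{k/2}\,d\!\left[-\alpha\!\left(\frac{1}{t}\right)\right]$$

we easily obtain

$$T(x) \sim \frac{A\lambda}{\mu\,\Gamma(\lambda + 1)}\,x^\mu L\!\left(\frac{1}{x}\right), \qquad x \to \infty.$$

But

$$K(s) = \int_0^\infty e^{-s/t}\,t^{-k/2}\,d\alpha(t) = \int_0^\infty e^{-st}\,dT(t),$$

and therefore, since (32) implies (31), we obtain, replacing $L(x)$ by $L(x^{-1})$,

$$K(s) \sim \frac{A\,\Gamma(\mu)}{\Gamma(\lambda)}\,\frac{1}{s^\mu}\,L(s), \qquad s \to 0.$$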
4. Finally we shall show that $G(x_1, \dots, x_k)$ is analytic at all points of the torus except the origin and that for an analytic function $f(x_1, \dots, x_k)$ the transformed function $\varphi(\Delta)f$ is again analytic.

If $x_1^0, \dots, x_k^0$ is any fixed point on the torus different from $(0, \dots, 0)$, then the function $H(r)$ is analytic in some neighborhood of this point. In fact, given an $\epsilon > 0$ there exists a $\delta > 0$ such that
Pe oe m|/ 2l|al tes + z|/—-é 
for all complex values of the neighborhood 
6 ;2 


0 2 : 
my — M1] Heese +1 hem ee! Se. 


Consequently, the integral (20) and the partial derivatives

$$\frac{\partial H}{\partial x_\kappa} = \pi^{k/2} \int_0^\infty \left(-\frac{2\pi^2x_\kappa}{t}\right) \exp\left(-\frac{\pi^2}{t}(x_1^2 + \cdots + x_k^2)\right) t^{-k/2}\,d\alpha(t) \tag{33}$$

exist in a complex neighborhood, thus proving the analyticity of $H(x_1, \dots, x_k)$ at all points but the origin. Also, the argument in §2 shows easily that the
















series (19) converges uniformly in our complex neighborhood; the result is that $G(x_1, \dots, x_k)$ is analytic at all points of the torus except the origin and that in the neighborhood of the origin it differs from the function $H(x_1, \dots, x_k)$ by an analytic function. Furthermore, in dealing with the singularity of $H(x_1, \dots, x_k)$ at the origin we may assume that $\varphi(\rho)$ has the form

$$\varphi^*(\rho) = \int_0^a e^{-\rho t}\,d\alpha(t). \tag{34}$$

This follows from the fact that the difference function $\varphi_a(\rho) = \varphi(\rho) - \varphi^*(\rho)$ (see (16)) has the order of magnitude

$$\varphi_a(n_1^2 + \cdots + n_k^2) = O\big(\exp\,[-4\pi(n_1^2 + \cdots + n_k^2)]\big)$$

for $a > 0$ sufficiently large, thus making the corresponding function $G_a(x)$ analytic by crude absolute and uniform convergence of series (10).

We now assume that $f(x_1, \dots, x_k)$ is analytic on the torus. If $x$ runs over a complex neighborhood of a fixed point and $\xi$ over a set $S$ of the torus such that $G(x - \xi)$ is analytic in the neighborhood of the fixed point, uniformly in $\xi$ from $S$, then the integral

$$\int \cdots \int_S G(x - \xi)\,f(\xi)\,d\xi_1 \cdots d\xi_k$$

is analytic at the given point $x$ (even if $f(\xi)$ is not analytic). Hence it is sufficient to show that

$$g(x) = \int \cdots \int_{S_r} H(x - \xi)\,f(\xi)\,d\xi_1 \cdots d\xi_k \tag{35}$$

is analytic at the origin if $S_r$ is some sphere $\xi_1^2 + \cdots + \xi_k^2 \le r^2$ and

$$H(x) = \int_0^a \exp\left[-\frac{\pi^2(x_1^2 + \cdots + x_k^2)}{t}\right] t^{-c}\,d\alpha(t),$$

$a > 0$, $c > 0$. Following E. E. Levi⁷ we replace in (35) the coördinates $\xi_1, \dots, \xi_k$ by a type of polar coördinates originating at the variable point $x$ which is interior to $S_r$. The new coördinates consist of angular variables $\theta_1, \dots, \theta_{k-1}$ and a radial distance $\rho$, $0 \le \rho \le 1$. The angular variables do not depend on the variable point $x$. They are fixed variables on the surface of the sphere $S_r$, such that the rectangular coördinates $\eta_1, \dots, \eta_k$ of the surface $\eta_1^2 + \cdots + \eta_k^2 = r^2$ are fixed functions of $\theta_1, \dots, \theta_{k-1}$. But the radial distance $\rho$ does depend on $x$ and is defined in the following way. Let $\eta$ be any point on $S_r$; if $\xi$ lies on the segment joining $x$ and $\eta$, then

$$\xi_\kappa = x_\kappa + \rho(\eta_\kappa - x_\kappa) \qquad (\kappa = 1, \dots, k),$$


7 Cf. E. Hopf, Über den funktionalen, insbesondere den analytischen Charakter der Lösungen
elliptischer Differentialgleichungen zweiter Ordnung, Math. Zeitschrift, vol. 34 (1931),
p. 224.














the quantity $\rho$ being the length of the segment $(x, \xi)$ divided by the length of the segment $(x, \eta)$. If we introduce these coördinates in (35), then the factor of $H(x - \xi)$ is analytic in a complex neighborhood of $x = 0$, uniformly in the variables $\theta_1, \dots, \theta_{k-1}, \rho$, over which we integrate. Also, what is very important, the volume element contains the factor $\rho^{k-1}\,d\rho$. As for $H(x - \xi)$ itself, we can write it in the form


(36) Hix — = | exp | - ((x; — m) +e +(x, — m |e * Nee(t) 





t 
Since $\eta_1^2 + \cdots + \eta_k^2 = r^2$, there exists a fixed complex neighborhood $N$ of the
point $x = 0$, such that in this neighborhood
° e r 
(m1 — m) +++ + (re — mm) | 2 5° 


Hence (36) is dominated by 


| exp |-% ac * dat). 


Therefore (36) is analytic for x C N, uniformly in ajl @.--* , A) and in ; 

¢ < p Sr, for any fixed « > 0. Hence the function : 
g(r) = | cee | H(x — &)f(é) di «++ dé, 

Ya” ; 

is analytic for $x \subset N$. But $g_\epsilon(x) - g(x)$ is dominated by


[ | exp | -% | # dp da(t), 


and this is a finite dominant independent of x. The function g(x), being the 
limit of boundedly convergent functions over a complex domain, is likewise 
analytic. 





5. The function $H(x_1, \dots, x_k)$ entering the sum (19) is a Green's function itself, namely, the Green's function of the operator $\varphi(\Delta)f$ not for the torus (2) but for the whole Euclidean space; compare its definition (18). For the special function

$$\varphi(\rho) = \frac{1}{\rho + c}$$

we have

$$H(r) = \pi^{k/2} \int_0^\infty \exp\left[-\frac{\pi^2r^2}{t} - ct\right] t^{-k/2}\,dt,$$

and therefore

$$H(r) = 2\pi^{k/2}\,(\pi r c^{-1/2})^{1 - k/2}\,K_{k/2 - 1}(2\pi r c^{1/2}),$$

where $K_\nu(z)$ is the Hankel function of imaginary argument.⁸ For $k \ge 3$, its expansion in the neighborhood of the origin, apart from a factor independent of $r$ and $c$, starts with the term

$$r^{-k+2}. \tag{37}$$
The other terms contain each a positive power of $c$ or a product of a positive power of $c$ with $\log c$. Thus, for $c \to 0$, we obtain the term (37) alone, and this is actually the Green's function for the operator $\Delta^{-1}f$ corresponding to $\varphi(\rho) = \rho^{-1}$. But the corresponding sum (19) tends to $+\infty$ as $c \to 0$, and so does formally the series (10) since its constant term $\varphi(0)$ tends to $+\infty$, if $\varphi(\rho) \to \rho^{-1}$. The reason is that on the torus the operator $\Delta^{-1}f$ does not exist for all functions of integrable square, since $\Delta f$ has the proper function $f(x) = 1$ corresponding to the characteristic value 0. But the operator $\Delta^{-1}f$ exists for all functions which are orthogonal to this proper function, that is, for all functions $f(x)$ in whose expansion (3) the constant term vanishes. More generally, for this restricted class of functions $f(x)$ the operator $\varphi(\Delta)f$ will exist if $\varphi(\rho)$ is (bounded and) completely monotone in $1 \le \rho < \infty$. It will again be representable in the form (11) if in the sum (10) we omit the term corresponding to $n_1 = \cdots = n_k = 0$. Suppose in general that $\varphi(\rho)$ is completely monotone for $\rho \ge b$, in which case $G(x)$ is defined as the sum

$$\sum_{n_1^2 + \cdots + n_k^2 \ge b} \varphi(n_1^2 + \cdots + n_k^2) \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)].$$


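If $a$ is any fixed number $\ge b$, we put

$$\varphi(\rho) = \int_0^a e^{-\rho t}\,d\alpha(t) + \int_a^\infty e^{-\rho t}\,d\alpha(t) = \varphi_1(\rho) + \varphi_2(\rho),$$

and

$$G_1(x) = \sum \varphi_1(n_1^2 + \cdots + n_k^2) \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)],$$

$$G_2(x) = -\sum_{n_1^2 + \cdots + n_k^2 < b} \varphi_1(n_1^2 + \cdots + n_k^2) \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)] + \sum_{n_1^2 + \cdots + n_k^2 \ge b} \varphi_2(n_1^2 + \cdots + n_k^2) \exp\,[2\pi i(n_1x_1 + \cdots + n_kx_k)].$$

Obviously all previous considerations remain valid for the pair of functions $\varphi_1(\rho)$, $G_1(x)$. Also $\varphi_2(\rho) = O(e^{-a\rho})$ as $\rho \to \infty$, and therefore $G_2(x)$ is analytic and bounded on the torus. Consequently all previously stated properties remain in force for the functions $\varphi(\rho)$, $G(x)$ themselves.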
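Returning to the special function $\varphi(\rho) = 1/(\rho + c)$ of this section: for $k = 3$ the integral for $H(r)$ reduces, by the classical formula $\int_0^\infty t^{-3/2} e^{-\beta/t - \gamma t}\,dt = (\pi/\beta)^{1/2} e^{-2(\beta\gamma)^{1/2}}$, to the elementary expression $(\pi/r)\,e^{-2\pi r c^{1/2}}$. The Python sketch below compares a direct quadrature of the integral with this closed form. The substitution $t = e^s$, the integration limits, and the sample values of $r$ and $c$ are assumptions of the sketch, not part of the text.

```python
import math

def H_integral(r, c, k=3, lo=-30.0, hi=30.0, steps=60000):
    """pi^(k/2) * int_0^inf exp(-pi^2 r^2/t - c t) t^(-k/2) dt, using t = exp(s)."""
    h = (hi - lo) / steps
    total = 0.0
    for i in range(steps + 1):
        s = lo + i * h
        t = math.exp(s)
        weight = 0.5 if i in (0, steps) else 1.0   # trapezoidal end weights
        # dt = t ds, so the integrand in s carries the factor t^(1 - k/2).
        total += weight * math.exp(-math.pi ** 2 * r * r / t - c * t + (1 - k / 2) * s)
    return math.pi ** (k / 2) * total * h

def H_closed(r, c):
    """Closed form for k = 3 only: (pi / r) * exp(-2 pi r sqrt(c))."""
    return math.pi / r * math.exp(-2 * math.pi * r * math.sqrt(c))

r, c = 0.3, 2.0
numeric, exact = H_integral(r, c), H_closed(r, c)
print(numeric, exact)
```

As $c \to 0$ the closed form tends to $\pi/r$, in agreement with the leading term (37) for $k = 3$.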


8 G. N. Watson, Bessel Functions, pp. 185-189.








Part II. The sphere

We shall now consider the Laplace-Beltrami operator on a $k$-dimensional space of constant positive curvature, and we shall assume that our space is given as a Euclidean sphere of radius 1,

$$\xi_1^2 + \cdots + \xi_{k+1}^2 = 1. \tag{38}$$

It is possible to prove analogues of our previous theorems for the most general completely monotone functions $\varphi(\rho)$ as before. But the formulas are more complicated and the argument is rather tedious. However, there are special classes of completely monotone functions for which special sets of formulas are available, and we shall restrict our attention to one special class of this kind.


6. We shall need a lemma on completely monotone functions.

If $\psi(\rho)$ is completely monotone in $\rho \ge 0$,

$$\psi(\rho) = \int_0^\infty e^{-\rho t}\,d\alpha(t), \tag{39}$$

and $\chi(\rho)$ is the integral, vanishing at the origin, of a completely monotone function in $\rho \ge 0$,

$$\chi'(\rho) = \int_0^\infty e^{-\rho t}\,d\gamma(t),$$

then the function $\varphi(\rho) = \psi(\chi(\rho))$ is again completely monotone in $\rho \ge 0$.

The lemma can be proved directly from the definition by verifying inductively that the $n$-th derivative of $\varphi(\rho)$ has the sign of $(-1)^n$. However this procedure is rather cumbersome. A more elaborate but more illuminating proof runs as follows. We first conclude from the definition and the integral representation that sum, product and limit of completely monotone functions are again completely monotone. Therefore, since $\varphi(\rho)$ is a limit of finite sums of the form

$$\sum_j e^{-\chi(\rho)t_j}\,\big(\alpha(t_{j+1}) - \alpha(t_j)\big),$$

it is sufficient to prove that if $\chi(\rho)t_j$ is replaced by $\chi(\rho)$, $e^{-\chi(\rho)}$ is completely monotone. Approximating to $\chi'(\rho)$ by a finite sum

$$\sum_j e^{-\rho t_j}\,\big(\gamma(t_{j+1}) - \gamma(t_j)\big)$$

we may further assume that $\chi'(\rho) = e^{-\rho t}$, in which case

$$\varphi(\rho) = \exp\left(-\frac{1 - e^{-\rho t}}{t}\right) = e^{-1/t} \exp\left(\frac{e^{-\rho t}}{t}\right).$$

But the latter function is completely monotone, since

$$\exp\left(\frac{e^{-\rho t}}{t}\right) = \sum_{n=0}^\infty \frac{e^{-n\rho t}}{n!\,t^n}.$$







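In particular, if $\nu$ is any constant $> 0$, we may put

$$\chi(\rho) = (\rho + \nu^2)^{1/2} - \nu$$

since

$$2\chi'(\rho) = (\rho + \nu^2)^{-1/2} = \pi^{-1/2} \int_0^\infty e^{-(\rho + \nu^2)t}\,t^{-1/2}\,dt.$$

Hence, if $\psi(\rho)$ is any completely monotone function in $\rho \ge 0$, then the function

$$\varphi(\rho) = \psi\big((\rho + \nu^2)^{1/2} - \nu\big) \tag{40}$$

is again completely monotone in $\rho \ge 0$.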
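A numerical plausibility check of (40) is possible through finite differences: for a completely monotone function the $n$-th forward difference has the sign of $(-1)^n$. The Python sketch below applies this necessary condition to $\psi(\rho) = e^{-\rho}$ with $\nu = 1$; the step size, the sample points, the maximal order and the function names are all assumptions, and passing the test does not prove complete monotonicity.

```python
import math

def phi(rho, nu):
    """The function (40) for psi(rho) = exp(-rho)."""
    return math.exp(-(math.sqrt(rho + nu * nu) - nu))

def forward_difference(f, x, h, n):
    """n-th forward difference: sum_j (-1)^(n-j) C(n, j) f(x + j h)."""
    return sum((-1) ** (n - j) * math.comb(n, j) * f(x + j * h) for j in range(n + 1))

def looks_completely_monotone(f, xs, h=0.5, max_order=6):
    """Necessary condition only: (-1)^n times the n-th difference is non-negative."""
    return all(
        (-1) ** n * forward_difference(f, x, h, n) >= 0
        for x in xs
        for n in range(max_order + 1)
    )

points = [0.0, 0.5, 1.0, 2.0, 5.0]
passes = looks_completely_monotone(lambda rho: phi(rho, 1.0), points)
fails = looks_completely_monotone(lambda rho: math.exp(-rho * rho), [0.0])
print(passes, fails)
```

The second call uses $e^{-\rho^2}$, which is not completely monotone, to show that the check can fail.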
When dealing with the sphere (38) we shall restrict ourselves to the class of functions (40), the constant $\nu$ having the value

$$\nu = \frac{k - 1}{2}.$$

This class is rather narrow; nevertheless it contains the functions (8) for $0 < c < \nu^2$. In fact, the corresponding function $\psi(\rho)$ is

$$[(\rho + \nu)^2 + c - \nu^2]^{-1}$$

and this is identical with

$$(\nu^2 - c)^{-1/2} \int_0^\infty e^{-(\rho + \nu)t} \sinh\,(\nu^2 - c)^{1/2}t\;dt. \tag{41}$$
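The identity behind (41) can be verified in two lines. The following worked computation is a sketch in which the abbreviations $s = \rho + \nu$ and $b = (\nu^2 - c)^{1/2}$ are introduced only for this check.

```latex
\int_0^\infty e^{-st}\sinh(bt)\,dt
  = \frac12\left(\frac{1}{s-b}-\frac{1}{s+b}\right)
  = \frac{b}{s^{2}-b^{2}}, \qquad s > b \ge 0,
\]
\[
\text{so that}\quad
(\nu^{2}-c)^{-1/2}\int_0^\infty e^{-(\rho+\nu)t}\sinh\,(\nu^{2}-c)^{1/2}t\;dt
  = \frac{1}{(\rho+\nu)^{2}+c-\nu^{2}} .
```

Replacing $\rho$ by $(\rho + \nu^2)^{1/2} - \nu$, as in (40), turns the right side into $1/(\rho + c)$, which is the function (8).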


7. In our present case the Green's function $G(\xi, x)$ depends only on the geodesic distance between the two points $\xi$, $x$, both on the sphere (38). Denoting this distance by $r$, $0 \le r \le \pi$, we have as an analogue to (10) the formula

$$G(r) = \sum_{n=0}^\infty (n + \nu)\,\varphi(n(n + 2\nu))\,P_n^{(\nu)}(\cos r). \tag{42}$$

The spherical harmonics $P_n^{(\nu)}$ can be defined either by

$$\nu \sum_{n=0}^\infty P_n^{(\nu)}(\cos r)\,w^n = (1 - 2w\cos r + w^2)^{-\nu} \tag{43}$$

or by

$$\sum_{n=0}^\infty (n + \nu)\,P_n^{(\nu)}(\cos r)\,w^n = (1 - w^2)(1 - 2w\cos r + w^2)^{-\nu - 1}. \tag{44}$$

Now, by (40) and (39),

$$\varphi(n(n + 2\nu)) = \psi(n) = \int_0^\infty e^{-nt}\,d\alpha(t)$$

and therefore, by (44),

$$G(r) = \int_0^\infty \frac{(1 - e^{-2t})\,d\alpha(t)}{(1 - 2\cos r \cdot e^{-t} + e^{-2t})^{\nu + 1}}. \tag{45}$$








Since

$$1 - 2\cos r \cdot e^{-t} + e^{-2t} = (1 - e^{-t})^2 + 2(1 - \cos r)\,e^{-t}, \tag{46}$$

the function $G(r)$ is monotonely non-decreasing for decreasing values of $r$. About its behavior in the neighborhood of $r = 0$ we shall deduce the following theorem.

THEOREM. The assertions I, II, III of §3 are true if in their wording the function $H(r)$ is replaced by the function (42) and the function $\varphi(\rho)$ by the function (40).

We first observe that for the present theorem the function (45) is equivalent to the function

$$T(r) = \int_0^\infty \frac{t\,d\alpha(t)}{(t^2 + r^2)^{\nu + 1}}. \tag{47}$$

This follows easily from the fact that, for each $\epsilon > 0$, the differences between the functions $G(r)$ and $T(r)$ and the corresponding functions

$$G_\epsilon(r) = \int_0^\epsilon \frac{(1 - e^{-2t})\,d\alpha(t)}{(1 - 2\cos r \cdot e^{-t} + e^{-2t})^{\nu + 1}}, \qquad T_\epsilon(r) = \int_0^\epsilon \frac{t\,d\alpha(t)}{(t^2 + r^2)^{\nu + 1}}$$

are bounded in $0 < r \le \pi/2$, and that

$$1 - \eta(\epsilon) \le \liminf_{r \to 0} \frac{G_\epsilon(r)}{2T_\epsilon(r)} \le \limsup_{r \to 0} \frac{G_\epsilon(r)}{2T_\epsilon(r)} \le 1 + \eta(\epsilon),$$

where $\lim_{\epsilon \to 0} \eta(\epsilon) = 0$, since (see (46))

$$\lim_{t \to 0} \frac{1 - e^{-t}}{t} = \lim_{t \to 0} \frac{1 - e^{-2t}}{2t} = \lim_{r \to 0} \frac{2(1 - \cos r)}{r^2} = 1.$$

Now, since $\nu + 1 = \tfrac12(k + 1)$, the function (47) differs only by a numerical constant from

$$\int_{-\infty}^{\infty} \cdots \int_{-\infty}^{\infty} \psi\big((\eta_1^2 + \cdots + \eta_k^2)^{1/2}\big) \exp\,[2\pi i(\eta_1x_1 + \cdots + \eta_kx_k)]\,d\eta_1 \cdots d\eta_k, \tag{48}$$

where $x_1^2 + \cdots + x_k^2 = r^2$ and $\psi(\rho)$ is our present function (39).⁹ But (48) is the function (18) with $\psi(\rho^{1/2})$ in the place of $\varphi(\rho)$. Hence the assertions I, II, III are true if we replace $H(r)$ by (42) and $\varphi(\rho)$ by $\psi(\rho^{1/2})$. Obviously, for $\rho \to \infty$, the function $\psi(\rho^{1/2})$ can be replaced by the function (40), and this completes the proof of our theorem.

In a similar fashion we could prove that $\varphi(\Delta)f$ transforms analytic functions into analytic functions and that the function $\psi(\rho)$ need not be bounded in the whole half-line $0 \le \rho < \infty$.


9 See S. Bochner, Fouriersche Integrale, p. 189, formula (21).








8. For $\varphi(\rho) = (\rho + c)^{-1}$, $0 < c < \nu^2$, we obtain for (45) (see (41))

$$G(r) \doteq \int_0^\infty \frac{\sinh t \cdot \sinh\,(\nu^2 - c)^{1/2}t}{(\cosh t - \cos r)^{\nu + 1}}\,dt,$$

where "$\doteq$" denotes equality up to a factor independent of $r$ (but dependent on $k$ and $c$). Hence, by partial integration,

$$G(r) \doteq \int_0^\infty \frac{\cosh\,(\nu^2 - c)^{1/2}t}{(\cosh t - \cos r)^{\nu}}\,dt. \tag{49}$$

This formula remains valid for $c > \nu^2$, in which case

$$G(r) \doteq \int_0^\infty \frac{\cos\,(c - \nu^2)^{1/2}t}{(\cosh t - \cos r)^{\nu}}\,dt.$$

For $\nu = \tfrac12$, that is, $k = 2$, we hence obtain, writing $c - \nu^2 = p^2$,

$$G(r) \doteq Q_{-\frac12 + ip}(\cos r) + Q_{-\frac12 - ip}(\cos r),$$

where $Q_n(\cos r)$ is a spherical harmonic of the second kind.¹⁰ For $0 < c < \tfrac14$, the connection between $G(r)$ and the standard function $Q_n(\cos r)$ is more involved.

For $k$ even and $\ge 4$ we put in (49), $(\nu^2 - c)^{1/2} = n + \tfrac12$, $u = \sinh \tfrac12 t$, $z = \sin \tfrac12 r$, and obtain

$$G(r) \doteq \int_0^\infty \frac{\Phi_n(u)\,du}{(u^2 + z^2)^{\nu}},$$

where

$$\Phi_n(u) = \frac{\cosh\,(n + \tfrac12)t}{\cosh \tfrac12 t}.$$

If $n$ is an integer, that is, for

$$c = \left(\frac{k - 1}{2}\right)^2 - \left(\frac{2n + 1}{2}\right)^2, \qquad n = 0, 1, \dots, \frac{k - 4}{2}, \tag{50}$$

$\Phi_n(u)$ has the special form

$$a_0 + a_1u^2 + \cdots + a_nu^{2n},$$

and therefore

$$G(r) \doteq \frac{b_0}{z^{k-2}} + \frac{b_1}{z^{k-4}} + \frac{b_2}{z^{k-6}} + \cdots + \frac{b_n}{z^{k-2-2n}};$$

in particular

$$G(r) = \frac{1}{r^{k-2}}\,(\alpha_0 + \alpha_1r^2 + \alpha_2r^4 + \alpha_3r^6 + \cdots).$$


10 E. W. Hobson, The Theory of Spherical and Ellipsoidal Harmonics, 1931, p. 275,
formula (149).







This expansion is remarkable for the absence of terms containing log r which 
one would expect to be present in an expansion of G(r) around the origin. It 
can be shown that for values of c other than (50) the logarithmic terms actually 
do occur, but a further investigation of this question would have but little to 
do with the topic of the present note. 
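The polynomial form of $\cosh\,(n + \tfrac12)t / \cosh \tfrac12 t$ in $u = \sinh \tfrac12 t$, on which the expansion without logarithmic terms rests, is easy to confirm for small $n$. In the Python sketch below the coefficient table is derived from the multiple-angle formulas for $\cosh 3y$ and $\cosh 5y$ and is an assumption of the sketch rather than something stated in the text.

```python
import math

def Phi(n, t):
    """cosh((n + 1/2) t) / cosh(t / 2), the quotient appearing before (50)."""
    return math.cosh((n + 0.5) * t) / math.cosh(0.5 * t)

# Assumed coefficients a_0, a_1, ... of u^0, u^2, ... for n = 0, 1, 2.
POLYS = {0: [1.0], 1: [1.0, 4.0], 2: [1.0, 12.0, 16.0]}

def Phi_poly(n, t):
    """The same quantity evaluated as a polynomial in u^2 = sinh(t/2)^2."""
    u2 = math.sinh(0.5 * t) ** 2
    return sum(a * u2 ** j for j, a in enumerate(POLYS[n]))

worst = max(
    abs(Phi(n, t) - Phi_poly(n, t)) / Phi(n, t)
    for n in POLYS
    for t in (0.0, 0.3, 1.0, 2.5)
)
print(worst)
```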


PRINCETON UNIVERSITY.

















AN ANALOGUE OF THE VON STAUDT-CLAUSEN THEOREM 
By LEONARD CARLITZ 


1. Introduction. Let $GF(p^n)$ denote a fixed Galois field, and $x$ an indeterminate over the field. The function¹

$$\psi(t) = \sum_{i=0}^\infty (-1)^i \frac{t^{p^{ni}}}{F_i}, \tag{1.1}$$

where

$$[i] = x^{p^{ni}} - x, \qquad F_i = [i][i-1]^{p^n} \cdots [1]^{p^{n(i-1)}}, \qquad F_0 = 1, \tag{1.2}$$

is closely connected with the arithmetic of polynomials in the $GF(p^n)$. In this paper we study the coefficients in the reciprocal of (1.1), more precisely in $t/\psi(t)$. In particular we shall be interested in proving an analogue of the von Staudt-Clausen theorem for these coefficients.

In order to define properly the coefficients in the reciprocal it is necessary to define a "normalizing" factor (analogous to $n!$ in ordinary arithmetic). This is done in the following way. Let

$$m = \alpha_0 + \alpha_1p^n + \cdots + \alpha_sp^{ns} \qquad (0 \le \alpha_i < p^n)$$

be the canonical expansion of $m$ to the base $p^n$; then we put

$$g(m) = F_0^{\alpha_0}F_1^{\alpha_1} \cdots F_s^{\alpha_s}, \qquad g(0) = 1, \tag{1.3}$$

where $F_i$ has the same significance as in (1.2). Thus for example

$$g(p^{ni}) = F_i, \qquad g(p^{ni} - 1) = (F_1 \cdots F_{i-1})^{p^n - 1}.$$
We may now define the coefficients of the reciprocal by means of

$$\frac{t}{\psi(t)} = \sum_{m=0}^\infty \frac{B_m}{g(m)}\,t^m, \tag{1.4}$$

the summation obviously involving only terms in which $m$ is a multiple of $p^n - 1$. Clearly $B_0 = 1$ and $B_m$ is a rational function of $x$. The analogy between $B_m$ and the ordinary Bernoulli numbers is brought out by the relation²
CR ee (p" — 1|m), 


where the summation is over all primary polynomials F, and 


p”*/(p"—1) 
~ mee [A] — 1) ++ 


Received July 21, 1937. 
1 See this Journal, vol. 1 (1935), pp. 137-168. This paper will be cited as DJ.

2 DJ, p. 161, Theorem 9.3.








In the present paper we discuss some of the arithmetic properties of $B_m$. Our principal result is the analogue of the von Staudt-Clausen theorem for the Bernoulli numbers. We find that

$$B_m = G_m - e \sum_{\deg P = k} \frac{1}{P} \qquad (p^n \ne 2),$$

where $G_m$ is some polynomial (whose precise form is not determined), $e$ is an integer, not divisible by $p$, and the summation is over all irreducible polynomials $P$ of degree $k$; finally $k$ is a number depending on $m$ whose existence depends on a certain set of conditions (see (7.3) below); if the conditions are not satisfied $B_m = G_m$ and is therefore a polynomial. When $p^n = 2$ the result must be modified slightly.

The method of proof depends on certain ideas due to A. Hurwitz.³ While the proof is not particularly difficult, there are a number of details that make it rather long. In particular it is necessary to prove certain lemmas on $g(m)$ which are of some interest in themselves.


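2. Lemmas on $g(m)$.

THEOREM 1. For $m_1, m_2 \ge 0$, the quotient

$$\frac{g(m_1 + m_2)}{g(m_1)\,g(m_2)} \tag{2.1}$$

is integral (that is, a polynomial).⁴

Let

$$m_1 = \beta_0 + \beta_1p^n + \cdots + \beta_sp^{ns} \quad (0 \le \beta_i < p^n), \qquad m_2 = \gamma_0 + \gamma_1p^n + \cdots + \gamma_sp^{ns} \quad (0 \le \gamma_i < p^n); \tag{2.2}$$

then

$$m_1 + m_2 = (\beta_0 + \gamma_0) + \cdots + (\beta_s + \gamma_s)p^{ns} \qquad (0 \le \beta_i + \gamma_i < 2p^n). \tag{2.3}$$

If now we put

$$\beta_0 + \gamma_0 = \alpha_0 + \delta_0p^n,$$

where $\delta_0 = 0$ or 1, and $0 \le \alpha_0 < p^n$, we may define $\delta_1, \dots, \delta_s$ recursively by means of

$$\delta_{i-1} + \beta_i + \gamma_i = \alpha_i + \delta_ip^n, \tag{2.4}$$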

3 Mathematische Annalen, vol. 51 (1899), pp. 196-226 (= Mathematische Werke II,
Basel, 1933, pp. 342-373).

4 Throughout this paper the word "integral" will be used to denote a polynomial in x
with coefficients in GF(p^n).











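where each $\delta_i = 0$ or 1, and $0 \le \alpha_i < p^n$. Thus (2.3) becomes

$$m_1 + m_2 = \alpha_0 + \alpha_1p^n + \cdots + \alpha_sp^{ns} + \delta_sp^{n(s+1)}.$$

Hence by the definition of $g(m)$ in (1.3),

$$g(m_1 + m_2) = F_0^{\alpha_0} \cdots F_s^{\alpha_s}F_{s+1}^{\delta_s}.$$

Comparison with (2.2) leads at once to

$$\frac{g(m_1 + m_2)}{g(m_1)\,g(m_2)} = F_{s+1}^{\delta_s} \prod_{i=1}^{s} F_i^{\alpha_i - \beta_i - \gamma_i}.$$

Since by (1.2) $F_{s+1} = [s+1]F_s^{p^n}$, the right member

$$= [s+1]^{\delta_s}\,F_s^{\delta_sp^n + \alpha_s - \beta_s - \gamma_s} \prod_{i=1}^{s-1} F_i^{\alpha_i - \beta_i - \gamma_i} = [s+1]^{\delta_s}\,F_s^{\delta_{s-1}} \prod_{i=1}^{s-1} F_i^{\alpha_i - \beta_i - \gamma_i}$$

by the last of (2.4). Again

$$F_s^{\delta_{s-1}}\,F_{s-1}^{\alpha_{s-1} - \beta_{s-1} - \gamma_{s-1}} = [s]^{\delta_{s-1}}\,F_{s-1}^{\delta_{s-2}}.$$

Proceeding in this way we have finally

$$\frac{g(m_1 + m_2)}{g(m_1)\,g(m_2)} = [s+1]^{\delta_s}[s]^{\delta_{s-1}} \cdots [1]^{\delta_0}. \tag{2.5}$$

Thus we have not only proved that (2.1) is integral but have derived the explicit formula (2.5).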
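Formula (2.5) says that the quotient (2.1) is the product of the factors $[i+1]$ over those digit positions $i$ at which a carry $\delta_i = 1$ occurs when $m_1$ and $m_2$ are added in the base $p^n$. The Python sketch below checks this by brute force in the simplest setting $p = 3$, $n = 1$ (an assumption made to keep the field arithmetic to integers mod 3); the list-of-coefficients representation and all function names are illustrative.

```python
from functools import lru_cache

P = 3   # assumption for the demo: n = 1, so the field is GF(3)
Q = P   # Q = p**n, the base of the digit expansion

def mul(a, b):
    """Product of two polynomials over GF(P); coefficients listed from degree 0."""
    out = [0] * (len(a) + len(b) - 1)
    for i, ai in enumerate(a):
        if ai:
            for j, bj in enumerate(b):
                out[i + j] = (out[i + j] + ai * bj) % P
    return out

def power(a, e):
    """a**e by repeated multiplication (exponents here are small)."""
    out = [1]
    for _ in range(e):
        out = mul(out, a)
    return out

def bracket(i):
    """[i] = x**(Q**i) - x for i >= 1, as in (1.2)."""
    c = [0] * (Q ** i + 1)
    c[1] = P - 1
    c[Q ** i] = 1
    return c

@lru_cache(maxsize=None)
def F(i):
    """F_0 = 1 and F_i = [i] * F_{i-1}**Q, equivalent to the product in (1.2)."""
    return (1,) if i == 0 else tuple(mul(bracket(i), power(list(F(i - 1)), Q)))

def digits(m):
    """Base-Q digits of m, least significant first."""
    ds = []
    while m:
        ds.append(m % Q)
        m //= Q
    return ds

@lru_cache(maxsize=None)
def g(m):
    """g(m) = product of F_i**alpha_i over the base-Q digits alpha_i, as in (1.3)."""
    out = [1]
    for i, a in enumerate(digits(m)):
        out = mul(out, power(list(F(i)), a))
    return tuple(out)

def carries(m1, m2):
    """The carries delta_0, delta_1, ... of (2.4) when m1 + m2 is formed in base Q."""
    d1, d2 = digits(m1), digits(m2)
    n = max(len(d1), len(d2))
    d1 += [0] * (n - len(d1))
    d2 += [0] * (n - len(d2))
    deltas, carry = [], 0
    for b, c in zip(d1, d2):
        carry = (carry + b + c) // Q
        deltas.append(carry)
    return deltas

def check_2_5(m1, m2):
    """True when g(m1 + m2) = g(m1) g(m2) * product of [i+1]**delta_i."""
    rhs = mul(list(g(m1)), list(g(m2)))
    for i, d in enumerate(carries(m1, m2)):
        if d:
            rhs = mul(bracket(i + 1), rhs)
    return tuple(rhs) == g(m1 + m2)

all_ok = all(check_2_5(a, b) for a in range(27) for b in range(27))
print(all_ok)
```

The check covers every pair $0 \le m_1, m_2 < 27$; other primes can be tried by changing `P`, although degrees grow quickly.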

As an immediate corollary it is evident that for $m_1, \dots, m_k \ge 0$, the quotient

$$\frac{g(m_1 + \cdots + m_k)}{g(m_1) \cdots g(m_k)} \tag{2.6}$$

is integral.

If in (2.6) we take $m_1 = \cdots = m_k = m$, we see that $g(km)/g^k(m)$ is integral. For later purposes it will be necessary to know that this quotient is divisible by $g(k)$. But we may prove without difficulty the following slightly stronger result.

THEOREM 2. For $m \ge 1$, define $\mu = \mu(m)$ by means of

$$m = \alpha_0 + \alpha_1p^n + \cdots + \alpha_sp^{ns} \quad (0 \le \alpha_i < p^n), \qquad \mu = \alpha_0 + \alpha_1 + \cdots + \alpha_s. \tag{2.7}$$

Thus for $m \ge 1$, $\mu \ge 1$. Then the quotient

$$\frac{g(km)}{g^k(m)\,g^\mu(k)}$$

is integral.⁵


5 Compare Bachmann, Niedere Zahlentheorie.








It will be convenient to take first the special case $m = p^{ni}$, $k = p^{nj}$, so that $\mu = 1$. We shall require the

LEMMA. For all $i, j \ge 0$, the quotient

$$(i, j) = \frac{F_{i+j}}{F_i^{p^{nj}}F_j} = \frac{[i+j][i+j-1]^{p^n} \cdots [i+1]^{p^{n(j-1)}}}{F_j} \tag{2.8}$$

is integral.

This is easily proved by induction. From (2.8) and the identity $[i+j] = [i]^{p^{nj}} + [j]$ follows

$$(i, j) = \frac{[i+j]\,F_{i+j-1}^{p^n}}{F_i^{p^{nj}}F_j} = (i, j-1)^{p^n} + F_j^{p^n - 1}\,(i-1, j)^{p^n}.$$

Since $(i, 0) = 1 = (0, j)$ it is evident from this recursion formula that $(i, j)$ is integral for arbitrary non-negative $i$, $j$.

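Returning to the general theorem, we now suppose $m$ arbitrary but to begin with again take $k = p^{nj}$. From (2.7) and the definition of $g(m)$ it follows easily that

$$\frac{g(p^nm)}{g^{p^n}(m)} = [s+1]^{\alpha_s}[s]^{\alpha_{s-1}} \cdots [1]^{\alpha_0}.$$

Replace $m$ by $p^nm$ and this becomes

$$\frac{g(p^{2n}m)}{g^{p^n}(p^nm)} = [s+2]^{\alpha_s}[s+1]^{\alpha_{s-1}} \cdots [2]^{\alpha_0};$$

combining the last two equations we have

$$\frac{g(p^{2n}m)}{g^{p^{2n}}(m)} = \big([s+2][s+1]^{p^n}\big)^{\alpha_s} \cdots \big([2][1]^{p^n}\big)^{\alpha_0}.$$

Continuing in this way we see that

$$\frac{g(p^{nj}m)}{g^{p^{nj}}(m)} = \prod_{i=0}^{s} \big([i+j][i+j-1]^{p^n} \cdots [i+1]^{p^{n(j-1)}}\big)^{\alpha_i}. \tag{2.9}$$

If we compare this with (2.8), it is clear that the right member is a multiple of

$$F_j^{\alpha_0 + \cdots + \alpha_s} = F_j^\mu,$$

so that the theorem is proved for the special case $k = p^{nj}$.

In the next place, for $\beta \ge 1$, it follows that

$$F_j^{\beta\mu} \;\Big|\; \frac{g^\beta(p^{nj}m)}{g^{\beta p^{nj}}(m)}. \tag{2.10}$$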









ANALOGUE OF VON STAUDT-CLAUSEN THEOREM 507 


But by Theorem 1, g°(p"’m) | g(8p”’m), so that (2.10) implies 


g(Bp™ m) 
- ae (m ) ’ 


thus the theorem is proved for k = Bp”, where 0 < 8 < p”. 

Now take 
(2.12) k = ap" + Bp” (0 <a,8 < p’;i ¥j). 
Then by (2.11) we have 


(2.11) Fr 


poe poe g(ap™ m) g(Bp™ m) _ 
ii. ge" (m) gr?’ (m) ’ 
but by (2.12) and (1.3) 
g(ap"' + Bp") = FF, 

and by Theorem 1, 

g(ap"'m)g(Bp"'m) | g{(ap" + Bp™)m}, 
so that 
g(km) 
g*(m) 


for k as in (2.12). Proceeding in this way we see that Theorem 2 is true 


q’(k) 


generally. 
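
As a numerical illustration of Theorem 2, the divisibility can be tested directly for small $k$ and $m$. The sketch below again assumes $n = 1$ and takes $g(m) = \prod F_i^{a_i}$ for the digits $a_i$ of $m$ in the scale of $p$, as in (2.7). All function names are hypothetical helpers introduced only for this check.

```python
# Check that g(km) is divisible by g(m)^k * g(k)^mu over GF(p), with n = 1.
# Polynomials are coefficient lists, lowest degree first.

def trim(a):
    while a and a[-1] == 0:
        a.pop()
    return a

def mul(a, b, p):
    if not a or not b:
        return []
    out = [0] * (len(a) + len(b) - 1)
    for i, x in enumerate(a):
        if x:
            for j, y in enumerate(b):
                out[i + j] = (out[i + j] + x * y) % p
    return trim(out)

def power(a, e, p):
    result = [1]
    for _ in range(e):
        result = mul(result, a, p)
    return result

def remainder(a, b, p):
    """Remainder of a on division by b (b nonzero) over GF(p)."""
    a = a[:]
    inv = pow(b[-1], -1, p)
    while len(a) >= len(b):
        shift = len(a) - len(b)
        c = a[-1] * inv % p
        for i, y in enumerate(b):
            a[shift + i] = (a[shift + i] - c * y) % p
        trim(a)
    return a

def F_list(top, p):
    """[F_0, ..., F_top] with F_k = (x^(p^k) - x) * F_{k-1}^p."""
    Fs = [[1]]
    for k in range(1, top + 1):
        br = [0] * (p**k + 1)
        br[1], br[-1] = (-1) % p, 1
        Fs.append(mul(br, power(Fs[-1], p, p), p))
    return Fs

def digits(m, p):
    out = []
    while m:
        out.append(m % p)
        m //= p
    return out

def g(m, p, Fs):
    result = [1]
    for i, a in enumerate(digits(m, p)):
        result = mul(result, power(Fs[i], a, p), p)
    return result

def theorem2_remainder(k, m, p):
    """Remainder of g(km) modulo g(m)^k * g(k)^mu; empty when the quotient is integral."""
    Fs = F_list(len(digits(k * m, p)), p)
    mu = sum(digits(m, p))
    den = mul(power(g(m, p, Fs), k, p), power(g(k, p, Fs), mu, p), p)
    return remainder(g(k * m, p, Fs), den, p)
```

Such a check covers only finitely many cases and is of course no substitute for the proof above.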


3. H-series. If in the series

\[ \sum_{m=0}^{\infty} \frac{A_m t^m}{g(m)} \tag{3.1} \]

the coefficients $A_m$ are integral, we shall call (3.1) an H-series. It follows at once from the definition that if $S$ and $S'$ are H-series, then $AS + A'S'$ is also an H-series, where $A$ and $A'$ are any two polynomials in $x$ alone. As for the product of two H-series, if $A_m$, $A_m'$ are the coefficients of $S$ and $S'$ respectively, and $C_m$ denotes the general coefficient in $SS'$, we evidently have

\[ C_m = \sum_{i+j=m} \frac{g(m)}{g(i)\,g(j)}\, A_i A_j'. \]

By Theorem 1, the $g$-quotients on the right are integral so that $C_m$ is integral. Therefore the product of two H-series is itself an H-series, and generally for the product of any number of series.
Consider next the reciprocal of $S$. In general this is not an H-series. If however $A_0 = 1$ (or any non-zero element of $GF(p^n)$) then the reciprocal is also an H-series. Thus for $A_0 = 1$, put $S = 1 + S_1$; then

\[ \frac{1}{S} = \frac{1}{1 + S_1} = 1 - S_1 + S_1^2 - \cdots, \]


“Compare Hurwitz, loc. cit. 










and it is clear from the above that the right member is an H-series. Finally it 
follows that in this case S’/S is also an H-series. 
Of special interest is the series

\[ S_1 = \sum_{m=1}^{\infty} \frac{A_m t^m}{g(m)}, \tag{3.2} \]

which has no constant term. We have seen above that for the $k$-th power,

\[ S_1^k = \sum_{m=k}^{\infty} \frac{C_m t^m}{g(m)}, \]

$C_m$ is integral. We shall now show that $C_m$ is a multiple of $g(k)$, or what amounts to the same thing, we prove

THEOREM 3. If $S_1$ is an H-series without constant term, then $S_1^k/g(k)$ is also an H-series.

Assume first $k = p^{nj}$. Then by (3.2),

\[ S_1^{p^{nj}} = \sum \frac{A_m^{p^{nj}}\, t^{p^{nj} m}}{g(p^{nj} m)} \cdot \frac{g(p^{nj} m)}{g^{p^{nj}}(m)}. \]

But by Theorem 2, the second fraction on the right is divisible by $g(p^{nj}) = F_j$. If then we put

\[ S_1^{p^{nj}} = \sum \frac{C_m t^m}{g(m)}, \tag{3.3} \]

we have $F_j \mid C_m$. Squaring both sides of (3.3), we write

\[ S_1^{2p^{nj}} = \sum \frac{C_m' t^m}{g(m)}, \qquad \text{where } C_m' = \sum_{e+f=m} \frac{g(m)}{g(e)\,g(f)}\, C_e C_f. \]

Thus it is clear that $F_j^2 \mid C_m'$, in other words $S_1^{2p^{nj}}/F_j^2$ is an H-series. Similarly for $S_1^{\alpha p^{nj}}/F_j^{\alpha}$. In other words, $S_1^k/g(k)$ is an H-series for $k = \alpha p^{nj}$, $0 < \alpha < p^n$.

Suppose next that

\[ k = \alpha p^{ni} + \beta p^{nj}, \qquad 0 < \alpha, \beta < p^n,\ i \ne j. \tag{3.4} \]

Put

\[ S_1^{\alpha p^{ni}} = \sum \frac{A_m t^m}{g(m)}, \qquad S_1^{\beta p^{nj}} = \sum \frac{A_m' t^m}{g(m)}, \]

so that

\[ F_i^{\alpha} \mid A_m, \qquad F_j^{\beta} \mid A_m'. \tag{3.5} \]

Then for

\[ S_1^k = \sum \frac{C_m t^m}{g(m)}, \qquad C_m = \sum_{e+f=m} \frac{g(m)}{g(e)\,g(f)}\, A_e A_f', \]

so that by (3.5)

\[ F_i^{\alpha} F_j^{\beta} \mid C_m. \]

But by (3.4)

\[ F_i^{\alpha} F_j^{\beta} = g(\alpha p^{ni} + \beta p^{nj}) = g(k), \]

so that the theorem is proved in this case also. It is now clear how the theorem may be proved for general $k$.

As a corollary of some interest, we state: if an H-series without constant term be substituted for $t$ in the H-series (3.1) and the result written as a series in $t$, then this series is also an H-series.


4. Some theorems on $B_m$. If we define $A_m^{(k)}$ by means of

\[ \psi^{p^{nk}-1}(t) = \sum_{m = p^{nk}-1}^{\infty} \frac{A_m^{(k)} t^m}{g(m)}, \]

then we have the formula

\[ B_m = \sum_{k} \frac{A_m^{(k)}}{L_k}, \tag{4.1} \]

where

\[ L_k = [k][k-1] \cdots [1], \qquad L_0 = 1. \tag{4.2} \]

Now $A_m^{(k)}$ is integral and by Theorem 3 is a multiple of $g(p^{nk} - 1)$. But by (1.3) and (4.2),

\[ \frac{g(p^{nk}-1)}{L_k} = \frac{(F_1 \cdots F_{k-1})^{p^n - 1}}{L_k} = \frac{1}{[k]}\,[k-1]^{p^n - 2}\,[k-2]^{p^{2n} - 2} \cdots [1]^{p^{n(k-1)} - 2}. \tag{4.3} \]

If we recall that $[k]$ is the product of the irreducible polynomials whose degree divides $k$, it is clear that if the left member of (4.3) be reduced to its lowest terms, then the factors of the denominator are simple. Further, except for the case $p^n = 2 = k$, the irreducible factors are all of degree $k$. This proves the following

THEOREM 4. If $B_m = N_m/D_m$, where $N_m$ and $D_m$ are relatively prime, then $D_m$ has only simple factors.

In the next place from the identity

\[ x\psi(t) - \psi(xt) = \psi^{p^n}(t) \tag{4.4} \]

follows


at t wo-va) wd  w'* yon 


vit) yt) vat) ~— vat)  r-y""@~ 


where, by the discussion of §3, the coefficients $C_m$ are clearly integral. Hence by (1.4) and the last theorem, it follows that the product $x(x^m - 1)B_m$ is integral.


(4.5) 


* DJ, p. 158, formula (8.11). 
* DJ, p. 150, formula (5.09). 










But this result may be extended considerably. If $G$ is an arbitrary polynomial of degree $s$, say, then in place of (4.4) we have the more general formula

\[ G\psi(t) - \psi(Gt) = \sum_{j=1}^{s} A_j\, \psi^{p^{nj}}(t), \tag{4.6} \]

where the $A_j$ are integral; indeed

\[ A_j = \frac{(-1)^{j-1}}{F_j}\, \psi_j(G), \qquad \psi_j(u) = \sum_{i=0}^{j} (-1)^{j-i} \frac{F_j}{F_i\, L_{j-i}^{p^{ni}}}\, u^{p^{ni}}. \]

Thus (4.5) becomes

\[ \frac{Gt}{\psi(Gt)} - \frac{t}{\psi(t)} = \frac{t\,\{G\psi(t) - \psi(Gt)\}}{\psi(Gt)\,\psi(t)} = \frac{t \sum A_j\, \psi^{p^{nj}-2}(t)}{G - \sum A_j\, \psi^{p^{nj}-1}(t)}. \]

Now both numerator and denominator on the right are H-series, and it is easily seen that if we replace $t$ by $Gt$ the quotient becomes an H-series. Thus applying Theorem 4, we have

THEOREM 5. For $G$ an arbitrary polynomial, the product $G(G^m - 1)B_m$ is integral.

Assume now in the notation of Theorem 4 that $P \mid D_m$, where $P$ is an irreducible polynomial of degree $k$. Since in the theorem just proved, $G$ is quite arbitrary we may take it equal to a primitive root (mod $P$). Now by the theorem,

\[ G^m \equiv 1 \pmod P, \]

and therefore because of the nature of $G$, $m$ must be a multiple of $p^{nk} - 1$. This proves

THEOREM 6. If $P$ is an irreducible divisor of the denominator of $B_m$, then $p^{nk} - 1$ divides $m$, where $k$ is the degree of $P$.

If we return to the definition of $A_m^{(k)}$ and make use of (4.1) and (4.3), it is now evident that for $p^{nk} - 1$ not a divisor of $m$, $A_m^{(k)}$ is a multiple of $P$. Therefore in determining the fractional part of $B_m$ it is necessary to retain in the right member of (4.1) only those terms for which $p^{nk} - 1 \mid m$. We may now state the following

THEOREM 7. For $P$ irreducible of degree $k$, we have the congruence¹⁰

\[ \psi^{p^{nk}-1}(t) \equiv \sum_{p^{nk}-1 \mid m} \frac{A_m^{(k)} t^m}{g(m)} \pmod P, \tag{4.7} \]

the summation extending over multiples of $p^{nk} - 1$ only. The formula (4.1) reduces to

\[ B_m = G_m + \sum_{k} \frac{A_m^{(k)}}{L_k}, \tag{4.8} \]


* DJ, p. 151, formula (5.11). 


¹⁰ The statement $\sum A_m t^m/g(m) \equiv \sum A_m' t^m/g(m) \pmod P$ is short for the infinite system of congruences $A_m \equiv A_m' \pmod P$.













where $G_m$ is integral,¹¹ and the summation extends over such $k$ for which $p^{nk} - 1$ divides $m$.


5. A lemma on $\psi^{p^{nk}-1}$. We now prove the following theorem, which is the most important point in the proof of the main theorem concerning $B_m$.

THEOREM 8. For $P$ irreducible of degree $k$,

\[ \psi^{p^{nk}-1}(t) \equiv \Bigl( \sum_{i=0}^{\infty} \frac{(-1)^{ki}\, t^{p^{nki}}}{F_{ki}} \Bigr)^{p^{nk}-1} \pmod P. \tag{5.1} \]

In other words, in forming the $(p^{nk} - 1)$-th power of $\psi(t)$ (mod $P$) we may ignore those terms in $t^{p^{ni}}$ for which $i$ is not a multiple of $k$. To prove (5.1), we remark first that by Theorem 3,

\[ \psi^{p^{nk}}(t) \equiv 0 \pmod P. \]


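Combining this with (4.7), we have the congruence

\[ \sum_{p^{nk}-1 \mid m} \frac{A_m^{(k)} t^m}{g(m)} \cdot \sum_{i=0}^{\infty} \frac{(-1)^i\, t^{p^{ni}}}{F_i} \equiv 0 \pmod P. \]

Picking out the coefficient of $t^m$ on the left, we get

\[ \sum_{i} (-1)^i \frac{g(m)}{F_i\, g(m - p^{ni})}\, A^{(k)}_{m - p^{ni}} \equiv 0. \tag{5.2} \]

We suppose hereafter that $m - 1$ is a multiple of $p^{nk} - 1$. But by Theorem 7,

\[ A^{(k)}_{m - p^{ni}} \equiv 0 \quad \text{for } p^{nk} - 1 \nmid m - p^{ni}, \]

that is, for $i$ not a multiple of $k$. Therefore (5.2) becomes

\[ \sum_{i} (-1)^{ki} \frac{g(m)}{F_{ki}\, g(m - p^{nki})}\, A^{(k)}_{m - p^{nki}} \equiv 0. \tag{5.3} \]

Now if $p^{nk} \nmid m$ it is easily verified that the quotient of $g(m)$ by $g(m - 1)$ is $\not\equiv 0$ (mod $P$). Indeed if $p^{ne} \mid m$, $p^{n(e+1)} \nmid m$, the quotient $= L_e$, as defined in (4.2). Hence for $p^{nk} \nmid m$, (5.3) becomes

\[ A^{(k)}_{m-1} \equiv -\frac{1}{L_e} \sum_{i} (-1)^{ki} \frac{g(m)}{F_{ki}\, g(m - p^{nki})}\, A^{(k)}_{m - p^{nki}}, \tag{5.4} \]

the summation extending over $i > 0$ only.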
We next recall a result proved elsewhere:¹²

\[ A^{(k)}_{p^{nk} m - 1} = 0 \quad \text{for } m > 1, \tag{5.5} \]

which we shall apply to (5.3) in order to show that $A_m^{(k)} \equiv 0$ (mod $P$) for a


¹¹ $G_m$ will generally denote a polynomial depending on the index $m$ and not necessarily the same in all formulas in which it occurs.
¹² DJ, p. 158, Theorem 8.3.










certain set of values of $m$. First we make a slight change in notation. In (5.4) replace $m$ by $m + 1$, so that

\[ p^{nk} - 1 \mid m, \qquad p^{nk} \nmid m + 1. \tag{5.6} \]

Assume now that (5.6) holds but in addition $p^{nk} \mid m + 2$. Then clearly (5.4) shows that $A_m^{(k)}$ is a sum of terms of the type

\[ A^{(k)}_{m + 1 - p^{nki}} \qquad (i > 0); \]

but according to (5.5), this vanishes unless, for some $i$, $m + 1 - p^{nki} = p^{nk} - 1$, that is, unless

\[ m = p^{nki} + (p^{nk} - 2) \cdot 1. \tag{5.7} \]

In the same way if

\[ p^{nk} \nmid m + 1, \qquad p^{nk} \nmid m + 2, \qquad p^{nk} \mid m + 3 \]

(so that $p^{nk} > 2$), then two applications of (5.4) lead to a linear homogeneous expression for $A_m^{(k)}$ in terms of the type

\[ A^{(k)}_{m + 2 - p^{nki} - p^{nkj}} \qquad (i, j > 0), \]

which vanishes unless $m + 2 - p^{nki} - p^{nkj} = p^{nk} - 1$, that is, unless

\[ m = p^{nki} + p^{nkj} + (p^{nk} - 3) \cdot 1 \qquad (p^{nk} > 2). \tag{5.8} \]

Now in both (5.7) and (5.8) $m$ is expressed as a sum of $p^{nk} - 1$ terms $p^{nki}$. We shall now show generally that unless

\[ m = p^{nki_1} + \cdots + p^{nki_r}, \qquad r = p^{nk} - 1, \tag{5.9} \]

$A_m^{(k)} \equiv 0$ (mod $P$). For suppose

\[ p^{nk} \nmid m + j \quad (j = 1, \cdots, t), \qquad p^{nk} \mid m + t + 1 \quad (t < p^{nk}). \tag{5.10} \]

Apply (5.3) $t$ times and $A_m^{(k)}$ is exhibited as a sum of terms

\[ A_w^{(k)}, \qquad w = m + t - p^{nki_1} - \cdots - p^{nki_t}. \]

Since by the second of (5.10) $p^{nk} \mid w + 1$, it follows from (5.5) that $A_w^{(k)} = 0$ unless $w = p^{nk} - 1$, that is, unless

\[ m = p^{nki_1} + \cdots + p^{nki_t} + (p^{nk} - t - 1) \cdot 1, \]

which is precisely the condition (5.9).
It is now easy to establish the congruence (5.1). If we expand $\psi^{p^{nk}-1}$ directly, it is clear that a term involving $t^m$ occurs when

\[ m = p^{ni_1} + \cdots + p^{ni_r}, \qquad r = p^{nk} - 1. \tag{5.11} \]

But by the result just proved the sum of such terms for fixed $m$ will be $\equiv 0$ (mod $P$) unless $m$ is of the form (5.9). But from this it follows almost immediately that each $i$ in (5.11) is a multiple of $k$. Hence in forming $\psi^{p^{nk}-1}$ it is necessary to use only the terms in $t^{p^{nki}}$, but as remarked at the beginning of this section, this is equivalent to (5.1). We have therefore established Theorem 8.

As we shall see in the next section it is now possible to evaluate $A_m^{(k)}$. In proving the von Staudt-Clausen theorem for the ordinary Bernoulli numbers, it is possible to make use of certain explicit formulas. Their analogues are not available, and therefore some such method as used here seems necessary.


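6. Further lemmas on $\psi^{p^{nk}-1}$. We now consider

\[ \Bigl( \sum_{i=0}^{\infty} \frac{(-1)^{ki}\, t^{p^{nki}}}{F_{ki}} \Bigr)^{p^{nk}-1}. \tag{6.1} \]

Rewrite (5.7) in the form

\[ m = a_0 + a_1 p^{nk} + \cdots + a_s p^{nks} \quad (a_i \ge 0), \qquad p^{nk} - 1 = a_0 + a_1 + \cdots + a_s. \tag{6.2} \]

Then it is clear that (6.1) becomes

\[ \sum_{m} (-1)^{k(a_1 + 2a_2 + \cdots + s a_s)}\, \frac{(p^{nk} - 1)!}{a_0!\, a_1! \cdots a_s!} \cdot \frac{t^m}{F_k^{a_1} F_{2k}^{a_2} \cdots F_{sk}^{a_s}}, \]

the summation extending over all $m$ for which (6.2) is solvable. Making use of (5.1) we see that

\[ A_m^{(k)} \equiv (-1)^{k(a_1 + 2a_2 + \cdots + s a_s)}\, \frac{(p^{nk} - 1)!}{a_0! \cdots a_s!} \cdot \frac{g(m)}{F_k^{a_1} \cdots F_{sk}^{a_s}}, \tag{6.3} \]

where again $m$ is of the form (6.2); for other $m$, $A_m^{(k)} \equiv 0$. All the congruences are (mod $P$), where as above $P$ is irreducible of degree $k$. To determine when the multinomial coefficient in the right member of (6.3) is different from zero (that is, not a multiple of $p$) we use a theorem of Dickson's:¹³ If the coefficients $a_i$ in (6.2) are of the form

\[ a_i = \sum_{j=0}^{nk-1} a_{ij}\, p^j \qquad (0 \le a_{ij} \le p - 1), \tag{6.4} \]

then

\[ (a_0, \cdots, a_s) = \frac{(p^{nk} - 1)!}{a_0! \cdots a_s!} \]

is prime to $p$ if and only if

\[ \sum_{i=0}^{s} a_{ij} = p - 1 \qquad (j = 0, \cdots, nk - 1). \tag{6.5} \]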


8 Annals of Mathematics, (1), vol. 11 (1896-97), pp. 75-76; Quarterly Journal of Mathe- 
matics, vol. 33 (1902), pp. 378-384, 











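If (6.5) is satisfied, then

\[ (a_0, \cdots, a_s) \equiv \prod_{j=0}^{nk-1} \frac{(p-1)!}{a_{0j}! \cdots a_{sj}!} \equiv \frac{(-1)^{nk}}{\prod_{i,j} a_{ij}!} \pmod p. \tag{6.6} \]

We may therefore assume that $m$ satisfies both (6.2) and (6.5).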
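
The criterion (6.5) and the congruence (6.6) are easy to test exhaustively for small parameters. The sketch below enumerates every splitting of $N = p^{w} - 1$ into a fixed number of non-negative parts, where $w$ plays the role of $nk$. It compares the multinomial coefficient modulo $p$ with the digit condition and with the product of digit-wise multinomials. The function names are illustrative only.

```python
from math import factorial

def digits(a, p, width):
    """The first `width` digits of a in the scale of p, lowest first."""
    return [(a // p**j) % p for j in range(width)]

def multinomial(parts):
    total = factorial(sum(parts))
    for a in parts:
        total //= factorial(a)
    return total

def digit_columns(parts, p, width):
    """Column j holds the j-th digits a_{0j}, ..., a_{sj}."""
    return list(zip(*(digits(a, p, width) for a in parts)))

def digit_condition(parts, p, width):
    """Condition (6.5): every digit column sums to p - 1."""
    return all(sum(col) == p - 1 for col in digit_columns(parts, p, width))

def digitwise_product(parts, p, width):
    """Product over j of (p-1)!/(a_{0j}! ... a_{sj}!), reduced mod p.

    Only meaningful when digit_condition holds."""
    value = 1
    for col in digit_columns(parts, p, width):
        value *= multinomial(col)
    return value % p

def compositions(total, count):
    """All tuples of `count` non-negative integers with the given sum."""
    if count == 1:
        yield (total,)
        return
    for first in range(total + 1):
        for rest in compositions(total - first, count - 1):
            yield (first,) + rest
```

For instance, with $p = 3$ and $w = 2$ the parts $(2, 6)$ have digit columns $(2, 0)$ and $(0, 2)$, so (6.5) holds, and the coefficient $8!/(2!\,6!) = 28$ is indeed prime to 3.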
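It is now easy to evaluate the $g$-quotient in (6.3). Put

\[ m = \beta_0 + \beta_1 p^n + \beta_2 p^{2n} + \cdots \qquad (0 \le \beta_i < p^n). \tag{6.7} \]

Comparison with (6.2) gives

\[ a_i = \sum_{j=0}^{k-1} \beta_{ki+j}\, p^{nj} \qquad (i = 0, \cdots, s). \tag{6.8} \]

Comparison with (6.4) gives

\[ \beta_{ki+j} = \sum_{h=0}^{n-1} a_{i,\,nj+h}\, p^h. \tag{6.9} \]

Now by (6.7), $g(m) = F_1^{\beta_1} F_2^{\beta_2} \cdots$, from which follows by use of (6.8),

\[ \frac{g(m)}{F_k^{a_1} \cdots F_{sk}^{a_s}} = \prod_{i=0}^{s} \prod_{j=1}^{k-1} \left( \frac{F_{ki+j}}{F_{ki}^{p^{nj}}} \right)^{\beta_{ki+j}}, \]

so that the $g$-quotient

\[ \equiv (F_1 F_2 \cdots F_{k-1})^{p^n - 1} \equiv (-1)^{k-1} \tag{6.10} \]

by Wilson's Theorem and the fact¹⁴ that $F_k$ is the product of the primary polynomials of degree $k$. Therefore by (6.3) and (6.6) we conclude that for $m$ satisfying both (6.2) and (6.5), we have

\[ A_m^{(k)} \equiv \frac{(-1)^{k(a_1 + 2a_2 + \cdots + s a_s) + nk + k - 1}}{\prod a_{ij}!} \pmod P; \tag{6.11} \]

in all other cases $A_m^{(k)} \equiv 0$.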
Let $m$ be fixed; we shall now show that at most one value of $k$ can be found for which (6.2) and (6.5) are simultaneously satisfied. For assume the relations

\[ m = \gamma_0 + \gamma_1 p^{nl} + \cdots + \gamma_t p^{nlt}, \qquad p^{nl} - 1 = \gamma_0 + \gamma_1 + \cdots + \gamma_t, \]


Mulletir the American Mathematical Society, vol. 38% (1932), pp. 756-714; also DJ, 

















\[ \gamma_i = \sum_{j=0}^{nl-1} \gamma_{ij}\, p^j \quad (0 \le \gamma_{ij} \le p - 1), \qquad \sum_{i=0}^{t} \gamma_{ij} = p - 1. \tag{6.12} \]

Then from (6.2) and (6.5) follows

\[ \sum_{i=0}^{s} \sum_{j=0}^{nk-1} a_{ij} = (p - 1)nk; \]

on the other hand from (6.12) follows

\[ \sum_{i=0}^{t} \sum_{j=0}^{nl-1} \gamma_{ij} = (p - 1)nl. \]

But clearly the $a_{ij}$ and $\gamma_{ij}$ coincide (except for numbering), and therefore $k = l$, as asserted above.

We remark that there may be no value of $k$ for which (6.2) and (6.5) hold. In this event $B_m$ has no fractional part.


7. The main theorem. We return to (4.1). In view of the last result in §6, (4.1) becomes

\[ B_m = G_m + \frac{1}{L_k}\, A_m^{(k)} \qquad (p^n \ne 2) \tag{7.1} \]

provided $k$ exists satisfying both (6.2) and (6.5); otherwise (7.1) is simply $B_m = G_m$, so that $B_m$ is integral and nothing further need be said. Assuming then that a $k$ exists, we make use of (4.3). If we exclude for the moment the case $p^n = 2$, it follows that the irreducible divisors of the denominator of $B_m$ are all of degree $k$. Now it is easy to show (see the remark immediately following (4.3)) that

\[ \frac{1}{[k]} = -\sum_{\deg P \mid k} \frac{P'}{P}, \]

where the summation extends over all irreducible $P$ of degree a divisor of $k$, and $P'$ denotes the derivative of $P$. By (4.3) we have

\[ \frac{1}{L_k}\, A_m^{(k)} = G_m - \frac{A_m^{(k)}}{L_{k-1}} \sum_{\deg P = k} \frac{P'}{P}, \tag{7.2} \]

the summation now extending over irreducibles of degree $k$ only. But¹⁵

\[ (-1)^{k-1} L_{k-1} \equiv P' \pmod P, \]

so that (7.2) becomes

\[ \frac{1}{L_k}\, A_m^{(k)} = G_m - (-1)^{k-1} A_m^{(k)} \sum_{\deg P = k} \frac{1}{P}. \]

"DJ, p. 166, formula (EE tO) 













Finally, making use of (6.11) and substituting in (7.1), we have

THEOREM 9 ($p^n \ne 2$). For given $m$, the system

\[ m = \sum_{i=0}^{s} a_i\, p^{nki}, \quad p^{nk} - 1 = \sum_{i=0}^{s} a_i, \quad a_i = \sum_{j=0}^{nk-1} a_{ij}\, p^j, \quad p - 1 = \sum_{i=0}^{s} a_{ij}, \quad a_{ij} \ge 0, \tag{7.3} \]

is either (i) inconsistent, or (ii) consistent for a single value of $k$, in which case the $a_i$, $a_{ij}$ are uniquely determined. In case (i) $B_m$ is integral; in case (ii) we have

\[ B_m = G_m - \frac{(-1)^{k(a_1 + 2a_2 + \cdots + s a_s) + nk}}{\prod a_{ij}!} \sum_{\deg P = k} \frac{1}{P}. \tag{7.4} \]

Here $G_m$ is integral and the summation is over all irreducible $P$ of degree $k$.

It remains to consider the excluded case p" = 2. We may no longer conclude 
from (4.3) that the denominator of B,, contains only irreducibles of degree k; 
polynomials of the first degree may also occur. To decide when this happens 


fa ( eh4+54 e+ at ae ) 
rere ss a 


we examine 


. 2) : . : - 3 
Clearly A,,’ is different from zero only when m is of the form 2“ + 2° > 2. 
Since in this case 


FF; for a # 8B, 
Post for a = B, 


q(m) = 


it is evident that for a # 8, 
fF. g l 
FF; 1 


AS» = g(m) ¢ k 
= [8] + la] = 2” +2’, 


while for a = 8, 


la + Ifa]. 


; ] 
= ON mee 
q\m) FF : 


Thus for @ 8, it is clear that AS’ is a multiple of Le. 

For a # 8, there are several possibilities. Ifa > B > 0, then Af? is divisible 
by (2° + z)*; but if a > 6 = 0, then Af’ is divisible by 2° + 2 only and the 
* 4. 7) is congruent tol (mod z’ +2). Again fora = B (mod 2), 


/ 


‘ 2) 
quotient A,,’/(z 


As is divisible by 2° + 2 + 1, while for a # B (mod 2), ALY? 1 (mod 
g+2+41). We now note that if the system (7.3) is satisfied for p" = 2 = k, 
then it follows at once that m 4° + 2-4’: in other words, this is the case 


a # B (mod 2). Also it is easily seen that any other value of & is inconsistent 






















with m = 2* + 2° (a > B). Hence we have the following supplement to 
Theorem 9: 
THEOREM 10 (p" = 2). If the system (7.3) is consistent for k # 2, then 


(7.5) ho@ > Bo: 
deg P=k I 
if k = 2, then for m even, 
(7.6) _— 2 
r+ert+l 
while for m odd, 
1 l ] l 
7.7) By = Gu = Gyn + - ; 
0.3 * See TS oat’ #aeel 
If (7.3) ts consistent for no value of k, then for m even, B,, = G» , while for m odd, 
ee eS pe 
ee ae 


The following remark may be useful in testing (7.3). We assume $m$ fixed; pick a $k$ such that $p^{nk} - 1 \mid m$. Then (because of the condition $0 \le a_i < p^{nk}$) the $a_i$ are uniquely determined. If their sum is not $p^{nk} - 1$, we go no further. If however the sum is equal to $p^{nk} - 1$, we use the equation $a_i = \sum_j a_{ij}\, p^j$ to determine the $a_{ij}$ (because of the condition $0 \le a_{ij} < p$, the determination is unique). It is then only necessary to check the system of equations

\[ p - 1 = \sum_{i} a_{ij}. \]
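
The testing procedure just described is mechanical, and a minimal sketch of it follows. It assumes the reading of (7.3) in which the $a_i$ are the digits of $m$ in the scale of $p^{nk}$ and the $a_{ij}$ are the digits of $a_i$ in the scale of $p$. The function name `consistent_ks` is hypothetical.

```python
def consistent_ks(m, p, n=1):
    """Return every pair (k, [a_0, ..., a_s]) for which the system (7.3) is consistent.

    m is a positive integer, p a prime, and p**n the order of the field."""
    found = []
    k = 1
    while p**(n * k) - 1 <= m:
        Q = p**(n * k)
        if m % (Q - 1) == 0:
            # digits a_i of m in the scale of p^(nk), lowest first
            a, rest = [], m
            while rest:
                a.append(rest % Q)
                rest //= Q
            if sum(a) == Q - 1:
                # column j collects the j-th digit (scale of p) of every a_i
                columns = [sum((ai // p**j) % p for ai in a) for j in range(n * k)]
                if all(c == p - 1 for c in columns):
                    found.append((k, a))
        k += 1
    return found
```

By the uniqueness argument of §6 the returned list has at most one entry. For example `consistent_ks(6, 2)` gives $k = 2$ with $a_0 = 2$, $a_1 = 1$, in agreement with the form $m = 4^\alpha + 2 \cdot 4^\beta$ noted above for $p^n = 2 = k$.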
‘ " 
A partial check on Theorems 9 and 10 is furnished by the case $m = p^{nk} - 1$, for here a simple explicit formula¹⁶ is available for $B_m$:

\[ B_m = \frac{g(m)}{L_k} = \frac{(F_1 \cdots F_{k-1})^{p^n - 1}}{L_k}. \tag{7.8} \]

For $p^n = 2 = k$, this reduces to

\[ B_3 = \frac{1}{[2]} = \frac{1}{x} + \frac{1}{x+1} + \frac{1}{x^2 + x + 1}, \]

which agrees with (7.7). In all other cases the irreducible divisors of the denominator of $B_m$ are of degree $k$, and it is evident that (7.8) is in agreement with (7.5). For this value of $m$, it is clear that $s = 0$, $a_0 = p^{nk} - 1$, $a_{0j} = p - 1$, $A_m^{(k)} = g(m)$.


DUKE UNIVERSITY


’ DJ, p. 159, formula (9.02) 








SUMS OF VALUES OF A POLYNOMIAL MULTIPLIED BY CONSTANTS 


By KENNETH S. GHENT 
1. Introduction. We seek conditions on the integer $s$, on the sets of positive integers $(a_1, \cdots, a_s)$ and on the coefficients of a polynomial $P(x)$ for which the Diophantine equation

\[ n = \sum_{\nu=1}^{s} a_\nu P(h_\nu) \tag{1} \]

is solvable in integers $h_\nu \ge 0$ for every integer $n$ sufficiently large. By $n$ sufficiently large we mean that $n$ is greater than an existing constant $b$, which depends only on $s$, $a_1, \cdots, a_s$ and on the degree $k$ and the coefficients of the polynomial $P(x)$. We consider two cases of the polynomial $P(x)$:

\[ P(x) = a(x^3 - x)/6 + b(x^2 - x)/2 + cx + d, \tag{2} \]

where $a > 0$ and $a$, $b$, $c$ and $d$ are integers;¹

\[ P(x) = ax(x+1)(x+2)(x+3)/24 + bx(x+1)(x+2)/6 + cx(x+1)/2 + dx + e, \tag{3} \]

where $a > 0$ and $a$, $b$, $c$, $d$ and $e$ are integers.²
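
That polynomials of the forms (2) and (3) take integral values at every integer can be illustrated numerically. The sketch below evaluates both forms with exact rational arithmetic. The function names and the sample coefficients in the usage lines are arbitrary choices for illustration and are not taken from the paper.

```python
from fractions import Fraction

def cubic(x, a, b, c, d):
    """Form (2): a(x^3 - x)/6 + b(x^2 - x)/2 + cx + d."""
    return Fraction(a * (x**3 - x), 6) + Fraction(b * (x**2 - x), 2) + c * x + d

def quartic(x, a, b, c, d, e):
    """Form (3): a x(x+1)(x+2)(x+3)/24 + b x(x+1)(x+2)/6 + c x(x+1)/2 + dx + e."""
    return (Fraction(a * x * (x + 1) * (x + 2) * (x + 3), 24)
            + Fraction(b * x * (x + 1) * (x + 2), 6)
            + Fraction(c * x * (x + 1), 2)
            + d * x + e)

def is_integer_valued(values):
    """True when every exact value has denominator 1."""
    return all(v.denominator == 1 for v in values)

# Usage with arbitrary sample coefficients.
cubic_values = [cubic(x, 5, 3, 2, 0) for x in range(50)]
quartic_values = [quartic(x, 7, 1, 4, 2, 0) for x in range(50)]
```

The check works because a product of $r$ consecutive integers is divisible by $r!$; it illustrates the "if" direction of the characterizations quoted in the footnotes, not the converse.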

For $a_1 = \cdots = a_s = 1$, the problem is the classical Waring problem for third and fourth degree polynomials. If $P(x)$ in (2) is such that $a \not\equiv 4c \pmod 8$, James has shown that every sufficiently large integer $n$ is a sum of nine values of $P(x)$.³ We show that for $s = 9$ and for integral constants $(a_1, \cdots, a_s)$ satisfying certain congruential conditions given later, every sufficiently large integer can be expressed in the form (1). For $P(x)$ as in (3), Miss Humphreys has given conditions which are sufficient to prove that every sufficiently large integer $n$ is expressible as the sum of 21 values of $P(x)$. Under certain further assumptions on $P(x)$ and on the sets of positive integers $(a_1, \cdots, a_s)$ we shall


Received June 26, 1936; in revised form, June 28, 1937.
¹ A cubic polynomial in $x$ is an integer for all integers $x \ge 0$ if and only if it is of the form (2). R. D. James, The representation of integers as sums of values of cubic polynomials, American Journal of Mathematics, vol. 56 (1934), pp. 303-315. See also D. H. Hilbert, Über die Theorie der algebraischen Formen, Mathematische Annalen, vol. 36 (1890), pp. 511-512.
² A fourth degree polynomial in $x$ is an integer for all integers $x \ge 0$ if and only if it is of the form (3). See M. G. Humphreys, On the Waring problem with polynomial summands, this Journal, vol. 1 (1935), pp. 361-375. The proof is accomplished by a slight modification of that of Hilbert in the paper previously cited.
³ James, loc. cit.















show that every sufficiently large integer n can be expressed in the form (1) 
for s = 21. 

The proof depends on certain analytic theorems and on certain congruential 
theorems. The analytic theorems follow closely those of James* and Landau.* 
We quote the analytic theorems without proof;’ the congruential theory is 
given in detail. 


2. Notations. The following notations are used throughout this paper. 
Further notations will be introduced when required. 


2 
id oki /Ir a,P (A) 
Ket”: R = 1-—1/K; f(z) = D2’ = 
h=0 
r(n) = r(n, k, 8, ay, +++ , as 3 a, b, c, d, e) = the number of solutions of (1); 
p denotes a primitive q-th root of unity; 
rrm 
Y a, P(A) 
S(a,, p) = > pe . ; 
h=r+1 
Y a, P(A) 
S,, — p 3» p ’ 
h 
where A ranges over a complete set of residues mod q; 
; ; 
Ad” =— LITS,,0": 
Q" iq) vl 
x 
So = San = > A (q) will be referred to as the Singular Series; 


a 


(a, b) = g.c.d. of a and b. 


3. Principal analytic theorem. Consider the following polynomial 
(4) P(r) = apr® + aa! +--+ + ax, 


where ay > O and a, a, +++ , ay are integers. The value of s considered will 
be that of the first Hardy-Littlewood theory. The notations of the previous 
section are used with P(x) replaced by (2). Consider also equation (1) with 
P(x) replaced by &(v). The theorem which follows can then be proved for the 
polynomial &(x) of degree k by a generalization of the proof given by Landau.’ 
Theorem 1. Fors 2 (k — 2)K + 5 
‘(n) — ao (I + 1/k) an wm 
(a, «++ a,)*P(s/k) ~° ell : 
‘ James, op. cit 
*E. Landau, Uber die neue Winogradoffsche Behandlung des Waringschen Problems, 
Mathematische Zeitschrift, vol. 31 (1929), pp. 319-338, 
⁶ Proofs of the analytic theorems are given in the writer's doctoral dissertation at the University of Chicago, August, 1935.
’ Landau, op. cit 










where Co is a constant depending only on ®, s and max a, and B, is a constant 
depending only on k. 


4. Primitive solutions. Consider &(z) as in (4). Let @ be the highest power 
of the prime p which divides every coefficient of &’(r). Then 
by(x) = p (x) , 
has at least one coefficient prime to p. We define y as follows: 
y = 6+ 1 for p > 2, 
= 6+ 2 for p = 2. 


Let Mo(m) = Mo(m, n) denote the total number of solutions of 
(5) 3S a,b(z,) = n (mod m), 0s z, <n. 
v~l 
For m = p’, let No(p', n) denote the number of solutions of (5) for which 


p/g.c.d. of (z,). Let N,,(p',n) denote the number of solutions with p / g.c.d. 
a(z,). The last solutions defined will be referred to as primitive solutions. 
By generalizations of the work of Landau and James previously referred to, 
the following three lemmas can be proved. 

Lemma l. If l 2 y then 


N.,(p') = p'-?°N.,(p’). 


no 


LEMMA 


M,(m) = m"' > A,(q). 


ein 
Lemma 3. If p, denotes the h-th prime, then 


DL Aol) = IT 2X Aol). 


sip! p! popi ap! 
The following lemma is also required. 


Lemma 4. Fore 20 


Aol(qg) > —Keq sin 


where Ez, is a constant depending at most on k, 8, ao, +++ , am, 4, °°" 5G. 


5. Application to the cubic and to the quartic. The analytic theorem and the lemmas that we have stated apply to a polynomial of degree $k$ with integral coefficients and leading coefficient positive. The polynomials $P(x)$ of (2) and (3) have leading coefficient positive but the coefficients are not integers. Hence, for these cases, let $Q(x) = Q(x; v, t) = P(vx + t)$, where $t \ge 0$ and $v > 0$ are definite integers depending on the coefficients of $P(x)$ in the respective cases. We choose $v$ so that it contains the least number of factors required to make $Q(x)$ a polynomial with integral coefficients. The choice of $t$ depends more specially on the case under consideration. For $Q(x)$ we define $\theta$ as before. For both the cubic and the quartic, $p > 3$ implies $\theta = 0$ since $p$ does not divide all coefficients of $P(x)$.


6. Introduction to the congruential theory. The main portion of this paper is devoted to the development of certain congruential theorems⁸ for the polynomials (2) and (3). We are then able, with the aid of the analytic theory introduced, to prove the principal results given in Theorems 3 and 5. We assume that the cubic is not of the type excepted by James⁹ and that the quartic is not of the type excepted by Miss Humphreys.¹⁰

The following definition will be used: For a prime $p$ and a positive integer $l$, a set of positive integers $(a_1, \cdots, a_s)$ has the property $S(p, p^l)$ if one of the set $(a_1, \cdots, a_s)$, which we may designate by $a_j$, is prime to $p$ and if the $a_i$ ($i = 1, \cdots, s$; $i \ne j$) are such that for every integer $n$ there exists a $t < s$ for which

t 
> >» a; =n (mod p) 
i=1 


is solvable. 


7. Results for cubic polynomials. We seek conditions on the coefficients of $P(x)$ in (2) and on the constants $(a_1, \cdots, a_s)$ under which the equation (1) is solvable in integers $h_\nu \ge 0$ for all sufficiently large integers $n$. We may evidently assume $d = 0$ in (2). We may also assume that $a$, $b$, $c$ have no common factor other than unity; for if $p \mid a$, $p \mid b$ and $p \mid c$, then $p \mid P(x)$ and $\sum_{\nu=1}^{s} a_\nu P(x_\nu)$ would represent only multiples of $p$.

We first prove Theorem 2, a congruential theorem for the cubic; and then by Lemma 5 we obtain Theorem 3, the final theorem for the cubic. We state these theorems as follows for $P(x)$ as in (2).


THEOREM 2. If the polynomial $P(x)$ and the set of positive integers $(a_1, \cdots, a_s)$ satisfy the following three conditions:
(a) for every prime $p > 3$, at least six of the positive integers $(a_1, \cdots, a_s)$ are prime to $p$;
(b) the set (a; , «++ , a.) has the properties S(2, 2°) and S(3, 3°); 
(c) $a \not\equiv 4c \pmod 8$, i.e., the polynomial is not of the type excepted by James;
then for every integer $n$ and for $Q(x) = P(vx + t)$ there exist primitive solutions of the congruence

\[ \sum_{i=1}^{s \ge 9} a_i\, Q(x_i) \equiv n \pmod{p^{\gamma}}. \tag{6} \]


* The methods of proof are similar to those of L. bh. Dickson, Cyclotomy, higher congru- 
ences and the Waring’s problem, American Jour. of Math., vol. 57 (1935), pp. 301 424 
* Loc. cit, 


© Loe. cit 










THEOREM 3. Let $s \ge 9$ be an integer; and let $P(x)$ and the set of positive integers $(a_1, \cdots, a_s)$ be such that the conditions (a), (b), and (c) of Theorem 2 are satisfied. Then there exists a number $C_{10}$ depending only on $s$, $a$, $b$, $c$ and $a_1, \cdots, a_s$ such that every integer $n > C_{10}$ is expressible in the form

\[ n = \sum_{\nu=1}^{s} a_\nu P(x_\nu), \qquad x_\nu \ge 0. \]


8. Congruential theory for cubic polynomials. We first consider (6) for primes $p > 3$. In §5 we saw that $\theta = 0$ for $p > 3$, hence $\gamma = 1$. Consider the polynomial $Q(x)$ in the form

\[ Q(x) = A_1 x^3 + A_2 x^2 + A_3 x, \tag{7} \]

where $A_1$, $A_2$, $A_3$ are integers whose g.c.d. is unity. Suppose, first, that $A_1 \equiv 0 \pmod p$. Then the congruence is a second degree congruence which can be treated directly. Hence suppose $A_1 \not\equiv 0 \pmod p$. Then we may evidently take $A_1 \equiv 1 \pmod p$ since, if we determine $d$ by $dA_1 \equiv 1 \pmod p$, $dn$ ranges with $n$ in (6) over all residues (mod $p$). Let $a_1, \cdots, a_6$ be six members of the set $(a_1, \cdots, a_s)$ which are prime to the given prime $p$. Then each $a_i$ ($i = 1, \cdots, 6$) satisfies one of the congruences in

(8) a; = 2’. a; = gx’, a; = gr (mod p), 


where g is a fixed primitive root of p. Hence without loss of generality assume 


(Y) a, = Get as = aus (mod p). 
Case 1 AvAs(a, + a2)(a; + as) A O (mod p). Let re = —wmary. Then 
ayQ)(r,) + al)(r2) a(ujr; + t) + Ag(aguir, + (es) + A;(aguix, + asr2) 
ag usr; urns) + Aoaolutr, + uiri) + Agas(ujay — uy) 
ANoag(uy + uj) + Ayaslut — 1)2; (mod p). 
Similarly, let z u,r,. Then 
agh(ay) + aQ(ry) Aoa;(u; + M3) a3 + Aya uz Us)r, = (mod p). 
Next let ¢ X, + z, where zis such that 
(QAsa(u; 4 u;))z + Ayto(u; 4) 0 (mod p) 


This has one solution, since, from hypothesis, 2Aea.(u; 4+ uy) is not divisible 


by p. Similarly, set x, X, + v, where vis such that 
2A.a us } “3 )v 1 Ava,(us Us) 0 (mod p) 


If Ayiglu “;) 0 (mod p), this transformation is unnecessary 













We have then essentially 


4 


(10) > a;Q(x,) = rx? + sy =n (mod p), 


i=1 
where neither r nor s is divisible by p. This congruence is solvable for every 
integer n. 
Case 2." AsA; # 0, a, + a2 = O (mod p). Let x7; = re + l, where lis so 
chosen that 31 + A, # 0 (mod p). Then 
AQ(21) + 2Q(%2) = aQ(t2 + 1) + a2Q(z2) 
(a, + a2)Q(x2) + ay(x3(3l + Ag) + 22(3 + QA + As) 
+ (Pf + PA: + 1A3)); 
aQ(21) + aeQ(re) = a,(3l + A:)zx} + a,(3P + 21d, + As)r2 + const (mod p). 
The remainder of the proof is as in Case 1. 
Case 3. $A_2 A_3 \equiv 0 \pmod p$. We show that in this case we can make a transformation, $x = X + l$, which carries $Q(x)$ to

\[ Q_1(x) = X^3 + A_2' X^2 + A_3' X + \text{const} = X^3 + X^2(3l + A_2) + X(3l^2 + 2A_2 l + A_3) + \text{const} \]

in which $A_2' A_3' \not\equiv 0 \pmod p$. Suppose first that $A_2$ and $A_3$ are both divisible by $p$. Choose $l$ prime to $p$ and we have $A_2' A_3'$ not divisible by $p$ as desired. Suppose next that $A_2$ is divisible by $p$ but that $A_3$ is not divisible by $p$. Now

\[ 3l^2 + A_3 \equiv 0 \pmod p \tag{11} \]

has at most two incongruent solutions for $l$. Hence there are at least $(p - 2)$ incongruent values of $l$ for which (11) is not satisfied. Any one of these $(p - 2)$ values of $l$ makes $A_2' A_3' \not\equiv 0 \pmod p$. Finally, suppose that $A_2$ is not divisible by $p$ but that $A_3$ is divisible by $p$. There is a unique value of $l$ (mod $p$) for which $3l + A_2$ is divisible by $p$. One of the remaining $(p - 1)$ incongruent residues (mod $p$) satisfies

\[ 3l + 2A_2 \equiv 0 \pmod p. \]

Hence there exist $(p - 2)$ incongruent values of $l$ for which

\[ (3l + A_2)(3l + 2A_2) \not\equiv 0 \pmod p. \]

We use one of these values of $l$ in the transformation and we obtain $A_2' A_3' \not\equiv 0 \pmod p$.

Hence in all cases we can satisfy the condition that $A_2 A_3$ be not divisible by $p$. We require a primitive solution of (6); i.e., at least one $a_i Q'(x_i)$ prime to $p$. Suppose that no one of the $a_i Q'(x_i)$ which we have used satisfies the desired condition. Then choose $x_5$ so that $a_5 Q'(x_5)$ is prime to $p$. This we can do


¹¹ If $a_3 + a_4 \equiv 0 \pmod p$, the treatment is the same as when $a_1 + a_2 \equiv 0 \pmod p$.










since we have just seen that if $A_3$ is divisible by $p$ we can make a transformation on $x$ to give $Q_1(x)$ in which $A_3'$ is not divisible by $p$; and let $x_5$ be divisible by $p$. Then $a_5 Q(x_5)$ is divisible by $p$ while $a_5 Q'(x_5) \equiv a_5 A_3 \pmod p$ is not divisible by $p$.

We have then shown that for $p > 3$ there exist primitive solutions of the congruence

\[ \sum_{i=1}^{6} a_i\, Q(x_i) \equiv n \pmod p \]

if for every prime $p > 3$, $a_1, \cdots, a_6$ are all prime to $p$. We have thus proved Theorem 2 for primes $p > 3$.

For p < 3, James used in the congruence (6) fewer than 9 equal values of the 
polynomial and, if necessary, one additional value of the polynomial which 
insured that the primitivity condition be satisfied in order to solve the con- 
gruence for an arbitrary integer n. If we assume that our set (a, --- , a,) 
has the properties S(2, 2°) and S(3, 3°), the discussion given by James applies 
to the present problem.” We have thus proved Theorem 3. 


9. Results for the quartic polynomial. We consider $P(x)$ as in (3). As in the case of the cubic polynomial, we may assume that $a$, $b$, $c$, $d$ and $e$ have no common factor other than unity. We prove Theorem 4, a congruential theorem for the quartic similar to Theorem 2 for the cubic, and give in Theorem 5 our final results for the quartic.


THEOREM 4. If the polynomial $P(x)$ and the set of positive integers $(a_1, \cdots, a_s)$ satisfy the following conditions:

(a) for every prime $p > 3$, eleven of the constants $(a_1, \cdots, a_s)$ are prime to $p$;

b) the set (a; , ++ ,a,) has the properties S(2, 2‘) and S(3, 3°); 


(c) the coefficient of $x$ in the normal form¹³ of the polynomial is not divisible by a prime of the form $4m + 3$ ($p > 3$);
(d) the polynomial is not of the kind excepted by Miss Humphreys,
then for every integer $n$ and for $Q(x) = P(vx + t)$ there exist primitive solutions of the congruence

\[ \sum_{i=1}^{s} a_i\, Q(x_i) \equiv n \pmod{p^{\gamma}}. \tag{12} \]
iJ 

THEOREM 5. Let $s \ge 21$ be an integer; and let $P(x)$, the quartic polynomial in (3), and the set of positive integers $(a_1, \cdots, a_s)$ be such that the conditions (a), (b), (c), (d) of Theorem 4 are satisfied. Then there exists a constant $C_{11}$ depending on $a$, $b$, $c$, $d$ and max $a_\nu$ such that every integer $n > C_{11}$ is expressible in the form

\[ n = \sum_{\nu=1}^{s} a_\nu P(x_\nu). \]


Qur hypothesie makes Lemmas 12, 14, and 14 of James’ paper available for our dis 

on. The use of Lemma ’ ean be avoided. Sections 5 and 6 of James’ paper then give 
t ‘ re ré ' 

Shy ! the polynomial ts discuseed in the next section 













10. Congruential theory for the quartic. Suppose that Q(x) is of the form

$Q(x) = \beta_0 x^4 + \beta_1 x^3 + \beta_2 x^2 + \beta_3 x + \beta_4$,

where $\beta_i$ (i = 0, 1, ..., 4) are integers and $\beta_0 > 0$. This we can assume after
transformation of P(x) in (3) by replacing x by vx + t. As in the case of the
cubic we may assume that $\beta_0 \equiv 1 \pmod p$. Moreover, if k is prime to p (this
is the case for p > 3) we replace x by X + z where z is so chosen that the coefficient
of $X^3$ is divisible by p. Finally, after this transformation, we may evidently
assume that the constant term is zero and consider (12) for Q(x) replaced by

$Q_1(x) = x^4 + A_2 x^2 + A_3 x$.

We call $Q_1(x)$ the normalized form of the polynomial Q(x). Then condition (c)
of Theorem 4 implies $A_3 \not\equiv 0$ (mod p = 4m + 3, p > 3).
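The normalization just described can be traced numerically. The following is only a sketch under stated assumptions: the function names `shift_mod`, `normalize_quartic` and `evaluate` are hypothetical, the polynomial is stored with `b[k]` multiplying x^k, and the leading coefficient is made congruent to 1 by multiplying through by its inverse mod p, which is one way (not necessarily the author's) of arranging $\beta_0 \equiv 1$.

```python
from math import comb


def shift_mod(b, z, p):
    """Coefficients (b[k] multiplies x**k) of Q(X + z), reduced modulo p."""
    d = len(b) - 1
    return [
        sum(b[k] * comb(k, m) * pow(z, k - m, p) for k in range(m, d + 1)) % p
        for m in range(d + 1)
    ]


def normalize_quartic(b, p):
    """Return [0, A3, A2, 0, 1], the coefficients of x^4 + A2*x^2 + A3*x mod p.

    Assumes p is a prime greater than 3 that does not divide b[4].
    """
    if p <= 3 or b[4] % p == 0:
        raise ValueError("need a prime p > 3 not dividing the leading coefficient")
    inv_lead = pow(b[4], -1, p)            # make the leading coefficient 1 mod p
    monic = [(c * inv_lead) % p for c in b]
    z = (-monic[3] * pow(4, -1, p)) % p    # solves 4*z + beta_1 = 0 (mod p)
    shifted = shift_mod(monic, z, p)       # the X^3 coefficient is now 0 mod p
    shifted[0] = 0                         # drop the constant term
    return shifted


def evaluate(b, x, p):
    """Value of the polynomial with coefficient list b at x, modulo p."""
    return sum(c * pow(x, k, p) for k, c in enumerate(b)) % p
```

For example, `normalize_quartic([7, 5, 3, 2, 1], 7)` treats x^4 + 2x^3 + 3x^2 + 5x + 7 modulo 7; the translation used is z = 3, since 4·3 + 2 is divisible by 7.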

We consider the congruence (12) for primes p > 3. Then θ = 0 and γ = 1.
Hence we consider

(13) $\sum_{i=1}^{s} a_i Q_1(x_i) \equiv n \pmod p$, s = 11,

where we assume that $(a_1, \dots, a_{11})$ are prime to p. By Lemma 39 of a paper
by Huston,¹⁵ there exist primitive solutions of

$\sum_{i=1}^{5} a_i h_i^4 \equiv n \pmod p$.

Put $x_i = h_i y_1$ (i = 1, ..., 5), $x_i = h_i y_2$ (i = 6, ..., 10), where $y_1$ and $y_2$ are
prime to p and where the $h_i$ are so chosen that

(14) $\sum_{i=1}^{5} a_i h_i^4 \equiv 0 \equiv \sum_{i=6}^{10} a_i h_i^4 \pmod p$

and in each congruence at least one $a_i h_i$ is prime to p.


Case 1. $A_2 A_3 \not\equiv 0 \pmod p$. Suppose first that the values of $h_i$ (i = 1, ..., 5)
which we have chosen to satisfy $\sum_{i=1}^{5} a_i h_i^4 \equiv 0 \pmod p$ also satisfy
$\sum_{i=1}^{5} a_i h_i^2 \equiv 0 \pmod p$. Then by choice of $h_1$ or $-h_1$ we can have
$\sum_{i=1}^{5} a_i h_i$ not divisible by p. This is true since we may take $a_1 h_1$ prime to p
and since if $(h_1, \dots, h_5)$ satisfies

$\sum_{i=1}^{5} a_i h_i^4 \equiv \sum_{i=1}^{5} a_i h_i^2 \equiv \sum_{i=1}^{5} a_i h_i \equiv 0 \pmod p$,

then $(-h_1, h_2, \dots, h_5)$ satisfies the first two congruences but insures that
$\sum_{i=1}^{5} a_i h_i$ is not divisible by p. Hence


¹⁵ R. E. Huston, Asymptotic generalizations of Waring's theorem, Proceedings of the
London Mathematical Society, (2), vol. 39 (1935), pp. 82-115.



















$\sum_{i=1}^{5} a_i Q_1(x_i) \equiv A_3 \sum_{i=1}^{5} a_i h_i y_1 \equiv n \pmod p$,

which is solvable for every integer n. To satisfy the condition that the solution
be primitive it may be necessary to add one more summand, $a_6 Q_1(x_6)$, where $x_6$
is divisible by p while

$a_6 Q_1'(x_6) = a_6(4x_6^3 + 2A_2 x_6 + A_3) \not\equiv 0 \pmod p$,

since p does not divide $A_3$.


Suppose next that the values of $h_i$ (i = 1, ..., 10) which we have chosen
to satisfy (14) are such that

$\sum_{i=1}^{5} a_i h_i^2 \not\equiv 0, \quad \sum_{i=6}^{10} a_i h_i^2 \not\equiv 0 \pmod p$.

Then

(15) $\sum_{i=1}^{10} a_i Q_1(x_i) \equiv A_2\Big(\sum_{i=1}^{5} a_i h_i^2\Big) y_1^2 + A_3\Big(\sum_{i=1}^{5} a_i h_i\Big) y_1 + A_2\Big(\sum_{i=6}^{10} a_i h_i^2\Big) y_2^2 + A_3\Big(\sum_{i=6}^{10} a_i h_i\Big) y_2 \pmod p$.

Put $y_2 = Y_2 + Z_2$, where $Z_2$ is so chosen that the coefficient of $Y_2$ in (15) is
divisible by p. Put $y_1 = Y_1 + Z_1$, where $Z_1$ is so chosen that the coefficient
of $Y_1$ in (15) is divisible by p. We then have essentially

$\sum_{i=1}^{10} a_i Q_1(x_i) \equiv r Y_1^2 + s Y_2^2 \equiv n \pmod p$,

which is solvable for every integer n since

$r = A_2 \sum_{i=1}^{5} a_i h_i^2$ and $s = A_2 \sum_{i=6}^{10} a_i h_i^2$

are both not divisible by p > 3. As before we may add one more summand,
$a_{11} Q_1(x_{11})$, divisible by p to insure that the primitivity condition is satisfied.
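The solvability claim for a binary diagonal quadratic form with both coefficients prime to p can be checked by brute force for small primes. This is a sketch for illustration only; the helper names `represented_residues` and `is_universal` are hypothetical and do not come from the paper.

```python
def represented_residues(r, s, p):
    """Set of residues n (mod p) taken by r*Y1^2 + s*Y2^2."""
    return {(r * y1 * y1 + s * y2 * y2) % p for y1 in range(p) for y2 in range(p)}


def is_universal(r, s, p):
    """True when every residue class mod p is represented by r*Y1^2 + s*Y2^2."""
    return len(represented_residues(r, s, p)) == p
```

The exhaustive search reflects the usual counting argument: each of r·Y1² and n − s·Y2² takes (p + 1)/2 values mod p, so the two value sets must meet.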

Case 2. $A_2 \equiv 0$, $A_3 \not\equiv 0 \pmod p$. Evidently this case is equivalent to the
first part of the preceding case since $A_2$ divisible by p has the same effect in
the congruence as

$\sum_{i=1}^{5} a_i h_i^2 \equiv 0 \pmod p$.


Case 3. $A_2 \equiv A_3 \equiv 0 \pmod p$. Then

$\sum_i a_i Q_1(x_i) \equiv \sum_i a_i x_i^4 \equiv n \pmod p$

has a primitive solution by Lemma 29 of Huston's paper.

Case 4. $A_2 \not\equiv 0$, $A_3 \equiv 0$ (mod p = 4m + 1). Choose $h_i$ so that
$\sum_{i=1}^{5} a_i h_i^4 \equiv 0 \pmod p$ and $\sum_{i=1}^{5} a_i h_i^2 \not\equiv 0 \pmod p$. This can be done since
$\xi^4 \equiv 1 \pmod p$ has at least one root ξ such that $\xi^2 \not\equiv 1 \pmod p$. Hence if
$(h_1, h_2, \dots, h_5)$ is a solution of

$\sum_{i=1}^{5} a_i h_i^4 \equiv \sum_{i=1}^{5} a_i h_i^2 \equiv 0 \pmod p$,

then $(\xi h_1, h_2, \dots, h_5)$ is a solution of $\sum_{i=1}^{5} a_i h_i^4 \equiv 0 \pmod p$ but does not satisfy
$\sum_{i=1}^{5} a_i h_i^2 \equiv 0 \pmod p$. Use this latter set of $h_i$. Then we can obtain

$\sum_{i=1}^{5} a_i Q_1(x_i) + \sum_{i=6}^{10} a_i Q_1(x_i) \equiv A_2\Big(\sum_{i=1}^{5} a_i h_i^2\Big) y_1^2 + A_2\Big(\sum_{i=6}^{10} a_i h_i^2\Big) y_2^2 \equiv n \pmod p$,

which is solvable for every integer n since neither $r = A_2\big(\sum_{i=1}^{5} a_i h_i^2\big)$ nor
$s = A_2\big(\sum_{i=6}^{10} a_i h_i^2\big)$ is divisible by p.

The proof fails when $A_3$ is divisible by a prime of the form 4m + 3. This
gives rise to the exception in the theorem. An additional summand, $a_{11} Q_1(x_{11})$,
can be added if necessary to satisfy the condition that our solution be primitive.

Hence for p > 3 we have the congruence (12) solvable for s = 11, with the
one exception noted.

For p ≤ 3 we assume that our set has the properties S(2, 2‘) and S(3, 3°).
This assumption reduces the discussion of this case to that of Miss Humphreys.
Her proof for the quartic case, p ≤ 3, $a_1 = a_2 = \cdots = a_s = 1$ then applies to
the case under discussion. Hence we have Theorem 4.


11. Proofs of Theorems 3 and 5. If $\mathfrak{S}(n) \ge \eta > 0$, where $\eta$ is independent
of n, it follows from Theorem 1 that r(n) > 0 when


n> Cw = ((Cosad “(ay «++ a.) * TP (8/k)/nt (+ 1/k))Y. 
The proofs of Theorems 3 and 5 are thus reduced to the proof of 


LEMMA 5. If for the cubic polynomial s ≥ 9 and if for the quartic polynomial
s ≥ 21, then


~ 
- 
~ tn 


ao 


where $\eta$ is independent of n.
Sinee Mo(p') = N.,(p') we have from Lemmas 2 and | and Theorems 2 and 4 


in the respective CASES 
Dd Ao(q) p ""’ MA(p') = p ” ’ N.,(p) 


(16) win 
p (s py! yde it N. (p ») . p yis t ™ p ( 1 : 










where t = 4 for P(x) in (3) and t = 3 for P(x) as in (2). The final inequality
holds since in the respective cases $(l \ge 4 \ge \gamma)$, $(l \ge 3 \ge \gamma)$. Then by Lemma 4
with $\varepsilon = 1/8$


(17) po Ao(q) = 1+ > Ao(p’) ~i-= E, >. Pp — = i- E.(p"” = 1) 2 
alp! A~=1 A=1 


Finally, the singular series is convergent for $s \ge (k - 2)2^{k-1} + 5$ and hence by
(16), (17) and Lemma 3


S, = lim D> Ao(q) = lim II >d Aol) 


9% [1-2 psp yt 
' aipt---p! Oo psp, ap 
; , 


> [] max (p “"”, 1 — Ea(p’* — 19’) 


rer @ 


UNIVERSITY OF CHICAGO 

















TRIGONOMETRIC APPROXIMATION IN THE MEAN 
By E. S. QUADE


The following theorem is stated without proof by G. H. Hardy and J. E.
Littlewood.¹

THEOREM. The class Lip(α, p) is identical with the class of functions f(x)
approximable in the mean p-th power, with error $O(n^{-\alpha})$, by trigonometrical
polynomials of degree n.

They remark in addition: This approximation may be made in general by the
Fourier polynomials of f(x); the case p = ∞, in which this is not true, is exceptional.

The initial purpose of this paper is to examine the range of values of p and α
for which this theorem and remark are true and to supply proofs. In doing
this, related theorems are obtained in which the approximations are in terms of
the metric of a more extensive space than $L_p$ and in which the functions that
measure the degree of approximation are more general than $n^{-\alpha}$. These
theorems and their proofs parallel to a large extent the theorems given by de la
Vallée Poussin² and Dunham Jackson³ for the class Lip(α).

We assume throughout that our functions f(x) are periodic with the period
2π. The functions Φ(u) and Ψ(u) are of Young's type.⁴ That is, Φ(u) is non-negative,
convex, and satisfies the relations Φ(0) = 0 and Φ(u)/u → ∞ as u → ∞;
Ψ(u) has similar properties and is such that Young's inequality

$uv \le \Phi(u) + \Psi(v)$, u, v ≥ 0

holds. Throughout the paper we will write Φ|u|, Ψ|u|, for Φ(|u|), Ψ(|u|).
If f(x) is measurable and such that $\int_0^{2\pi} \Phi|f|\,dx$ exists, f(x) is said to belong
to the space $L_\Phi(0, 2\pi)$. If f(x) is such that the product f(x)g(x) is integrable
for every $g(x) \in L_\Psi$, then $f(x) \in L_\Phi^*$. For this space

$\|f\|_\Phi = \sup \Big|\int_0^{2\pi} f(x)g(x)\,dx\Big|$

for all measurable g(x) with $\rho_g = \int_0^{2\pi} \Psi|g|\,dx \le 1$. This space⁵ is linear,

Received October 5, 1936; in revised form, July 9, 1937.

¹ A convergence criterion for Fourier series, Math. Zeit., vol. 28 (1928), pp. 612-634; in
particular, p. 633.

² Leçons sur l'Approximation des Fonctions, Paris, 1919. We shall refer to this treatise
as (P).

³ The Theory of Approximation, New York, 1930. We shall refer to this treatise as (D).

⁴ A. Zygmund, Trigonometrical Series, Warsaw, 1935, §§4.11, 4.142. We shall refer to
this treatise as (Z). It contains extensive bibliographical references to original sources.

⁵ (Z), §4.541.










metric, and complete. If $f(x) \in L_\Phi^*$, we put, for δ > 0,

$\omega_\Phi(\delta; f) = \sup_{0 < |h| \le \delta} \|f(x + h) - f(x)\|_\Phi$.

For p > 1, $L_p$ is a class $L_\Phi^*$. In $L_p$, p ≥ 1,

$\omega_p(\delta; f) = \sup_{0 < |h| \le \delta} \|f(x + h) - f(x)\|_p = \sup_{0 < |h| \le \delta} \Big(\int_0^{2\pi} |f(x + h) - f(x)|^p\,dx\Big)^{1/p}$.

If $\omega_p(\delta; f) = O(\delta^{\alpha})$, δ → 0, f(x) is said to belong to the class⁶ Lip(α, p). The
limiting case of Lip(α, p), denoted Lip(α, ∞), is identical with Lip(α). For
brevity, we shall write ‖f‖ for $\|f\|_\Phi$ whenever it will not lead to confusion.
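The integral modulus of continuity defined above can be approximated on a uniform grid. The sketch below is an illustration only: the name `lp_modulus` is hypothetical, the function is represented by its samples at the midpoints of N equal subintervals of (0, 2π), and the shifts h are restricted to multiples of 2π/N. Only positive shifts are scanned, since by periodicity a shift by −h gives the same norm.

```python
import math


def lp_modulus(samples, max_shift, p):
    """Discrete stand-in for w_p(delta; f) with delta = max_shift * 2*pi / N.

    samples[j] is f at the j-th grid midpoint; indices wrap around (period 2*pi).
    """
    n = len(samples)
    step = 2.0 * math.pi / n
    best = 0.0
    for k in range(1, max_shift + 1):
        total = sum(abs(samples[(j + k) % n] - samples[j]) ** p for j in range(n))
        best = max(best, (total * step) ** (1.0 / p))
    return best
```

For the square wave equal to 1 on (0, π) and −1 on (π, 2π), the difference f(x + h) − f(x) has modulus 2 on a set of measure 2h, so the norm is 2(2h)^{1/p}; this function therefore belongs to Lip(1/p, p) although it is not continuous.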
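In order to prove the next theorem we need the following two lemmas.

LEMMA A. If $t_n(x)$ is a trigonometrical polynomial of degree n at most, then

$\|t_n'\|_\Phi \le 2n\,\|t_n\|_\Phi$.

This lemma is a trivial extension of a theorem of Zygmund⁷ that, when
Φ(u) is a Young's function and $t_n(x)$ a trigonometrical polynomial of degree n,
then

$\int_0^{2\pi} \Phi\Big(\frac{|t_n'|}{n}\Big)\,dx \le \int_0^{2\pi} \Phi|t_n|\,dx$.

If $t_n(x) \equiv 0$, $\|t_n'\| = \|t_n\| = 0$. Assuming $t_n(x) \not\equiv 0$, we have, since $\|t_n\|$
is bounded below by a positive multiple of $\int_0^{2\pi} |t_n|\,dx > 0$, by Young's inequality
and the above theorem of Zygmund

$\frac{\|t_n'\|}{\|t_n\|} = n \sup \Big|\int_0^{2\pi} \frac{t_n'}{n\|t_n\|}\,g(x)\,dx\Big| \le n \sup \Big[\int_0^{2\pi} \Phi\Big(\frac{|t_n'|}{n\|t_n\|}\Big)\,dx + \rho_g\Big] \le n \Big[\int_0^{2\pi} \Phi\Big(\frac{|t_n|}{\|t_n\|}\Big)\,dx + \rho_g\Big]$.

Since $\rho_g \le 1$ and⁸

$\int_0^{2\pi} \Phi\Big(\frac{|t_n|}{\|t_n\|}\Big)\,dx \le 1$,

the lemma follows.

This lemma holds also for the cases⁹ $L_p$, p = 1 and p = ∞, which are not
spaces of type $L_\Phi^*$.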
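Since Lemma A is asserted for the spaces $L_p$ as well, its inequality can be tested numerically in that setting. The following sketch makes several assumptions: the names `lp_norm` and `derivative_ratio` are hypothetical, the polynomial is given by cosine and sine coefficient lists `a` and `b` of equal length, and the integrals are replaced by Riemann sums on a uniform grid, so the computed ratio is only an approximation of ‖t'‖_p / ‖t‖_p.

```python
import math


def lp_norm(values, p, step):
    """Riemann-sum approximation of the L_p norm over one period."""
    return (sum(abs(v) ** p for v in values) * step) ** (1.0 / p)


def derivative_ratio(a, b, p, grid=2048):
    """Approximate ||t'||_p / ||t||_p for t(x) = sum a[k] cos kx + b[k] sin kx."""
    n = len(a) - 1
    step = 2.0 * math.pi / grid
    t_vals, dt_vals = [], []
    for j in range(grid):
        x = j * step
        t_vals.append(
            sum(a[k] * math.cos(k * x) + b[k] * math.sin(k * x) for k in range(n + 1))
        )
        dt_vals.append(
            sum(k * (b[k] * math.cos(k * x) - a[k] * math.sin(k * x)) for k in range(n + 1))
        )
    return lp_norm(dt_vals, p, step) / lp_norm(t_vals, p, step)
```

For t(x) = cos 3x and p = 2 the ratio is 3, which shows that a factor of order n cannot be avoided; the constant 2n of the lemma leaves room to spare in the $L_p$ case.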


⁶ (Z), §§4.76, 4.77, 4.78.
⁷ A remark on conjugate series, Proceedings of the London Mathematical Society, vol. 34
(1932), p. 396.
⁸ (Z), §4.541.
⁹ (Z), §7.31.










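LEMMA B. If f(x) is absolutely continuous and has a derivative $f'(x) \in L_\Phi^*(0, 2\pi)$,
then

$\|f(x + h) - f(x)\|_\Phi \le 2|h| \cdot \|f'\|_\Phi$.

We have

$\|f(x + h) - f(x)\| = |h|\,\|f'\| \sup \Big|\int_0^{2\pi} g(x)\,\frac{f(x + h) - f(x)}{h\|f'\|}\,dx\Big| \le |h|\,\|f'\| \Big[\int_0^{2\pi} \Phi\Big(\frac{|f(x + h) - f(x)|}{|h|\,\|f'\|}\Big)\,dx + 1\Big]$.

We now need only show that

$\int_0^{2\pi} \Phi\Big(\frac{|f(t + h) - f(t)|}{|h|\,\|f'\|}\Big)\,dt \le 1$.

Let $e_h(t)$ be defined for each h ≠ 0 such that $e_h(t) = 1$ for t between 0 and h, and zero
elsewhere. Then

$\Phi\Big(\frac{|f(x + h) - f(x)|}{|h|\,\|f'\|}\Big) = \Phi\Big(\frac{\big|\int e_h(t) f'(t + x)\,dt\big|}{\|f'\| \int e_h(t)\,dt}\Big) \le \frac{1}{|h|} \int e_h(t)\,\Phi\Big(\frac{|f'(t + x)|}{\|f'\|}\Big)\,dt$

by Jensen's inequality.¹⁰
Consequently

$\int_0^{2\pi} \Phi\Big(\frac{|f(x + h) - f(x)|}{|h|\,\|f'\|}\Big)\,dx \le \frac{1}{|h|} \int e_h(t)\,dt \int_0^{2\pi} \Phi\Big(\frac{|f'(t + x)|}{\|f'\|}\Big)\,dx \le 1$

since

$\int_0^{2\pi} \Phi\Big(\frac{|f'(t + x)|}{\|f'\|}\Big)\,dx = \int_0^{2\pi} \Phi\Big(\frac{|f'(x)|}{\|f'\|}\Big)\,dx \le 1$

because of the periodicity of f(x).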
Let Ω(x) be a function not identically zero which satisfies the following
conditions.

(i) Ω(x) ≥ 0 and, at least for x greater than some $x_0$, decreases monotonically
to zero as x → ∞.

(ii) $\int_{x_0}^{\infty} \frac{\Omega(y)}{y}\,dy$ exists.

¹⁰ (Z), §4.14.













We are now prepared to prove

THEOREM 1. If the function f(x) can be approximated, for each n ≥ 1, by a
trigonometrical polynomial $T_n(x)$, of degree n at most, such that

$\|f - T_n\|_\Phi \le \frac{\Omega(n)}{n^r}$,

where r is a positive integer or zero, then f(x) is equivalent to an absolutely continuous
function having a derivative, $f^{(r)}(x)$, of order r, for which

(1) $\omega_\Phi(\delta; f^{(r)}) \le A \Big[\delta \int_a^{a/\delta} \Omega(x)\,dx + \int_{1/\delta}^{\infty} \frac{\Omega(x)}{x}\,dx\Big]$,

where a and A are constants which may depend on f(x) but not on δ.

In the case r = 0, we understand $f^{(0)}(x) = f(x)$. Two functions in this space
are equivalent if they differ on at most a set of measure zero. The statement
that f(x) has a derivative $f^{(r)}(x)$ means that the derivatives $f'(x), f''(x), \dots, f^{(r)}(x)$
exist on (0, 2π), the last almost everywhere, and that $f(x), f'(x), \dots, f^{(r-1)}(x)$
are absolutely continuous.

The method of proof is like that used by de la Vallée Poussin¹¹ for the case
p = ∞.

Set

$R_n(x) = f(x) - T_n(x)$,
$\varphi_k(x) = R_{a^k}(x) - R_{a^{k+1}}(x)$,

where a is an integer ≥ 2 such that Ω(x) is non-increasing for x ≥ a. Then
$R_{a^2}(x) = f(x) - T_{a^2}(x)$. Since $T_{a^2}(x)$ is a trigonometrical polynomial, $T_{a^2}(x)$
and its derivatives of every order are in $L_\Phi^*$ and by Lemma B,

$\|T_{a^2}^{(r)}(x + h) - T_{a^2}^{(r)}(x)\| \le 2|h| \cdot \|T_{a^2}^{(r+1)}\|$,

so that $\omega_\Phi(\delta; T_{a^2}^{(r)}) \le M\delta$, where M is a constant independent of δ. The theorem
will be proved if we can show that $R_{a^2}(x)$ satisfies the required conditions. For if
R(x) is absolutely continuous and equivalent to $R_{a^2}(x)$, then $R(x) + T_{a^2}(x)$ is
absolutely continuous and equivalent to f(x); moreover, if $R^{(r)}(x)$ exists and
satisfies a relation of the type of (1), $f^{(r)} = R^{(r)} + T_{a^2}^{(r)}$ will satisfy (1) since

$\omega_\Phi(\delta; f^{(r)}) \le \omega_\Phi(\delta; R^{(r)}) + \omega_\Phi(\delta; T_{a^2}^{(r)})$

and A in (1) may be chosen arbitrarily large.


n 


ha t PJ Since R42 = D; t Ran i, 


2 


nel 


— (2 
Rh, pa oy s Ran i * ~ 3 


qgintber 


¹¹ (P), §39.













~~. 
This implies that } > ¢. converges in L$ to R,z. By Lemma A, 
k=2 


k 
(2) lu |] S 2a**"|| p || < 2a**(|| Roe || + || Rowers ||) < 4a pl 
and consequently 
id 2 x k ~ k—l 2 2 
eo Sudss 5, SOs 7) tn < « 
k=? a—lim a** a—lj, 2 


oe 
"m™M:. : - 12 , . * . ‘ * 
rhis implies” that } x ¢ converges in Le to a function in Le, say p’(x). Set 


k=2 


(4) R(x) = [ p(t) + Ras(2), 


mn 


where the point # is such that }> ¢(#) — R.z(#); this is possible since there 
k=2 


k=? 


\r=e 


m 
must be a subsequence of {> os} which converges to R,: almost everywhere. 


, 13 . 
We have,” since 


(5) Yep dts Yd —p' 0, 
/9 k=2 k=? 
(6) Zz y(t) dt - | p'(t)dt = R(x) — R,2(2). 
k=? i z 
Thus from the equation 
(7) Yo =>dK | oa t+ ¥ oa, 
f=2 k=2 Je i= 


we have, by letting n — «, R(x) = R(x) almost everywhere. 
Consider r = 2. By Lemma <A, corresponding to (3), 


) oe 3 a 
‘ 9a) (2) 
1 (2) 4 : b+ii) 2? (2a Xx P : 
d || ¢% < 2200 ide || S [ aa dt < %; 


a—] 


~ 
this implies that Dre converges in L3 toa function in L$, say p? (zx). Pro- 
koe 


ceeding as in the ease of p’(x), we set 


(8) R’(x) [ p” (dt + p’(2), 


where 7, }m,} are so chosen that y ox(F) — p'(%). Then, by exactly the 
{= 


¹² S. Banach, Théorie des Opérations Linéaires, Warsaw, 1932, p. 37.
¹³ E. C. Titchmarsh, The Theory of Functions, Oxford, 1932, §12.58.













argument used for (4), we have R(x) = p’(xz) almost everywhere. Thus 
a 


, P * P . p 
} & é, converges in Le to R’ and we can denote the derivative of R by R’ 


instead of p’ 

For r ≥ 1 we have shown the existence of an absolutely continuous function
R(x) equivalent to $R_{a^2}(x)$ and having a derivative R'(x) which, for r ≥ 2, is in
turn absolutely continuous.

The existence of R” now follows by induction. Let s < r, and suppose 
RR" exists, is absolutely continuous, and has as its derivative p(x) defined as 


nx 
7 * » (a) —a . ° s ‘ 
the limit in Leg of > ¢,"’. This hypothesis implies the existence of an 
=2 
absolutely continuous function, R" (2), equivalent to p” (2) which has as its 
an 
° ° a+] . . ° . - . +1 
derivative the function p (xr) defined as the limit in Lg of b o," (x). 
k=2 


The prool ix as follows By Lemma A 


lA 


9 oi” || saa 1 6f S (2a‘)** |i bel, 


and. corresponding to (3), we have 


- wane Aa (2a)"** [* Q(a)d 
10 p> o. <2 @ pm bea s _ | ee < Dm. 


, ° * . ° . * 
Phis unplies that Zz o. (x) converges in Le toa function in La, say p(x). 


Corresponding to (4) and (8) we set 


1] R'(z | p- (ty dt +p" (2), 
here mr ie <0 Chosen that ts o (%) — p° (F) By the argument used 
h=2 
wees i QO and « 1 with equation: corresponding to (5), (6), (7) we have 
Re (2 p(x) almost everywhere 
When « 1, we denote p(x) by R(x). Forr = 0, R(x) = R.a(z), 
Y We must show that ’ (x) satisfies (1). Since, for r = 0, > > oy 
k= 
ee Lj have from Lemma Band (9), 
It ro kt J < } o, ‘r+ h) OH) ; (4) 
’ ,¥ dy (re 4+ hy) oy (4) || 
mel 
Zh , ¥ & || +4 22. py 
mil 













< 2 inla’™? > aga’) + 27a’ DY ofa’) 
j= m+1 
9(2a)"* ait 9(2a)""! [* oO 
sh | o(s)de + 220 fH), 
a—l- : a—1 Jam 2 


. . . *, m—1 e—l1 s m 
If m is now chosen subject to the conditions @” ~~ S 6 < a" and A = 
+2 1 
2(2a)""“(a — 1)’, we have 


ws(6; R”) < | 5 | Q(a)de + | 22) ar] 
a 1/8 r 


If the Ω(x) considered in Theorem 1 satisfies only the hypothesis (i), namely,
that Ω(x) ↓ 0, and not (ii) also, then the condition $\|f - T_n\|_\Phi \le \Omega(n)n^{-r}$
does not necessarily imply that (1) holds. In fact $f^{(r)}(x)$ may not even belong to $L_\Phi^*$.
For consider the space $L_2$ and let


x 


fix) = DY Qk | sin ke. 
jem} 
Then, ifs, =,(27;f) is the n-th partial sum of the Fourier series of f(z), 
2 »(L 23 x } > ) 
a © *) < aw) ( > 4 < Qn) 
k—n+l ki tant KP n 


x 


But f(x) = pe O(k)k eos ker may not even be in Le since 


ly a hl 
ve bP ki 


does not necessarily converge; for example, if Q0r) = [log (2 + 1) 

We remark that the theorem just proved holds for $L_1$ as well as for $L_\Phi^*$ (and
thus for $L_p$, p > 1). The proof for $L_1$ is the above proof with the $L_\Phi^*$ norm replaced
by the $L_1$ norm since the inequalities of Lemmas A and B hold also
for $L_1$.


As an immediate corollary of Theorem 1 we have the positive assertion of the
following theorem.

THEOREM 2. If the function f(x) can be approximated for each n ≥ 1, by a
trigonometrical polynomial, $t_n(x)$, of degree n at most, such that $\|f - t_n\|_p = O(n^{-\alpha})$,
p ≥ 1, then

(i) if 0 < α < 1, f(x) ∈ Lip(α, p);

(ii) if α = 1, $\omega_p(\delta; f) = O(\delta \log \delta^{-1})$.

Moreover there exist functions for which $\|f - t_n\|_p = O(n^{-1})$ which do not belong
to Lip(1, p).








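In the previous theorem we choose r = 0, $L_\Phi^* = L_p$, $\Omega(x) = Mx^{-\alpha}$, where
M is a positive constant. Then

$\omega_p(\delta; f) = O\Big[\delta \int_a^{a/\delta} x^{-\alpha}\,dx + \int_{1/\delta}^{\infty} x^{-\alpha - 1}\,dx\Big]$.

This gives

(i) $\omega_p(\delta; f) = O(\delta^{\alpha})$, 0 < α < 1;
(ii) $\omega_p(\delta; f) = O(\delta \log \delta^{-1})$, α = 1.

If f(x) ∈ Lip(α, p), α = 1, p > 1, then f(x) is equivalent¹⁴ to the indefinite
integral of a function in $L_p$. Consider the function $f(x) = \sum_{n=1}^{\infty} n^{-3/2} e^{inx}$. We have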


bes 4 
If ile = (2 ae 
v=n+l 


But f(z) is not in Lip(1, 2) since 
f'(z) 2% Dv te” 
v=1 


and f’(z) is not in Le. Indeed, for h > 0, 


* j x . 2\ 4 
\f(z +h) — f(x — h) |\p = (x n* sin® nh) = n( a sinm*/) 
n=l nai nL nh 


— (2V ('S' Ve 3) “l) 
= (2) i( > n =\- h\ log an |)? 


j 
f(x +h) —flx —h) 2 0( a tog | ), 


If α > 1, we may take $\Omega(x) = x^{-(\alpha - r)}$, where r is an integer such that
0 < α − r ≤ 1. Then $f^{(r)}(x)$ exists and the conclusion of the theorem holds with
$f^{(r)}(x)$ in place of f(x).

We now turn to the consideration of theorems of the converse type.

THEOREM 3. If $f(x) \in L_\Phi^*$ possesses a derivative of order r, say $f^{(r)}(x)$, in $L_\Phi^*$,
where r is a positive integer or zero, then, for any positive integer n, f(x) may be
approximated in $L_\Phi^*$ by a trigonometrical polynomial $t_n(x)$, of order n at most,
such that

$\|f - t_n\|_\Phi = O\big(n^{-r}\,\omega_\Phi(n^{-1}; f^{(r)})\big)$.


1 (Z), $4.7, (8). By the use of lacunary series and the inequalities of (Z), §§9.601 and 


so that 


4.602, we need not restrict the example to the space L . 













To prove this theorem we use the method and notation of de la Vallée Pous- 


= HB and ¢(t) = bs as(z oe >) 


the A + I constants a, ,k = 0,1, 2, +--+ , being so determined that ¢(0) = f(z), 
(0) = 0,s = 1, 2,---.\. The trigonometrical polynomial t,(r) is defined 


by the equation 
1 o t\/sin t\?* 
” = — lt, 
tale) 7(\ + 2) [. 6(4)( t ) " 
: "iia 
r(A + 2) = ; at. 


The order n of t,(x) is (A + 2)2*m — 1. Since o(0) = f(x), we write 


t,(x) — f(x) = [ F £\(= ae 
ie ol oe m t . 


where F(t) = o() + o(—1t) — 2¢(0). 


With these definitions 
) 2t 
= (gerne) +3) 


PF?) = o'°() + 6° (—d — 26'°(0), + even; 
= ¢(t) — ¢ (—2d, r odd. 


sin.” We set 


where 


and 


Consequently, 


~. la 2t 21 


iN la | 2 ; ; 21 
PD aris | (2 + >) = f(x) +f" (x) ~ (« aa x) |, rodd, 


PPro || 


IIA 


IA 


or, finally, 
F°°®) || = O(we(2t; f°”)). 


We also have, for r = 


-f [- [or Fe (u) dudt, gees te dt; 


'*(P), pp. 47-50. 










t t Plr—} 
F(é)|| s | tee | F’" (a) || dudt,_, «++ dt,dt, 
rt t "tri 
= o| [ toe | we(2u; f°’) dudt,. sae its | 
0 0 J v0 


t ty) Pbr—y ‘ 
r(£) s o| 3 [ [ be i wo( a f°) duty oe atts. 
m m Jo Jo 0 m 


Since f (x) is periodic, 


2 , ] : 
wo( 2 5  s (2u + Deo(, f°), 
m m 
Thus we have 


r(£) = 0 w( f°) | | ee [ . (Qu + 1) dudt,_, --- itd 
m m m 1) ( 0 


so that 


Thus 


Now 
“% ‘ 2h+4 
.-sil<— | r(! (= ‘) a 
r(X + 2) J, m t 
ce | . ls +4) 
= Om wo( -f' ‘ t * ‘) dt) 
m J . 4 ) 
, ' : 
= Om we A fe 
m j 
Since m = (n + 1)(\ + 2) (2°, we have 
f 
t, My Oon wo( eis )} 
n" ) 


THEOREM 4. If f(x) ∈ Lip(α, p), p ≥ 1, 0 < α ≤ 1, then, for any positive
integer n, f(x) may be approximated in $L_p$ by a trigonometrical polynomial, $t_n(x)$,
of order n such that

$\|f - t_n\|_p = O(n^{-\alpha})$.

We put r = 0, λ = 0, and $\omega_p(n^{-1}; f) \le Mn^{-\alpha}$ in Theorem 3. For p > 1,
the result is obtained by taking $L_\Phi^*$ to be $L_p$. For p = 1, the result is obtained
by carrying out the proof with $L_1$ in place of $L_\Phi^*$.

Theorems 2 and 4 give the proof and range of Hardy and Littlewood's theorem.
The question concerning the remark following the theorem is answered by
Theorem 5.













In the following $s_n = s_n(f) = s_n(x; f)$ denotes the n-th partial sum of the
Fourier series of f(x) and $\sigma_n = \sigma_n(x; f)$ denotes the n-th (C, 1) mean of $s_n$.

LEMMA C. If $f(x) \in L_p$ and $t_n(x)$ is an arbitrary trigonometric polynomial of
degree n ≥ 1 at most, then

(i) if p > 1, $\|f - s_n\|_p \le A\,\|f - t_n\|_p$,

(ii) if p = 1, $\|f - s_n\|_1 \le A(1 + \log n)\,\|f - t_n\|_1$,

where A is independent of f(x) and n.

We may write

$\|f - s_n\| = \|f - t_n + t_n - s_n\| \le \|f - t_n\| + \|t_n - s_n\|$.

The trigonometric polynomial $s_n(x; f) - t_n(x) = s_n(x; f - t_n)$. Hence for
p > 1, by an inequality of M. Riesz,¹⁶ we have

$\|s_n - t_n\|_p \le A_p\,\|f - t_n\|_p$, p > 1.

When p = 1, we have

$\|s_n - t_n\|_1 = \frac{1}{\pi} \int_0^{2\pi} \Big|\int_0^{2\pi} [f(x + u) - t_n(x + u)]\,D_n(u)\,du\Big|\,dx$,

where $D_n(u)$ is the Dirichlet kernel. Interchanging the order of integration,
we have

$\|s_n - t_n\|_1 \le \|f - t_n\|_1\,\frac{1}{\pi} \int_0^{2\pi} |D_n(u)|\,du \le A(1 + \log n)\,\|f - t_n\|_1$

since¹⁷

$\frac{1}{\pi} \int_0^{2\pi} |D_n(u)|\,du \simeq \frac{4}{\pi^2} \log n$.
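The logarithmic growth of the integral of |D_n| that produces the factor (1 + log n) in case (ii) can be observed numerically. The sketch below is an illustration under stated assumptions: the names `dirichlet` and `lebesgue_constant` are hypothetical, the kernel is taken in the closed form sin((n + 1/2)u) / (2 sin(u/2)), and the integral is approximated by a midpoint rule on (0, π), using the evenness of the kernel.

```python
import math


def dirichlet(n, u):
    """Dirichlet kernel D_n(u) = 1/2 + cos u + ... + cos nu, in closed form (u != 0)."""
    return math.sin((n + 0.5) * u) / (2.0 * math.sin(0.5 * u))


def lebesgue_constant(n, points=100000):
    """Midpoint-rule value of (1/pi) * integral over one period of |D_n(u)| du."""
    step = math.pi / points
    total = sum(abs(dirichlet(n, (j + 0.5) * step)) for j in range(points))
    return (2.0 / math.pi) * total * step
```

Comparing `lebesgue_constant(n)` with `(4 / math.pi**2) * math.log(n)` for increasing n shows the difference staying bounded, which is what the asymptotic relation asserts.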
0 i 


THEOREM 5. If f(x) ∈ Lip(α, p), 0 < α ≤ 1, then

(i) when p > 1, $\|f - s_n\|_p = O(n^{-\alpha})$;

(ii) when p = 1, $\|f - s_n\|_1 = O(n^{-\alpha} \log n)$.

For p > 1, α < 1, O-large cannot be replaced by o-small and there exist functions
in Lip(1, 1) for which $\|f - s_n\| \ne o(n^{-1} \log n)$.

The positive assertion of this theorem is a corollary of Theorem 4 by application
of Lemma C.

To show that O-large cannot be replaced by o-small for p > 1, α < 1 consider
the function

$f(x) = \sum_{n=1}^{\infty} \frac{\cos 2^n x}{2^{n\alpha}}$, 0 < α < 1.


 (Z), $7.3, (1). Since jj si] S || sti] + Si, op > 1, we have | s, || S QA, + DIS i 


1 (Z), $8.3. 










This function¹⁸ is in Lip(α) for each α and, a fortiori, in Lip(α, p) for each α
and all p. We have,¹⁹ however,


Jj — || 2 B( PB ::) ¥ o2™*). 


The function

$f(x) = \sum_{\nu=1}^{\infty} \frac{\sin \nu x}{\nu}$

is in the class Lip(1, 1). Making a slight change in notation, we set


% sin 3mt\* 
t.(x) = t,(x;f) = hn | fla + o( =) dt, 


m sin 3t 
where n = 2m — 2 
_. ‘(@s) y 
L msin Ht)“ 
Our definition” 


= 2 and 
of t,(x) differs slightly from that of Theorem 3. 
For 


zx . 
sin vr 
f(x) = p> is : 


vel v 


. 1 4 
; Sin omar 
t,(x;f) = th, ae —}. 
m sin 5r 
write 


"St sin v( sin dnt \*! 
t.(x;f) h, i : in vr + °( in it) dt 
-e v= v 


m sin 43t 


n ot r . 1 4 
a = In ve — (= 3%) dt 
= a 2 : m sin 3 


a ss 
’ sin gml 
t.(z;f) = he te COS Vr cos vl _ dt. 
vel ® m sin MM 
i* (Z), §2.9, (3) 


9 (Z), $9.602, (1) 
20 We ¢ 


are here using the notation of Dunham Jackson, (1), page 3, rather than that of 
de la Vallée Poussin. Howe 


we have 


To show this we 


The n 


ver 


Pm sin hmt \* 
ta —-f hm fle +t) f€r)) dt 
. m sin 4 ; 
‘ sini 4mm \* ! 
Ol im t dt 0 
- m win ¥ n 













This equation implies 


tie 


t'.(x; f) = th», (sz inz) oa 


m sin $x 
Now 
n 
, 
sn(z;f) = Dd cos vx = D,(x) — }, 
y=l 


where D,(x2) is the Dirichlet kernel. 
By an inequality of F. Riesz,” 


, ' l , , ig P sin }mu . 
[| tn — Sn || =- | tn — 82 || = : | D,(u) — wha (sin) du 
n nj -« m sin fu 
, , h, [* {sin} 4 ) 
a: [Daud | du — The (Sam) gy = aE" 
n J-s n J ~\msin }u n n 
This gives | t, — s, |) ¥ o(n' log n). Since 
Wf —talli tis — s|| Zit — &, and f—t.|| = O(n"), 


we have || f — s, |) # o(n' log n). 
THEOREM 6. If f(x) ∈ Lip(α, p), 0 < α ≤ 1, then

(i) if p > 1 or if p = 1, α < 1, $\|f - \sigma_n\| = O(n^{-\alpha})$;

(ii) if p = α = 1, $\|f - \sigma_n\| = O\Big(\frac{\log n}{n}\Big)$.

For the case (i), O-large cannot be replaced by o-small.


For the cases™ p 2 1,a@ < 1, and p = a = 1 we write 


llo, —f {| = : I [ [f(a + O — f(a) K,(0 dt dz, 
wT Jo 0 




where K(f) is the Fejér kernel. The result follows by Minkowski’s inequality” 


hon OS 
[ ("K,(t)dt = On), a<: 


= O(n '‘logn), a= 1. 


When p > 1, f(x) « Lip(, p) is equivalent to the indefinite integral of a function 


in L Since 


3) ie oe 
66 n+ yl n+1 


i 0», s 


| f’ | O(n), 


²¹ (Z), §7.31, (b).

²² The case p > 1 and α < 1 was obtained by O. Szász, Über die Fourierschen Reihen
gewisser Funktionenklassen, Mathematische Annalen, vol. 100 (1928), pp. 530-536.

²³ (Z), §4.18, (A).







where $\tilde{s}_n'(x)$ is the n-th partial sum of the conjugate derived series of f(x), the
result for p > 1, α = 1 follows from Theorem 5 and the inequality

$\|f - \sigma_n\| \le \|f - s_n\| + \|s_n - \sigma_n\|$.

To show that O-large cannot be replaced by o-small in (i) we apply Lemma C
to Theorem 5 for the case p > 1, α < 1. For p > 1, α = 1 consider f(x) = cos x.
This function belongs to Lip(1, p) for every p. But

$\|f - \sigma_n\| = \frac{1}{n + 1}\,\|f\| \ne o\Big(\frac{1}{n}\Big)$.

Corresponding to the case²⁴ p = ∞, if f(x) can be approximated in $L_p$, p ≥ 1,
by $s_n(x; f)$ such that the order of the approximation is ω(n), it follows from
Lemma C that no approximation of f(x) by a trigonometrical polynomial of
order n can, for p > 1, be o[ω(n)], and, for p = 1, $o[\omega(n)(1 + \log n)^{-1}]$.

Also corresponding to the case²⁵ p = ∞ we have

THEOREM 7. A necessary and sufficient condition that a function f(x) periodic
in 2π belong to the class Lip(α, p), 0 < α ≤ 1, p ≥ 1, is that $\sigma_n(x; f)$ belong
to Lip(α, p) uniformly in n.

The necessity follows immediately from the inequality $\|\sigma_n\| \le \|f\|$ and
the sufficiency from the same inequality on the application of the Fatou lemma.

THEOREM 8. Let F(x) be periodic in 2π and the indefinite integral of a function
$f(x) \in L_p$, p ≥ 1. Then

(i) $\|F - s_n(F)\| \le \frac{A}{n}\,\|f - s_n(f)\|$, p > 1;

(ii) $\|F - s_n(F)\| \le A\,\frac{1 + \log n}{n}\,\|f - s_n(f)\|$, p = 1.

Let

$f(x) \sim \sum_{\nu} (a_\nu \cos \nu x + b_\nu \sin \nu x)$

(in order that F(x) be periodic $a_0 = 0$) and let $\tilde{f}(x)$ denote the conjugate function.


Choose m >k > n. We may write 


s(z; F) — s(x; F) = Do (a, sin vr — 6, cos vx) 
n+l V 
= [s,.(a3f) — sala; f)] 
n+ 1 


1 


, . oe 
X vv + py [mls J) s(a;f)] 


| 


ileal; J) — x(a; f)I. 


24 (P), p. 22. 
28 (Z), §4.719 











Then 
; re 1 ; . 
lis’) — s,(F) || < | Sm — 8, I 
i} ra AF) on 4. l il (f) (/) || 
» ] le ff) «= of fii 
+ )» * : jy | mF) = s(F) I] + fll tm(A) — su(A) 


First let m — » and thenk— ». We obtain,” for p > 1, 


| F — s,(F) || | F — s.(f) || + }> |F — sf) 
G47 


IIA 


1 


sy lls- s(J) || + Ap p> ery llS— Wl 


and (i) follows since, for ν > n, $\|f - s_\nu(f)\| \le M\,\|f - s_n(f)\|$.
The above proof is due to the referee; it eliminates the log n from the first
part of the theorem. For p = 1


or 


F(x) — s,(4; F) = > ! (a, sin vx — b, cos vr) 
v=l 
= = [ (fle +t) — s,(x + t;f)} (> sat) a 


The interchange of summation and integration is possible since f(x) ∈ L and
$\sum \nu^{-1} \sin \nu x$ is boundedly convergent. We have


n+1 
||F - s,(F) || < tf’ ([’ fiz +t) — s(z + vate > ad dt 


‘\f- oti f. Ele 


intl 


*) SS sin vt! 1 + log ") 
the a 
[iz - dt < i( = 


and the theorem follows. 


Fil 


By Theorem 5 


UNIVERSITY OF FLORIDA.


26 (Z), §§7.21, 7.3. 








A NOTE ON NON-ASSOCIATIVE ALGEBRAS 
By N. JACOBSON 


It is the purpose of this note to obtain relations between an arbitrary algebra
$\mathfrak{R}$ (not necessarily associative) and an algebra $\mathfrak{A}$ (necessarily associative) of
linear transformations determined by $\mathfrak{R}$. If $\mathfrak{R}$ is simple, the centrum $\mathfrak{C}$ of $\mathfrak{A}$
is an algebraic field and $\mathfrak{R}$ may be regarded as an algebra over $\mathfrak{C}$. When this
is done $\mathfrak{R}$ becomes a normal simple algebra, i.e., remains simple when this field
is extended to its algebraic closure. A field having this property for algebras
of characteristic 0 has been defined previously but less directly by Landherr.¹
Some of our results have been announced for Lie algebras of characteristic 0
by Albert.²


1. Let $\mathfrak{R}$ be an arbitrary algebra (not necessarily associative or commutative)
with a finite basis over a commutative field Φ; $\mathfrak{R}$ is a finite dimensional vector
space over Φ in which there is defined a composition xy of pairs of elements
x, y such that

(1) (x + y)z = xz + yz, z(x + y) = zx + zy,
(2) (xy)α = x(yα) = (xα)y, α ∈ Φ.

The mapping $x \to xa \equiv xA_r$ of $\mathfrak{R}$ on itself will be called the right multiplication
determined by a. Equations (1) and (2) show that $A_r$ is a linear transformation
in the vector space $\mathfrak{R}$. Similarly we define the left multiplication determined
by a as $x \to ax \equiv xA_l$. Let $\mathfrak{A}$ be the enveloping algebra of the left and right
multiplications of $\mathfrak{R}$, i.e., the smallest algebra of linear transformations in $\mathfrak{R}$
containing all the left and right multiplications. The elements of $\mathfrak{A}$ are sums
of terms of the type $A_{1\alpha_1} \cdots A_{s\alpha_s}$ ($\alpha_i$ = r or l) where $A_{i\alpha_i}$ is a multiplication
determined by $a_i$. We shall therefore denote an arbitrary element of $\mathfrak{A}$ by
$\sum A_{1\alpha_1} \cdots A_{s\alpha_s}$ (not summed on $\alpha_i$!). Thus $\mathfrak{A}$ may also be defined as the smallest
ring of linear transformations containing all the multiplications.
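The construction of the enveloping algebra can be made concrete for a small example. The sketch below is illustrative only and rests on assumptions not in the text: the algebra is the three-dimensional space with the vector cross product as composition (a standard non-associative example), elements are row vectors so that transformations act on the right as in the notation above, and all function names are hypothetical. The dimension of the enveloping algebra is found by closing the span of the multiplications under right multiplication by the generators.

```python
from fractions import Fraction


def cross_product_table():
    """table[i][j] holds the coordinates of the product a_i a_j (cross product)."""
    table = [[[0, 0, 0] for _ in range(3)] for _ in range(3)]
    for i, j, k in ((0, 1, 2), (1, 2, 0), (2, 0, 1)):
        table[i][j][k] = 1
        table[j][i][k] = -1
    return table


def right_multiplication(table, j):
    # Row i holds the coordinates of a_i a_j, so a row vector x is sent to x a_j.
    return [list(table[i][j]) for i in range(len(table))]


def left_multiplication(table, j):
    # Row i holds the coordinates of a_j a_i, so a row vector x is sent to a_j x.
    return [list(table[j][i]) for i in range(len(table))]


def mat_mul(A, B):
    n = len(A)
    return [[sum(A[i][k] * B[k][j] for k in range(n)) for j in range(n)] for i in range(n)]


def reduce_against(basis, vec):
    """Subtract multiples of the stored echelon vectors so their pivots vanish in vec."""
    vec = list(vec)
    for pivot, b in basis:
        if vec[pivot] != 0:
            factor = vec[pivot] / b[pivot]
            vec = [v - factor * w for v, w in zip(vec, b)]
    return vec


def enveloping_dimension(generators):
    """Dimension of the span of all finite products of the given matrices."""
    basis = []                     # pairs (pivot index, reduced flattened matrix)
    queue = list(generators)
    while queue:
        M = queue.pop()
        flat = [Fraction(x) for row in M for x in row]
        v = reduce_against(basis, flat)
        pivot = next((i for i, x in enumerate(v) if x != 0), None)
        if pivot is None:
            continue               # M already lies in the current span
        basis.append((pivot, v))
        queue.extend(mat_mul(M, G) for G in generators)
    return len(basis)
```

For the cross product the left multiplications are the negatives of the right ones, and the enveloping algebra turns out to be the full ring of 3-rowed matrices (dimension 9), an irreducible system, in agreement with the simplicity of this algebra.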

If $a_1, \dots, a_n$ is a basis for $\mathfrak{R}$ over Φ and A is a linear transformation in this
vector space, then A is completely determined by the matrix $(\alpha_{ij})$ such that
$a_i A = \sum_j a_j \alpha_{ji}$. The correspondence between A and the matrix $(\alpha_{ij})$ determines,
as is well known, a reciprocal isomorphism between the ring of all linear transformations
in $\mathfrak{R}$ over Φ and the matrix ring $\Phi_n$ of all n-rowed square matrices

Received February 11, 1937; presented to the American Mathematical Society, April 9,
1937. The author is a National Research Fellow.

¹ W. Landherr, Über einfache Liesche Ringe, Hamb. Abhandlungen, vol. 11 (1935), pp.

² Bull. Am. Math. Soc., vol. 41 (1935), p. 344.















with coordinates in Φ. In particular $\mathfrak{A}$ may be represented (reciprocally) as a
subring of $\Phi_n$.

If Σ is an extension of the field Φ, we define $\mathfrak{R}_\Sigma$ to be the algebra over Σ having
the same basis as $\mathfrak{R}$ has over Φ, i.e., if $\mathfrak{R} = a_1\Phi + \cdots + a_n\Phi$, then $\mathfrak{R}_\Sigma =
a_1\Sigma + \cdots + a_n\Sigma$. Any linear transformation A in $\mathfrak{R}$ over Φ has a unique
extension to a linear transformation in $\mathfrak{R}_\Sigma$ over Σ. The matrix of A (relative
to the basis $a_1, \dots, a_n$) and of its extension are, of course, identical. It
follows readily from the matrix representation that the enveloping algebra of
the multiplications of $\mathfrak{R}_\Sigma$ is the extension algebra $\mathfrak{A}_\Sigma$.

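As usual we define a (two-sided) ideal $\mathfrak{S}$ of $\mathfrak{R}$ as a subspace of $\mathfrak{R}$ such that
$\mathfrak{S} \ni zx, xz$ for all $z \in \mathfrak{S}$ and $x \in \mathfrak{R}$. Thus $\mathfrak{S}$ is a subspace invariant under all
the left and right multiplications and hence under all the transformations of $\mathfrak{A}$.
$\mathfrak{R}$ is a direct sum of the ideals $\mathfrak{R}_1, \dots, \mathfrak{R}_s$ ($\mathfrak{R} = \mathfrak{R}_1 \oplus \cdots \oplus \mathfrak{R}_s$) if every x
in $\mathfrak{R}$ is expressible uniquely as $x_1 + \cdots + x_s$, $x_i$ in $\mathfrak{R}_i$. This notion coincides
with that of decomposability of $\mathfrak{R}$ relative to the system $\mathfrak{A}$. Since $x_i x_j \in$ the
intersection $\mathfrak{R}_i \cap \mathfrak{R}_j = 0$, $x_i x_j = 0$ for any $x_i \in \mathfrak{R}_i$, $x_j \in \mathfrak{R}_j$ (i ≠ j). $\mathfrak{R}$ is simple if it
has no proper ideal, or in other words, if $\mathfrak{A}$ is an irreducible system of linear
transformations.³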

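THEOREM 1. A necessary and sufficient condition that $\mathfrak{R}$ be a direct sum of
simple algebras is that $\mathfrak{A}$ be a completely reducible system.

If $\mathfrak{R} = \mathfrak{R}_1 \oplus \cdots \oplus \mathfrak{R}_s$, where the $\mathfrak{R}_i$ are simple, then the $\mathfrak{R}_i$ are irreducible
subspaces and $\mathfrak{A}$ is completely reducible. Conversely if the $\mathfrak{R}_i$ are irreducible,
they are simple. For let $\mathfrak{S}_i$ be an ideal relative to $\mathfrak{R}_i$, i.e., $x_i z_i, z_i x_i \in \mathfrak{S}_i$ for
all $x_i \in \mathfrak{R}_i$, $z_i \in \mathfrak{S}_i$. Since $x_j z_i = z_i x_j = 0$ for $x_j \in \mathfrak{R}_j$ (j ≠ i), we have $x z_i$,
$z_i x \in \mathfrak{S}_i$, and so $\mathfrak{S}_i$ is an ideal of $\mathfrak{R}$ and hence an invariant subspace relative
to $\mathfrak{A}$. This contradicts the irreducibility of $\mathfrak{R}_i$.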

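We recall that an algebra $\mathfrak{A}$ of linear transformations in a vector space $\mathfrak{R}$
is completely reducible if it is semi-simple. Suppose conversely that $\mathfrak{A}$ is
completely reducible, say, $\mathfrak{R} = \mathfrak{R}_1 \oplus \cdots \oplus \mathfrak{R}_k$, where the $\mathfrak{R}_i$ are irreducible
invariant subspaces, and let $\mathfrak{N}$ be a nilpotent ideal of $\mathfrak{A}$. If $\mathfrak{B}$ is any
subalgebra of $\mathfrak{A}$, and $\mathfrak{S}$ a subspace of $\mathfrak{R}$, we denote the subspace of elements $\sum yB$,
$y \in \mathfrak{S}$, $B \in \mathfrak{B}$ by $\mathfrak{S}\mathfrak{B}$. Since $(\mathfrak{S}\mathfrak{B})\mathfrak{A} = \mathfrak{S}(\mathfrak{B}\mathfrak{A})$, $\mathfrak{S}\mathfrak{B}$ is invariant if $\mathfrak{B}$ is a right
ideal. In particular $\mathfrak{R}_i\mathfrak{N}$ is invariant, and since the $\mathfrak{R}_i$ are irreducible, either
$\mathfrak{R}_i\mathfrak{N} = 0$ or $\mathfrak{R}_i\mathfrak{N} = \mathfrak{R}_i$. But $\mathfrak{R}_i\mathfrak{N} = \mathfrak{R}_i$ implies $\mathfrak{R}_i = \mathfrak{R}_i\mathfrak{N}^p = 0$ if $p$ is
sufficiently high. Thus $\mathfrak{R}_i\mathfrak{N} = 0$ and $\mathfrak{R}\mathfrak{N} = 0$, i.e., $\mathfrak{N} = 0$ and so $\mathfrak{A}$ is semi-simple.
By Theorem 1 we have therefore

THEOREM 2. A necessary and sufficient condition that $\mathfrak{R}$ be a direct sum of
simple algebras is that $\mathfrak{A}$ be semi-simple.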


³ For definitions of irreducibility, direct sum, complete reducibility, equivalence
(operator-isomorphism) for systems of linear transformations, see van der Waerden's Moderne
Algebra, vol. II, 1931, §108. We shall also require a number of results on the structure
and representation of semi-simple algebras. These may be found in §§115, 116, 118, 119,
121 of van der Waerden's book.








546 N. JACOBSON 


An element $z$ is an absolute zero-divisor if $z \neq 0$ and $zx = 0 = xz$ for all
$x$ in $\mathfrak{R}$. Consider $\mathfrak{R}_\Sigma$ where $\Sigma$ is an extension of $\Phi$ and suppose that $z' =
a_1\xi_1 + \cdots + a_n\xi_n$ is an absolute zero-divisor. If $a_i a_j = \sum_k a_k\gamma_{kij}$ ($\gamma \in \Phi$), then
$z'a_j = a_j z' = 0$ implies that $\sum_i \gamma_{kij}\xi_i = 0$ and $\sum_i \gamma_{kji}\xi_i = 0$. Since these linear
homogeneous equations have a non-trivial solution $\xi_1, \cdots, \xi_n$ in $\Sigma$, they also
have one, say, $\zeta_1, \cdots, \zeta_n$ in $\Phi$, and so $\sum a_i\zeta_i$ is an absolute zero-divisor in $\mathfrak{R}$.
Thus $\mathfrak{R}_\Sigma$ has absolute zero-divisors if and only if $\mathfrak{R}$ has. We note also that
a simple algebra $\mathfrak{R}$ has no absolute zero-divisors unless $\mathfrak{R} = z\Phi$ where $z^2 = 0$.
We suppose from now on that $\mathfrak{R}$ has no absolute zero-divisors. With this
restriction we have

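THEOREM 3. If $\mathfrak{R} = \mathfrak{R}_1 \oplus \cdots \oplus \mathfrak{R}_k$ and the $\mathfrak{R}_i$ are simple, then $\mathfrak{A} =
\mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_k$ and the $\mathfrak{A}_i$ are simple, and conversely. $\mathfrak{A}_i$ is the enveloping
algebra of the left and right multiplications of $\mathfrak{R}_i$ (acting in $\mathfrak{R}$).

Let $\mathfrak{R} = \mathfrak{R}_1 \oplus \cdots \oplus \mathfrak{R}_k$ and $\mathfrak{A}_i$ be the enveloping algebra of the
multiplications of $\mathfrak{R}_i$. The elements of $\mathfrak{A}_i$ map $\mathfrak{R}_j$ ($j \neq i$) on 0 and $\mathfrak{R}$ on a subspace $\neq 0$
of $\mathfrak{R}_i$. It follows directly that $\mathfrak{A} = \mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_k$. $\mathfrak{A}_i \neq 0$ since the
elements of $\mathfrak{R}_i$ are not absolute zero-divisors. Since the transformations of $\mathfrak{A}_i$
map $\mathfrak{R}_j$ ($j \neq i$) on 0, the algebra $\mathfrak{A}_i$ is isomorphic to the enveloping algebra of the
multiplications of $\mathfrak{R}_i$ acting in $\mathfrak{R}_i$. The latter is simple since it is an irreducible
system of linear transformations and hence $\mathfrak{A}_i$ is simple also. Conversely if
$\mathfrak{A} = \mathfrak{A}_1 \oplus \cdots \oplus \mathfrak{A}_k$, where the $\mathfrak{A}_i$ are simple, $\mathfrak{A}$ is completely reducible and
hence $\mathfrak{R} = \mathfrak{R}_1 \oplus \cdots \oplus \mathfrak{R}_{k'}$, where the $\mathfrak{R}_i$ are simple algebras. By the first
part and the uniqueness of the decomposition of an algebra as a direct sum of
simple algebras we conclude that $k = k'$ and $\mathfrak{A}_i$ is the enveloping algebra of
the multiplications of $\mathfrak{R}_i$.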

COROLLARY. $\mathfrak{R}$ is simple if and only if $\mathfrak{A}$ is.


2. Let $\mathfrak{C}$ denote the centrum of $\mathfrak{A}$. If $\mathfrak{R}$ is itself associative and has an
identity, $\mathfrak{C}$ coincides with the multiplications determined by the elements of
the centrum $C'$ of $\mathfrak{R}$. For if $c \in C'$, $c_r = c_l \in \mathfrak{C}$, and if $C \in \mathfrak{C}$ and $1C = c$, then
$xC = (1x)C = (1C)x = cx = (x1)C = x(1C) = xc$, so that $c \in C'$ and $C =
c_r = c_l$. If $\mathfrak{R}$ is associative but has no identity, we may adjoin an identity
to it and repeat the argument. We then obtain the fact that $\mathfrak{C}$ is the algebra
of linear transformations determined by the multiplications of $C'$ plus the
identity mapping. When $\mathfrak{R}$ is arbitrary we shall call $\mathfrak{C}$ the extended centrum
of $\mathfrak{R}$.

If $\mathfrak{R}$ is simple, so is $\mathfrak{A}$ and hence $\mathfrak{C} = \mathrm{P}$ is an algebraic field of finite order
over $\Phi$. If $\xi \in \mathrm{P}$,

$(xy)\xi = (x\xi)y = x(y\xi),$

and so $\mathfrak{R}$ may be regarded as an algebra over $\mathrm{P}$.

An algebra $\mathfrak{R}$ over $\Phi$ will be called normal simple if $\mathfrak{R}_\Omega$, the algebra obtained
by extending $\Phi$ to its algebraic closure $\Omega$, is simple.










THEOREM 4. $\mathfrak{R}$ is normal simple if and only if it is simple and its extended
centrum consists of the multiples of the identity transformation.

By hypothesis the centrum of the simple algebra $\mathfrak{A}$ consists of the $\Phi$-multiples
of 1. It is a well-known result that $\mathfrak{A}_\Omega$ is simple in this case. But $\mathfrak{A}_\Omega$ is the
enveloping algebra of the multiplications of $\mathfrak{R}_\Omega$, and hence by the corollary to
Theorem 3 the latter is simple. On the other hand, if $\mathfrak{C}$ is larger than $\Phi$,
$\mathfrak{A}_\Omega$ is not simple⁴ and hence $\mathfrak{R}$ is not normal simple.

Thus if $\mathfrak{R}$ is an arbitrary simple algebra, it becomes normal simple when
regarded as an algebra over its extended centrum.

THEOREM 5. If $\mathfrak{R}$ is simple and has order $n$ over its extended centrum $\mathrm{P}$, then
$\mathfrak{A} = \mathrm{P}_n$, the algebra of $n$-rowed square matrices with coefficients in $\mathrm{P}$.

We regard $\mathrm{P}$ as the underlying field. Since $\mathfrak{R}_\Omega$ is simple, $\mathfrak{A}$ is an absolutely
irreducible system of linear transformations. It follows from Burnside's
theorem that $\mathfrak{A}$ contains $n^2$ linearly independent linear transformations and
hence is isomorphic to $\mathrm{P}_n$.

More generally the structure of $\mathfrak{A}$ when $\mathfrak{R}$ is a direct sum of simple algebras
may be deduced from Theorems 3 and 5.


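3. Now suppose that $\mathfrak{R}$ is an associative algebra with an identity. It is well
known that the right multiplications form an algebra $\mathfrak{R}_r$ isomorphic to $\mathfrak{R}$ and
the left multiplications form an algebra $\mathfrak{R}_l$ reciprocally isomorphic to $\mathfrak{R}$.
$\mathfrak{R}_r$ ($\mathfrak{R}_l$) is the totality of linear transformations in the vector space $\mathfrak{R}$
commutative with those of $\mathfrak{R}_l$ ($\mathfrak{R}_r$). Thus $\mathfrak{R}_r \cap \mathfrak{R}_l = \mathfrak{C}$.

If $\mathfrak{R}$ is normal simple, $\mathfrak{R}_r \cap \mathfrak{R}_l = 1\Phi$. If the order of $\mathfrak{R}$ over $\Phi$ is $(\mathfrak{R}:\Phi) = n$, then
by Theorem 5, $(\mathfrak{A}:\Phi) = n^2 = (\mathfrak{R}_r:\Phi)(\mathfrak{R}_l:\Phi)$. Thus $\mathfrak{A}$ is a direct product of
$\mathfrak{R}_r$ and $\mathfrak{R}_l$ and so we have obtained an elementary proof of the following theorem
due to Brauer:⁵

THEOREM 6. The direct product of a normal simple algebra and its reciprocal
algebra is a complete matric algebra.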
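
The dimension count behind Theorems 5 and 6 can be checked by machine in a small case. The sketch below is an illustration only, not part of the original argument: it assumes the rational quaternions as the normal simple algebra (order n = 4), and the helper names `qmul`, `transformation` and `rank` are hypothetical. It computes the rank of the 16 maps x -> a x b, for a and b in the basis, which span the enveloping algebra of the left and right multiplications.

```python
from fractions import Fraction
from itertools import product

def qmul(p, q):
    """Hamilton product of two quaternions given as 4-tuples over (1, i, j, k)."""
    a1, b1, c1, d1 = p
    a2, b2, c2, d2 = q
    return (a1*a2 - b1*b2 - c1*c2 - d1*d2,
            a1*b2 + b1*a2 + c1*d2 - d1*c2,
            a1*c2 - b1*d2 + c1*a2 + d1*b2,
            a1*d2 + b1*c2 - c1*b2 + d1*a2)

# Basis 1, i, j, k of the (assumed) example algebra.
BASIS = [tuple(int(i == j) for j in range(4)) for i in range(4)]

def transformation(a, b):
    """Flattened 4x4 matrix of the linear map x -> a*x*b."""
    rows = [qmul(qmul(a, e), b) for e in BASIS]
    return [Fraction(v) for row in rows for v in row]

def rank(vectors):
    """Rank over the rationals by Gaussian elimination."""
    m = [row[:] for row in vectors]
    r = 0
    for col in range(len(m[0])):
        pivot = next((i for i in range(r, len(m)) if m[i][col] != 0), None)
        if pivot is None:
            continue
        m[r], m[pivot] = m[pivot], m[r]
        for i in range(len(m)):
            if i != r and m[i][col] != 0:
                factor = m[i][col] / m[r][col]
                m[i] = [x - factor * y for x, y in zip(m[i], m[r])]
        r += 1
    return r

# Left and right multiplications together, and left multiplications alone.
envelope_dim = rank([transformation(a, b) for a, b in product(BASIS, BASIS)])
left_dim = rank([transformation(a, BASIS[0]) for a in BASIS])
print(envelope_dim, left_dim)
```

Under these assumptions the enveloping algebra has dimension n² = 16 while the left multiplications alone span only n = 4 dimensions, in agreement with $(\mathfrak{A}:\Phi) = (\mathfrak{R}_r:\Phi)(\mathfrak{R}_l:\Phi)$.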


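4. We return to the general case in which $\mathfrak{R}$ is not necessarily associative and
suppose that $x \to x^S$ is an automorphism of $\mathfrak{R}$ over $\Phi$, i.e.,

$(x + y)^S = x^S + y^S, \quad (x\alpha)^S = x^S\alpha, \quad (xy)^S = x^S y^S,$

and $S$ is (1-1). If $P = \sum A_{a_{i_1}} \cdots A_{a_{i_r}}$ (each $A_a$ denoting the right or left
multiplication determined by $a$) is an element of $\mathfrak{A}$,
we define $P^S = \sum A_{a_{i_1}^S} \cdots A_{a_{i_r}^S}$, where $A_{a^S}$ is the right or left multiplication
determined by $a^S$. $P^S$ is independent of the representation of $P$. For if
$\sum A_{a_{i_1}} \cdots A_{a_{i_r}} = \sum B_{b_{j_1}} \cdots B_{b_{j_s}}$, i.e.,

$x \left( \sum A_{a_{i_1}} \cdots A_{a_{i_r}} \right) = x \left( \sum B_{b_{j_1}} \cdots B_{b_{j_s}} \right)$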


⁴ If $\mathfrak{C}$ is separable, $\mathfrak{A}_\Omega$ is semi-simple though not simple, and if $\mathfrak{C}$ is inseparable, $\mathfrak{A}_\Omega$ has
a radical. Cf. van der Waerden, loc. cit., §119.
⁵ R. Brauer, Über Systeme hyperkomplexer Zahlen, Math. Zeits., vol. 30 (1929), p. 108.








for all $x$, then

$x^S \left( \sum A_{a_{i_1}^S} \cdots A_{a_{i_r}^S} \right) = x^S \left( \sum B_{b_{j_1}^S} \cdots B_{b_{j_s}^S} \right)$

for all $x$. Since $x^S$ ranges over all of $\mathfrak{R}$ when $x$ does, we have
$\sum A_{a_{i_1}^S} \cdots A_{a_{i_r}^S} = \sum B_{b_{j_1}^S} \cdots B_{b_{j_s}^S}$.
It follows that the correspondence $P \to P^S$ is an automorphism
of $\mathfrak{A}$ and $(xP)^S = x^S P^S$.

Any automorphism of an associative algebra induces an automorphism in
its centrum. Hence if $\mathfrak{R}$ is simple, an automorphism of $\mathfrak{R}$ defines an
automorphism in the field $\mathfrak{C} = \mathrm{P}$, the extended centrum.


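THEOREM 7. If $\mathfrak{R}$ is normal simple over $\mathrm{P}$ and $\mathrm{P} \supset \Phi$ such that $(\mathrm{P}:\Phi)$ is finite,
then the automorphisms of $\mathfrak{R}$ over $\Phi$ have the property $(x\xi)^S = x^S\xi^S$, where $\xi \in \mathrm{P}$
and $\xi \to \xi^S$ is an automorphism of $\mathrm{P}$.

Let $\mathfrak{G}$ be the group of automorphisms of $\mathfrak{R}$ over $\Phi$ and $\mathfrak{X}$ the subgroup
consisting of the automorphisms of $\mathfrak{R}$ over $\mathrm{P}$. Theorem 7 shows that $\mathfrak{X}$ is an
invariant subgroup of $\mathfrak{G}$ and that $\mathfrak{G}/\mathfrak{X}$ is isomorphic to a subgroup of the Galois
group $\mathfrak{g}$ of $\mathrm{P}$ over $\Phi$.

Now suppose that $\mathfrak{R} = \mathfrak{R}_0 \times \mathrm{P}$ ($= \mathfrak{R}_{0\mathrm{P}}$ regarded as an algebra over $\Phi$) where
$\mathfrak{R}_0$ is a normal simple algebra over $\Phi$. If $a_1, \cdots, a_n$ is a basis for $\mathfrak{R}_0$ over $\Phi$
or for $\mathfrak{R}$ over $\mathrm{P}$ and $s$ is an element of $\mathfrak{g}$, then the correspondence $x = \sum a_i\xi_i \to
\sum a_i\xi_i^s = x^{S_s}$ is an automorphism of $\mathfrak{R}$ such that $(x\xi)^{S_s} = x^{S_s}\xi^s$. Let
$\mathfrak{G}_0$ denote the subgroup of $\mathfrak{G}$ consisting of the elements $S_s$. Evidently $\mathfrak{G}_0 \cong \mathfrak{g}$
and $\mathfrak{G}_0 \cap \mathfrak{X} = 1$, the identity mapping. By Theorem 7 any element of $\mathfrak{G}$ has
the form $S_s H$ where $S_s \in \mathfrak{G}_0$ and $H \in \mathfrak{X}$. Hence $\mathfrak{G}/\mathfrak{X} \cong \mathfrak{G}_0 \cong \mathfrak{g}$. This result
may be used, as we shall show in another paper, to determine the automorphisms
of simple Lie algebras and simple continuous groups.


UNIVERSITY OF CHICAGO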

















THE INVERSION PROBLEM OF MÖBIUS
By EINAR HILLE


1. Introduction. The present paper represents an attempt to give a rigorous
treatment of certain inversion problems which have their origin in a little-known
paper by A. F. Möbius.¹

As a typical, though not the oldest, example of these inversion problems we
might take the linear functional equation with constant coefficients

(1.1)    $\sum_{n=1}^{\infty} a_n f(nz) = g(z),$

a formal solution of which has the form

(1.2)    $\sum_{n=1}^{\infty} b_n g(nz) = f(z).$

These problems all lead to the same infinite system of bilinear equations

(1.3)    $a_1 b_1 = 1, \qquad \sum_{d \mid n} a_d b_{n/d} = 0, \quad n > 1,$

for which the algorithm of Möbius seems a fitting name.
This algorithm is perhaps best known from the problem of finding the
reciprocal of an ordinary Dirichlet series, i.e., a solution of the problem

(1.4)    $\sum_{n=1}^{\infty} a_n n^{-s} \cdot \sum_{n=1}^{\infty} b_n n^{-s} = 1.$

We shall see that the properties of these series are fundamental in all these
inversion problems.

This observation suggests that there is a class of inversion problems associated
with the problem of expressing the reciprocal of a general Dirichlet series or,
still more generally, of a Laplace-Stieltjes integral as a function of the same class.
In general the reciprocal is not so expressible, but whenever it is, certain
functional equations of the type

(1.5)    $\int_1^{\infty} f(uz)\, dA(u) = g(z)$

have solutions of the form

(1.6)    $\int_1^{\infty} g(uz)\, dB(u) = f(z),$

Received April 24, 1937; presented to the American Mathematical Society, March 26, 1937. 
¹ Ueber eine besondere Art von Umkehrung der Reihen, Journal f. Math., vol. 9 (1832),
pp. 105-123; Gesammelte Werke, vol. IV, 1887, pp. 589-612.





550 EINAR HILLE 


where

(1.7)    $\int_1^{u} A\!\left(\frac{u}{v}\right) dB(v) = 1, \qquad 1 \le u.$

The last equation is the transcendental analogue of the algorithm of Möbius.

In §2 of the present paper there is a discussion of the original problem of
Möbius, of the algorithm of Möbius and of the problem of finding the reciprocal
of an ordinary Dirichlet series. While there is comparatively little that is
strictly new in this paragraph, the results are necessary for the rest of the paper
and do not appear to be well known. In §3 we discuss equation (1.1) and
various connected problems. In §4 we discuss equation (1.5) and the problem
of expressing the reciprocal of a Laplace-Stieltjes integral as an integral of the
same kind.


2. Some classical problems.
2.1. The algorithm of Möbius. Let $\mathfrak{A} = \{a_n\}$ be a given infinite sequence of
real or complex numbers. The sequence is proper or improper according as
$a_1 \neq 0$ or $= 0$, and in the former case it is normalized if $a_1 = 1$. Following
O. Hölder,² we call $\mathfrak{B} = \{b_n\}$ the reciprocal sequence of $\mathfrak{A}$ if the latter is proper
and $\mathfrak{A}$ and $\mathfrak{B}$ satisfy the algorithm of Möbius

(2.1.1)    $a_1 b_1 = 1, \qquad \sum_{d \mid n} a_d b_{n/d} = 0, \quad n > 1,$

or symbolically $\mathfrak{A}\mathfrak{B} = 1$. The underlying product definition is that of Dirichlet
multiplication, i.e., in general $\mathfrak{A}\mathfrak{B} = \mathfrak{C}$, where the sequence $\mathfrak{C} = \{c_n\}$ is defined by

(2.1.2)    $c_n = \sum_{d \mid n} a_d b_{n/d}.$

Thus, formally,

$\sum_{n=1}^{\infty} a_n n^{-s} \cdot \sum_{n=1}^{\infty} b_n n^{-s} = \sum_{n=1}^{\infty} c_n n^{-s}.$

In case of the reciprocal sequence, $\mathfrak{C}$ is simply the unit sequence $1, 0, 0, \cdots$.
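
The system (2.1.1) can be solved recursively, since the equation for $n$ involves $b_n$ only through the term $a_1 b_n$. The following is a minimal sketch of that recursion and of the Dirichlet product (2.1.2), using exact rational arithmetic; the function names and the truncation at N terms are assumptions made for illustration.

```python
from fractions import Fraction

def dirichlet_product(a, b):
    """c_n = sum over d | n of a_d * b_{n/d}; lists are 1-indexed (index 0 unused)."""
    N = min(len(a), len(b)) - 1
    c = [Fraction(0)] * (N + 1)
    for d in range(1, N + 1):
        for m in range(1, N // d + 1):
            c[d * m] += a[d] * b[m]
    return c

def reciprocal_sequence(a):
    """Solve a_1 b_1 = 1 and sum_{d|n} a_d b_{n/d} = 0 (n > 1) for b, term by term."""
    N = len(a) - 1
    if a[1] == 0:
        raise ValueError("improper sequence: a_1 = 0")
    b = [Fraction(0)] * (N + 1)
    b[1] = Fraction(1) / a[1]
    for n in range(2, N + 1):
        # All b_{n/d} with d > 1 are already known when b_n is computed.
        s = sum(a[d] * b[n // d] for d in range(2, n + 1) if n % d == 0)
        b[n] = -s / a[1]
    return b

N = 30
ones = [Fraction(0)] + [Fraction(1)] * N   # a_n = 1 for all n
mu = reciprocal_sequence(ones)             # reciprocal of the constant sequence
unit = dirichlet_product(ones, mu)         # should be the unit sequence 1, 0, 0, ...
print([int(x) for x in mu[1:11]])
```

For the constant sequence $a_n = 1$ the recursion returns the values of the Möbius function, and the product of the two sequences is the unit sequence.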
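The system (2.1.1) determines $\mathfrak{B}$ uniquely. We have

(2.1.3)    $b_n = \sum (-1)^{\alpha_1 + \alpha_2 + \cdots} C_{\alpha_1 \alpha_2 \cdots} (a_{d_1})^{\alpha_1} (a_{d_2})^{\alpha_2} \cdots,$

where the summation extends over all factorizations of $n = d_1^{\alpha_1} d_2^{\alpha_2} \cdots$, and
$C_{\alpha_1 \alpha_2 \cdots}$ is the combinatorial function which gives the number of possible
arrangements of a set consisting of $\alpha_1$ objects of one kind, $\alpha_2$ objects of a second,
etc. We have

(2.1.4)    $\sum (-1)^{\alpha_1 + \alpha_2 + \cdots} C_{\alpha_1 \alpha_2 \cdots} = \mu(n),$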


² Über gewisse der Möbiusschen Funktion $\mu(n)$ verwandte zahlentheoretische Funktionen,
die Dirichletsche Multiplikation und eine Verallgemeinerung der Umkehrungsformeln,
Berichte d. Sächs. Akad. d. Wiss., Math.-phys. Kl., vol. 85 (1933), pp. 1-28.

















the Möbius $\mu$-function, whereas

(2.1.5)    $\sum C_{\alpha_1 \alpha_2 \cdots} = \chi(n),$

the number of factorizations of $n$ into factors $\neq 1$ ($n \neq 1$, $\chi(1) = 1$) when
attention is paid to the order of the factors. $\chi(n)$ is a highly irregular function.³
For the following it is enough to note that

(2.1.6)    $\sum_{n=1}^{\infty} \chi(n) n^{-s} = [2 - \zeta(s)]^{-1}$

for $\Re(s) > \rho$, $\zeta(\rho) = 2$, and that

(2.1.7)    $\chi(n) < C_1 n^{\rho}$

for all values of $n$, whereas for every $\epsilon > 0$ there are infinitely many values
of $n$ for which

(2.1.8)    $\chi(n) > C_2 n^{\rho - \epsilon}.$
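
The sums (2.1.4) and (2.1.5) can be evaluated by running over ordered factorizations directly, since $C_{\alpha_1 \alpha_2 \cdots}$ counts the orderings of each unordered factorization. The sketch below is illustrative only; the names `chi` and `signed_count` are hypothetical. It splits off the first factor $d > 1$ and recurses on $n/d$.

```python
from functools import lru_cache

@lru_cache(maxsize=None)
def chi(n):
    """Number of ordered factorizations of n into factors > 1, with chi(1) = 1."""
    if n == 1:
        return 1
    return sum(chi(n // d) for d in range(2, n + 1) if n % d == 0)

@lru_cache(maxsize=None)
def signed_count(n):
    """Ordered factorizations weighted by (-1)^(number of factors), as in (2.1.4)."""
    if n == 1:
        return 1
    return -sum(signed_count(n // d) for d in range(2, n + 1) if n % d == 0)

chi_values = [chi(n) for n in range(1, 13)]
mu_values = [signed_count(n) for n in range(1, 13)]
print(chi_values)
print(mu_values)
```

For example $\chi(12) = 8$, corresponding to 12, 2·6, 6·2, 3·4, 4·3, 2·2·3, 2·3·2, 3·2·2, while the signed count reproduces $\mu(n)$.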

2.2. The reciprocation problem for ordinary Dirichlet series. That the
reciprocal of an ordinary Dirichlet series with $a_1 \neq 0$ can be represented by a series of
the same kind is well known. The best theorem in this connection is one due
to E. Landau.⁴

THEOREM 2.2.1. Let

(2.2.1)    $D(s; a_n) = \sum_{n=1}^{\infty} a_n n^{-s}, \qquad a_1 \neq 0,$

have a domain of convergence, and let the function represented by the series be
holomorphic and different from zero for $\sigma > \alpha$. Then

(2.2.2)    $[D(s; a_n)]^{-1} = \sum_{n=1}^{\infty} b_n n^{-s}$

is convergent for $\sigma > \alpha$.

This theorem lies quite deep. It naturally brings up the question whether 
it is possible to assign upper bounds for the real parts of the possible zeros of 
D(s; a,) and thus also for the abscissa of convergence of the reciprocal. The 
answer is in the affirmative and is fairly trivial. 


THEOREM 2.2.2. Let $\{r_n\}$ be a given sequence of positive numbers, $r_1 = 1$,
$r_n = O(n^{\kappa})$ for some fixed real $\kappa$. Consider the class $\mathfrak{D}$ of all Dirichlet series
$D(s; a_n)$ with $a_1 = 1$, $|a_n| = r_n$, $n \ge 2$. Let $S$ be the abscissa of convergence of
$D(s; r_n)$ and put $D(S + 0; r_n) = R \le \infty$. If $R > 2$, the equation

(2.2.3)    $D(\sigma; r_n) = 2$


³ See E. Hille, A problem in "factorisatio numerorum", Acta Arithmetica, vol. 2 (1936),
pp. 134-144. $\chi(n)$ is denoted by $f(n)$ in this paper.

⁴ Über den Wertevorrat von $\zeta(s)$ in der Halbebene $\sigma > 1$, Göttinger Nachrichten, 1933,
pp. 81-91, p. 90.











has a real root $\rho = \rho(\mathfrak{D}) > S$. No series of $\mathfrak{D}$ has any zeros in the half-plane
$\sigma > \rho$, whereas there exist series in $\mathfrak{D}$ having infinitely many zeros in the strip
$\rho - \epsilon < \sigma < \rho$ for every $\epsilon > 0$. If, on the other hand, $1 < R \le 2$, there are
no zeros of any series in $\mathfrak{D}$ for $\sigma > S$ and there are series having either zeros or
singular points in every strip $S - \epsilon < \sigma < S$.

Proof. The verification of the fact that the series in $\mathfrak{D}$ cannot have zeros
in the half-planes $\sigma > \rho(\mathfrak{D})$ and $\sigma > S$, respectively, is elementary and may be
left to the reader. Further, the series $[D(s; a_n)]^{-1} = D(s; b_n)$ are easily shown
to be absolutely convergent in the same half-planes.

If $R > 2$, we note that the series

$1 - \sum_{n=2}^{\infty} r_n n^{-s}$

is a member of the class $\mathfrak{D}$ and vanishes at $s = \rho(\mathfrak{D})$. It further has infinitely
many zeros in any strip $\rho(\mathfrak{D}) - \epsilon < \sigma < \rho(\mathfrak{D})$. If $1 < R \le 2$, the point
$s = S$ is a non-polar singularity of $D(s; r_n)$ and consequently also a singularity
of $[D(s; r_n)]^{-1}$. It follows that in either case the estimates given are the best
possible valid for the whole class $\mathfrak{D}$.
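
As a numerical illustration, take the special class with $r_n = 1$ for all $n$ (an assumption made here for the example), so that $D(\sigma; r_n) = \zeta(\sigma)$, $S = 1$ and $R = \infty$. Equation (2.2.3) then reads $\zeta(\sigma) = 2$, and its root is also the exponent $\rho$ of (2.1.6). The sketch below locates the root by bisection; the zeta routine is a simple partial sum with an Euler-Maclaurin tail correction, adequate only for real $s > 1$.

```python
def zeta(s, N=1000):
    """Riemann zeta for real s > 1: partial sum plus Euler-Maclaurin tail terms."""
    head = sum(n ** -s for n in range(1, N))
    tail = N ** (1 - s) / (s - 1) + 0.5 * N ** -s + s * N ** (-s - 1) / 12
    return head + tail

def solve_rho(lo=1.1, hi=3.0, tol=1e-12):
    """Bisection for zeta(sigma) = 2, using that zeta decreases on (1, infinity)."""
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if zeta(mid) > 2:
            lo = mid      # root lies to the right of mid
        else:
            hi = mid      # root lies to the left of mid
    return 0.5 * (lo + hi)

rho = solve_rho()
print(rho)
```

The computed root is approximately 1.7286, so in this class no series vanishes for $\sigma$ greater than about 1.7286.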

It would be of some interest to know if these estimates for the upper bound
of the real parts of the zeros are imposed upon us by a relatively small set of
elements in $\mathfrak{D}$ or if they represent the rule rather than the exception. A
discussion of this question in general calls for an interpretation of $\mathfrak{D}$ as a topological
space, i.e., a definition of closure, possibly based upon a definition of distance
or of measure.
But there is one very special case in which a complete answer is available
without any topology. Suppose that $a_n = 0$ unless $n$ is a prime, and put

$a_{p_k} = \alpha_k, \quad r_{p_k} = \rho_k; \quad D(s; a_n) = P(s; \alpha_k), \quad \mathfrak{D} = \mathfrak{P}.$

Let us suppose that $R > 2$. Using a classical theorem of H. Bohr on the
relation between the set of values of a Dirichlet series and of the associated power
series in infinitely many unknowns,⁵ we conclude that every series $P(s; \alpha_k)$ has
infinitely many zeros in every strip $\rho(\mathfrak{P}) - \epsilon < \sigma < \rho(\mathfrak{P})$. Indeed, the
associated power series is simply the linear form

$L(x) = 1 + \sum_{k} \alpha_k x_k = 1 + \sum_{k} \rho_k e^{i\theta_k} x_k;$

putting $x_k = -e^{-i\theta_k} p_k^{-\rho}$, we get $L(x) = 1 - \sum_{k} \rho_k p_k^{-\rho} = 0$. By Bohr's
theorem the value 0 is taken on infinitely often by $P(s; \alpha_k)$ in every strip
$\rho - \epsilon < \sigma < \rho$. Hence in this case all the reciprocal series in $\mathfrak{P}$ have the same
abscissa of convergence, viz., $\rho(\mathfrak{P})$.


⁵ Über die Bedeutung der Potenzreihen unendlich vieler Variabeln in der Theorie der
Dirichletschen Reihen $\sum a_n/n^s$, Göttinger Nachrichten, 1914, pp. 441-484, p. 451.





























2.3. Möbius' problem. Möbius⁶ raised the following question: given a power
series

(2.3.1)    $f(z) = \sum_{n=1}^{\infty} a_n z^n, \qquad a_1 \neq 0,$

find the expansion of $z$ in terms of the functions $f(z^n)$, $n = 1, 2, 3, \cdots$. Let it be

(2.3.2)    $z = \sum_{n=1}^{\infty} b_n f(z^n).$

A straightforward calculation shows that the $b$'s must satisfy the algorithm of
Möbius. Further, it is clear that if

(2.3.3)    $F(z) = \sum_{n=1}^{\infty} A_n z^n,$

then

(2.3.4)    $F(z) = \sum_{n=1}^{\infty} B_n f(z^n),$

where

(2.3.5)    $B_n = \sum_{d \mid n} b_d A_{n/d}.$

All this is highly formal and an analyst naturally wants to know the range of
validity of the formulas, conditions for convergence, etc.
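
A concrete instance of (2.3.3)-(2.3.5) may help fix ideas. The choices below are assumptions made for illustration: $f(z) = z/(1-z)$, so that $a_n = 1$ and $b_n = \mu(n)$, and $F(z) = z/(1-z)^2$, so that $A_n = n$. Formula (2.3.5) then gives $B_n = \sum_{d \mid n} \mu(d)\, n/d$, which is Euler's totient, and (2.3.4) becomes a classical Lambert series identity.

```python
def mobius(n):
    """Möbius function by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0          # a squared prime factor
            result = -result
        p += 1
    return -result if n > 1 else result

def expansion_coefficients(A, b, N):
    """B_n = sum over d | n of b_d * A_{n/d}, as in (2.3.5)."""
    return {n: sum(b(d) * A(n // d) for d in range(1, n + 1) if n % d == 0)
            for n in range(1, N + 1)}

N = 80
B = expansion_coefficients(lambda n: n, mobius, N)   # A_n = n, b_n = mu(n)

f = lambda z: z / (1 - z)          # a_n = 1
F = lambda z: z / (1 - z) ** 2     # A_n = n
z = 0.3
approx = sum(B[n] * f(z ** n) for n in range(1, N + 1))
print([B[n] for n in range(1, 11)], approx, F(z))
```

At $z = 0.3$ the partial sum of (2.3.4) agrees with $F(z)$ to rounding error, which is consistent with the range $|z| < 1$ in which the expansion is expected to hold.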

As a preliminary step in this study, let us suppose that the power series in
(2.3.1) has a circle of convergence, and form the adjoint power series

(2.3.6)    $\varphi(z) = \sum_{n=1}^{\infty} b_n z^n.$

We call $\varphi(z)$ the Möbius transform of $f(z)$,

(2.3.7)    $\varphi(z) = M[f(z)].$

The Möbius algorithm shows that conversely $f(z)$ is the Möbius transform
of $\varphi(z)$,

(2.3.8)    $M[M[f(z)]] = f(z),$

i.e., the Möbius transformation is an involution.

We must show that the power series in (2.3.6) is also convergent. This is
established in

THEOREM 2.3.1. Let the radii of convergence of the power series in (2.3.1) and
(2.3.6) be $R_1$ and $R_2$ respectively. If $0 < R_1 < 1$, then $R_2 = R_1$. If $1 \le R_1$,
then also $1 \le R_2$.


⁶ Loc. cit., Werke, vol. IV, p. 591.








Proof. Choose $R_0 < R_1$. Then there exists an $M$ such that $|a_n| < M R_0^{-n}$
for every $n$. Hence by (2.1.3)

$|b_n| < \sum C_{\alpha_1 \alpha_2 \cdots} M^{\alpha_1 + \alpha_2 + \cdots} R_0^{-(\alpha_1 d_1 + \alpha_2 d_2 + \cdots)}.$

If $R_1 \le 1$, then $R_0 < 1$. We can assume $M \ge 1$ without restricting the
generality. Further,

$\alpha_1 d_1 + \alpha_2 d_2 + \cdots \le n, \qquad \alpha_1 + \alpha_2 + \cdots \le p(n),$

where $p(n)$ denotes the total number of prime factors of $n$. Hence by (2.1.5)

(2.3.9)    $|b_n| < \chi(n) M^{p(n)} R_0^{-n}.$

Since $p(n) = O(\log n)$ and $\chi(n) = O(n^{\rho})$, we conclude that $R_0 \le R_2$ or $R_1 \le R_2$.

Suppose next that $R_1 > 1$. We can then choose $R_0 > 1$. Further,

$\alpha_1 d_1 + \alpha_2 d_2 + \cdots \ge \nu_1 p_{i_1} + \nu_2 p_{i_2} + \cdots = P(n)$

if $n = p_{i_1}^{\nu_1} p_{i_2}^{\nu_2} \cdots$, where the $p_{i_k}$ are the distinct prime factors of $n$. Hence

(2.3.10)    $|b_n| < \chi(n) M^{p(n)} R_0^{-P(n)}.$

But $P(n)$ is infinitely often $o(n)$. It follows that

$\limsup_{n \to \infty} |b_n|^{1/n} \le 1,$

or $R_2 \ge 1$.

In order to complete the proof for the case $R_1 < 1$, we use the involutory
character of the transformation. We have shown that $R_1 \le R_2$. If $R_2 < 1$,
we can conclude that $R_2 \le R_1$, i.e., $R_1 = R_2$, simply by noticing that $f(z)$ is the
Möbius transform of $\varphi(z)$. On the other hand, the assumption $R_2 \ge 1$ implies
by the same argument that $R_1 \ge 1$, and this contradicts the original assumption.
Hence $R_1 < 1$ implies $R_1 = R_2$.


If $R_1 = 1$, we may well have $R_2 > 1$. The situation becomes clearer by
introducing the associated Dirichlet series

$D(s; a_n) = \sum_{n=1}^{\infty} a_n n^{-s}, \qquad D(s; b_n) = \sum_{n=1}^{\infty} b_n n^{-s}.$

The assumption $R_2 > 1$ implies that $D(s; b_n)$ converges for all $s$ and is an entire
function of $s$. Hence $D(s; a_n)$ has a half-plane of absolute convergence and is
not merely a formal Dirichlet series; moreover, $a_n = O(n^{\kappa})$ for some finite
value of $\kappa$.

Thus if $R_1 = 1$, we have $R_2 = R_1$ unless $D(s; b_n)$ is an entire function, and then
$R_2 \ge R_1$. This case can arise only when $a_n = O(n^{\kappa})$.

If $R_1 > 1$, $D(s; a_n)$ is an entire function of $s$, and normally $R_2 = 1$, unless
$D(s; a_n) \neq 0$, in which case $R_2 \ge 1$.

After this discussion it is easy to discuss the validity of Möbius' inversion
formula.

THEOREM 2.3.2. If $R_1$ is the radius of convergence of the power series for $f(z)$,
the inversion formula (2.3.2) is valid for $|z| < \min(R_1, 1)$. If $R_1 < 1$, the
series diverges for $R_1 < |z| < 1$. The series may converge outside of the unit
circle, but normally it does not represent $z$ for such values. If the radius of
convergence of $F(z)$ in (2.3.3) is $R$, formula (2.3.4) is valid for $|z| < \min(R, R_1, 1)$.

Proof. Let $0 < R_1 \le 1$. Then all the terms in (2.3.2) are regular analytic
functions of $z$ in $|z| < R_1$. By (2.3.9)

(2.3.11)    $\sum_{n} |b_n f(z^n)| \le \sum_{n} |b_n| \sum_{m} |a_m| \cdot |z|^{mn} < \sum_{n} \chi(n) M^{p(n)+1} R_0^{-n} \frac{|z|^n}{1 - |z|^n / R_0},$

and this is clearly convergent for $|z| < R_0$. Here $R_0 < R_1$ and as near to $R_1$
as we please, i.e., the series in (2.3.2) is absolutely convergent for $|z| < R_1$.
Moreover, the double series obtained by substituting the power series for $f(z^n)$
on the right side of (2.3.2) is absolutely convergent, as we have just seen. It
can consequently be rearranged at liberty. Collecting powers of equal degree
and reducing with the aid of the algorithm of Möbius, the double series reduces
to its first term $z$. This completes the proof for the case $R_1 \le 1$.

If $R_1 > 1$, the terms of the series (2.3.2) are holomorphic for $|z| < 1$ and
in no larger region. For such values formula (2.3.10) shows that the double
series is dominated by

(2.3.12)    $\sum_{n} \chi(n) M^{p(n)+1} R_0^{-P(n)} \frac{|z|^n}{1 - |z|^n / R_0}.$

It follows that the inversion formula is valid for $|z| < 1$.

Let $|z| < 1$. Then

$\limsup |b_n f(z^n)|^{1/n} = |z| \limsup |b_n|^{1/n} = |z| / R_2.$

It follows that if $R_1 < 1$, so that $R_2 = R_1$, the series (2.3.2) diverges in the
annulus $R_1 < |z| < 1$.

Outside of the unit circle the situation may differ considerably in different
cases. Thus if

$f(z) = \frac{z}{1 - z}$, then $z = \sum_{n=1}^{\infty} \mu(n) \frac{z^n}{1 - z^n},$

which clearly diverges for $|z| > 1$. But if

$f(z) = \frac{z}{1 - z^2},$

then

$b_n = \mu(2k + 1)$ for $n = 2k + 1$, $\quad b_n = 0$ for $n = 2k$,

so that the series

$\sum_{n=1}^{\infty} b_n \frac{z^n}{1 - z^{2n}}$

converges also outside of the unit circle, but to $-1/z$ instead of to $z$. Finally, if

$f(z) = z e^{-z^k},$

where $k$ is a positive integer, then the series

$\sum_{n=1}^{\infty} b_n z^n e^{-z^{nk}}$

converges on the rays $|z| > 1$, $\arg z = \nu \cdot 2\pi/k$, $\nu = 0, 1, \cdots, k - 1$, and
nowhere else outside of the unit circle. The sum of the series tends to zero as
$z \to \infty$ along the rays in question; thus, the sum of the series cannot be $z$, but
I am unable to determine its actual value.

The reader will have no difficulty in verifying the statements concerning 
F(z) in Theorem 2.3.2 on the basis of the estimates for the coefficients, and this 
part of the proof will be omitted. 
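
The first two examples above are easy to test numerically. The following sketch is a check under stated assumptions, not a proof: the truncation at 60 terms and the sample points $z = 0.5$ and $z = 2$ are arbitrary choices. It evaluates the partial sums of (2.3.2) for $f(z) = z/(1-z)$ inside the unit circle and for $f(z) = z/(1-z^2)$ outside it.

```python
def mobius(n):
    """Möbius function by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def inversion_sum(f, b, z, N):
    """Partial sum of sum_{n <= N} b_n f(z^n), the right side of (2.3.2)."""
    return sum(b(n) * f(z ** n) for n in range(1, N + 1) if b(n) != 0)

f1 = lambda z: z / (1 - z)                   # a_n = 1, so b_n = mu(n)
f2 = lambda z: z / (1 - z * z)               # a_n = 1 for odd n, 0 for even n
b2 = lambda n: mobius(n) if n % 2 else 0     # b_n = mu(n) for odd n, 0 for even n

inside = inversion_sum(f1, mobius, 0.5, 60)  # |z| < 1: expected to return z
outside = inversion_sum(f2, b2, 2.0, 60)     # |z| > 1: expected to return -1/z
print(inside, outside)
```

With these choices the first sum reproduces $z = 0.5$, while the second converges at $z = 2$ to $-1/z = -0.5$ rather than to $z$.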


3. A class of linear functional equations.

3.1. The Möbius $\mathfrak{A}$-transform. Let $\mathfrak{A} = \{a_n\}$ be a given proper, normalized
sequence, $\mathfrak{B} = \{b_n\}$ the reciprocal sequence in the sense of §2.1. Möbius⁷
observed that the same algorithm enters in the study of the functional equation

(3.1.1)    $G(z) = \sum_{n=1}^{\infty} a_n F(z^n)$

for which he proposed the solution

(3.1.2)    $F(z) = \sum_{n=1}^{\infty} b_n G(z^n).$

Conversely, (3.1.1) is a solution of (3.1.2) if $F(z)$ is the given function. Though
Möbius claims that these relations hold for arbitrary given functions, he has
presumably only had power series in mind, and there is no indication that he
looked into convergence questions at all.

Putting

$G(e^z) = g(z), \qquad F(e^z) = f(z),$

we can rewrite the functional equations as follows:

(3.1.3)    $g(z) = \sum_{n=1}^{\infty} a_n f(nz),$

(3.1.4)    $f(z) = \sum_{n=1}^{\infty} b_n g(nz).$


⁷ Loc. cit., Werke, vol. IV, p. 593.











These forms are more convenient to handle than the original ones, and will
serve as the basis of the discussion in the present paragraph. Special cases have
long been in the literature. Thus the case $a_n = 1$, $b_n = \mu(n)$, $z = k$, gives
the inversion formulas

(3.1.5)    $g(k) = \sum_{n=1}^{\infty} f(nk),$

(3.1.6)    $f(k) = \sum_{n=1}^{\infty} \mu(n) g(nk),$

which are used extensively in analytical number theory.⁸
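
The pair (3.1.5)-(3.1.6) can be verified exactly when $f$ vanishes beyond some bound, since both sums are then finite. The sketch below assumes such a finitely supported $f$ with arbitrarily chosen values; the bound N and the helper names are hypothetical.

```python
def mobius(n):
    """Möbius function by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

N = 60
f = {k: k * k + 1 for k in range(1, N + 1)}   # arbitrary test values; f(k) = 0 for k > N

def f_at(k):
    return f.get(k, 0)

# (3.1.5): g(k) = sum_n f(nk); terms with nk > N vanish.
g = {k: sum(f_at(n * k) for n in range(1, N // k + 1)) for k in range(1, N + 1)}

# (3.1.6): f(k) = sum_n mu(n) g(nk); again only nk <= N contributes.
recovered = {k: sum(mobius(n) * g.get(n * k, 0) for n in range(1, N // k + 1))
             for k in range(1, N + 1)}
print(recovered == f)
```

Because every sum is finite here, no convergence question arises and the original values are recovered exactly.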

We proceed to an analytical discussion of equation (3.1.3). Let $E$ be a set
of points in the complex plane such that if $E$ contains the point $z$ it also
contains all multiples $nz$ of $z$, $n = 2, 3, \cdots$. Let $f(z)$ be given in $E$ and such that

(3.1.7)    $\mathfrak{M}[f(z); \mathfrak{A}] = \sum_{n=1}^{\infty} a_n f(nz)$

converges in $E$. We call this function the Möbius $\mathfrak{A}$-transform of $f(z)$ with
similar notation and terminology for $\mathfrak{B}$ and for other sequences. The
reciprocity of the two sequences $\mathfrak{A}$ and $\mathfrak{B}$ is reflected in the property

(3.1.8)    $\mathfrak{M}[\mathfrak{M}[f(z); \mathfrak{A}]; \mathfrak{B}] = \mathfrak{M}[\mathfrak{M}[f(z); \mathfrak{B}]; \mathfrak{A}] = f(z),$

valid for sufficiently restricted classes of functions $f(z)$.

THEOREM 3.1.1. A sufficient condition for the validity of (3.1.8) is the absolute
convergence of the series

(3.1.9)    $S[f] = \sum_{m=1}^{\infty} \sum_{n=1}^{\infty} a_m b_n f(mnz).$

Proof. The series, being absolutely convergent, can be rearranged arbitrarily.
If summed by columns its sum is $\mathfrak{M}[\mathfrak{M}[f(z); \mathfrak{A}]; \mathfrak{B}]$, if summed by rows,
$\mathfrak{M}[\mathfrak{M}[f(z); \mathfrak{B}]; \mathfrak{A}]$, whereas summation over constant values of the product $mn$
gives simply $f(z)$, by virtue of Möbius' algorithm.

3.2. Inversion of the $\mathfrak{A}$-transform. We now turn to the question of finding
the inverse of the $\mathfrak{A}$-transform, i.e., the resolution of the equation

(3.2.1)    $\mathfrak{M}[f(z); \mathfrak{A}] = g(z)$

for $f(z)$ in terms of $g(z)$.

THEOREM 3.2.1. A sufficient condition that

(3.2.2)    $f(z) = \mathfrak{M}[g(z); \mathfrak{B}]$

⁸ See, e.g., P. Bachmann, Die analytische Zahlentheorie, Leipzig, 1894, p. 310 et seq.
Bachmann does not seem to have been aware of Möbius' paper. Thus he credits the
introduction of the function $\mu(n)$ to F. Mertens, Ueber einige asymptotische Gesetze der
Zahlentheorie, Journal f. Math., vol. 77 (1874), pp. 289-338.








be a solution of (3.2.1) which is absolutely convergent in $E$ is that the series $S[g]$
be absolutely convergent in $E$. On the other hand, there can be at most one solution
of (3.2.1) which renders the series $S[f]$ absolutely convergent, and whenever it exists
this solution is given by (3.2.2).

Proof. The assumption that $S[g]$ is absolutely convergent implies that
$\mathfrak{M}[\mathfrak{M}[g(z); \mathfrak{B}]; \mathfrak{A}] = g(z)$ by Theorem 3.1.1. Hence (3.2.2) gives a solution
under these circumstances, and the solution is evidently absolutely convergent.
Conversely, if $f(z)$ is a solution of (3.2.1) such that $S[f]$ is absolutely convergent,
then for the same reason

$S[f] = f(z) = \mathfrak{M}[\mathfrak{M}[f(z); \mathfrak{A}]; \mathfrak{B}] = \mathfrak{M}[g(z); \mathfrak{B}],$

so the solution in question is uniquely determined and given by (3.2.2).

It must be granted that Theorem 3.2.1 is of a rather restrictive character.
It should be pointed out, however, that the mere existence of $\mathfrak{M}[g(z); \mathfrak{B}]$ is not
enough to insure that this function is a solution of (3.2.1). Thus, for example,
if there exists a non-vanishing function $g(z)$ such that $\mathfrak{M}[g(z); \mathfrak{B}] = 0$, then
formula (3.2.2) certainly does not give a solution of (3.2.1). See further §3.3.

The following theorem is of a somewhat different character. 

THEOREM 3.2.2. Let $g(z)$ be holomorphic in the sector $S$, $\theta_1 < \arg z < \theta_2$,
$|z| > R > 0$, and let

(3.2.3)    $g(z) = z^{-\alpha} \left[ c_0 + O\!\left(\frac{1}{z}\right) \right]$ as $z \to \infty$ in $S$.

Further, suppose that $D(s; a_n) = \sum_{1}^{\infty} a_n n^{-s}$ is convergent and different from zero
for $\Re(s) > \Re(\alpha) - \epsilon$, $\epsilon > 0$. Then (3.2.2) defines a solution of (3.2.1),
holomorphic in $S$, and

(3.2.4)    $f(z) = z^{-\alpha} \left[ c_0 [D(\alpha; a_n)]^{-1} + O\!\left(\frac{1}{z}\right) \right]$ as $z \to \infty$ in $S$,

and this is the only solution of such asymptotic character.
Proof. We have to show that $\mathfrak{M}[\mathfrak{M}[g(z); \mathfrak{B}]; \mathfrak{A}] = g(z)$ under the given
assumptions. We start by observing that

(3.2.5)    $\mathfrak{M}[z^{-\alpha}; \mathfrak{A}] = D(\alpha; a_n) z^{-\alpha},$
(3.2.6)    $\mathfrak{M}[z^{-\alpha}; \mathfrak{B}] = D(\alpha; b_n) z^{-\alpha},$

the convergence of $D(\alpha; b_n)$ being a consequence of Landau's Theorem 2.2.1.
Let us put

(3.2.7)    $g(z) = c_0 z^{-\alpha} + g_1(z),$
(3.2.8)    $f(z) = c_0 D(\alpha; b_n) z^{-\alpha} + f_1(z).$

Then if $f(z)$ is a solution of (3.2.1), $f_1(z)$ is a solution of

(3.2.9)    $\mathfrak{M}[f_1(z); \mathfrak{A}] = g_1(z).$

But to this equation we can apply Theorem 3.2.1. Indeed, by assumption
$g_1(z) = O(|z|^{-\gamma - 1})$, $\gamma = \Re(\alpha)$, and the series $D(s; a_n)$ and $D(s; b_n)$ are absolutely
convergent for $s = 1 + \gamma$, since they converge for $s = \gamma - \epsilon/2$. Forming
$S[g_1]$ and replacing each term by its absolute value, we find that the series is
dominated by a constant multiple of

$|z|^{-\gamma - 1} \sum_{m=1}^{\infty} \sum_{n=1}^{\infty} |a_m b_n| (mn)^{-\gamma - 1},$

convergent in $S$. Hence $f_1(z) = \mathfrak{M}[g_1(z); \mathfrak{B}]$ is a solution of (3.2.9), and
$\mathfrak{M}[g(z); \mathfrak{B}]$ is a solution of (3.2.1).

Further,

$f_1(z) = O\!\left\{ |z|^{-\gamma - 1} \sum_{n=1}^{\infty} |b_n| n^{-\gamma - 1} \right\}$

or

(3.2.10)    $|f_1(z)| \le M |z|^{-\gamma - 1},$

whence it follows that the double series $S[f_1]$ is absolutely convergent in $S$.
Hence, by Theorem 3.2.1, $f_1(z)$ is the only solution of (3.2.9) having this property;
and, a fortiori, the only solution satisfying (3.2.10). It follows that $\mathfrak{M}[g(z); \mathfrak{B}]$
is the only solution of (3.2.1) of the form (3.2.8), where $f_1(z)$ satisfies (3.2.10).

It is obvious that the solution breaks down if $D(\alpha; a_n) = 0$. In general, it
also breaks down if $D(s; a_n) = 0$ for an $s$ with $\Re(s) > \Re(\alpha)$. It should be
noted, however, that

$[D(\alpha; a_n)]^{-1} z^{-\alpha}$

is a solution of

$\mathfrak{M}[f(z); \mathfrak{A}] = z^{-\alpha}$

under the sole assumption that $D(\alpha; a_n)$ is convergent and different from zero.
This observation may sometimes be used in order to extend the validity of our
solution.
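
The eigenfunction property (3.2.5) and the resulting solution can be observed numerically. The sketch below assumes the simplest case $a_n = 1$, $b_n = \mu(n)$, so that $D(s; a_n) = \zeta(s)$, with $\alpha = 3$; the truncation orders and the sample point are arbitrary, and the helper name `transform` is hypothetical.

```python
def mobius(n):
    """Möbius function by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def transform(f, coeff, z, N):
    """Partial sum of the Möbius transform: sum_{n <= N} coeff(n) * f(n z)."""
    return sum(coeff(n) * f(n * z) for n in range(1, N + 1))

alpha, N, z = 3.0, 300, 1.5
g = lambda w: w ** -alpha                    # right-hand side z^(-alpha)
ones = lambda n: 1                           # a_n = 1, so D(s; a_n) = zeta(s)

zeta_alpha = sum(n ** -alpha for n in range(1, 100001))   # D(alpha; a_n), truncated
f = lambda w: transform(g, mobius, w, N)     # candidate solution M[g; B]
check = transform(f, ones, z, N)             # M[M[g; B]; A], should be close to g(z)
predicted = g(z) / zeta_alpha                # [D(alpha; a_n)]^(-1) z^(-alpha)
print(check, g(z), f(z), predicted)
```

Up to the truncation error, which is of order $N^{-2}$ here, the transform of the candidate solution returns $z^{-\alpha}$, and the candidate itself agrees with $[D(\alpha; a_n)]^{-1} z^{-\alpha}$.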

Thus, for example, Theorem 3.2.2 does not apply to the case $a_n = (-1)^{n-1}$
if $0 < \alpha < \tfrac{1}{2}$, but formula (3.2.2), nevertheless, gives a solution of the
corresponding equation. This is readily seen by running over the proof again with
this particular choice of the parameters.

Further, it should be noted that the assumption on the remainder in (3.2.3)
is chosen merely with the view of insuring that the Dirichlet series $D(s; a_n)$
and $D(s; b_n)$ be absolutely convergent for $s = 1 + \gamma$. If there should exist a $\delta$,
$0 < \delta < 1$, such that these series converge absolutely for $s = \delta + \gamma$, it is
sufficient for our purposes to assume that $z^{\alpha} g(z) = c_0 + O(|z|^{-\delta})$.


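3.3. Additional remarks. We are dealing with two adjoint equations

(3.3.1)    $\mathfrak{M}[x(z), \mathfrak{A}] = g(z),$
(3.3.2)    $\mathfrak{M}[y(z), \mathfrak{B}] = h(z),$

and the corresponding homogeneous equations

(3.3.3)    $\mathfrak{M}[u(z), \mathfrak{A}] = 0,$
(3.3.4)    $\mathfrak{M}[v(z), \mathfrak{B}] = 0.$

We have already observed that

(3.3.5)    $\mathfrak{M}[z^{-\alpha}, \mathfrak{A}] = D(\alpha; a_n) z^{-\alpha},$

provided $\Re(\alpha) > \sigma_0$, the abscissa of convergence of the series for $D(s; a_n)$.
In the same domain we have

(3.3.6)    $\mathfrak{M}\!\left[ \frac{\partial^k}{\partial \alpha^k} z^{-\alpha}, \mathfrak{A} \right] = \frac{\partial^k}{\partial \alpha^k} \left[ D(\alpha; a_n) z^{-\alpha} \right].$

From this we conclude that if the equation

$D(s; a_n) = 0$

has a $k$-fold zero at $s = \alpha$ with $\Re(\alpha) > \sigma_0$, then

(3.3.7)    $z^{-\alpha}, \; z^{-\alpha} \log z, \; \cdots, \; z^{-\alpha} (\log z)^{k-1}$

are solutions of the homogeneous equation (3.3.3).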
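
A small example makes (3.3.7) concrete. The Dirichlet polynomial below is a hypothetical choice made for illustration, not one taken from the text: $D(s; a_n) = (1 - 2^{1-s})^2 = 1 - 4 \cdot 2^{-s} + 4 \cdot 4^{-s}$, which has a double zero at $s = 1$, so that $z^{-1}$ and $z^{-1} \log z$ should both satisfy the homogeneous equation.

```python
import math

# Hypothetical example: a_1 = 1, a_2 = -4, a_4 = 4, all other a_n = 0,
# so D(s; a_n) = (1 - 2^(1-s))^2 has a double zero (k = 2) at s = 1.
a = {1: 1.0, 2: -4.0, 4: 4.0}

def transform(u, z):
    """M[u(z), A] = sum_n a_n u(n z); the sum is finite for this choice of a_n."""
    return sum(coef * u(n * z) for n, coef in a.items())

u0 = lambda z: 1.0 / z                  # z^(-alpha) with alpha = 1
u1 = lambda z: math.log(z) / z          # z^(-alpha) log z
not_solution = lambda z: 1.0 / z ** 2   # s = 2 is not a zero of D

residuals = [abs(transform(u, z)) for u in (u0, u1) for z in (0.7, 1.0, 3.5)]
control = transform(not_solution, 1.0)  # equals D(2; a_n) = 1/4
print(max(residuals), control)
```

Both functions are annihilated to rounding error, while the control function $z^{-2}$ is merely multiplied by $D(2; a_n) = 1/4$, as (3.3.5) predicts.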

It is clear that, if there are infinitely many zeros of $D(s; a_n)$ in the domain
of convergence of the Dirichlet series, then any function of the form

(3.3.8)    $\sum_{n} c_n z^{-\alpha_n}$

satisfies (3.3.3), provided the double series

$\sum_{m=1}^{\infty} a_m \sum_{n=1}^{\infty} c_n (mz)^{-\alpha_n}$

can be rearranged so as to interchange the order of the summations.

It is perhaps worth while remarking that the series (3.3.8) do not form a
dense set in any of the function spaces usually considered, such as $C[1, \infty]$ or
$L_p(1, \infty)$, $1 \le p < \infty$. Indeed, the set $\{z^{-\alpha_n}\}$ will be closed in the space in
question only if the series

$\sum_{n} \frac{a + b\,\Re(\alpha_n)}{1 + |\alpha_n|^2}$

diverges, where $a$ and $b$ are constants depending upon the space. But in our
case $\Re(\alpha_n)$ is bounded; hence we are demanding the divergence of the series
$\sum |\alpha_n|^{-2}$. But according to Landau⁹ the number of zeros of an ordinary
Dirichlet series in the domain $\sigma \ge \sigma_0 + \epsilon$, $|t| \le T$, is $O(T \log T)$ and this
frequency clearly does not permit the divergence of $\sum |\alpha_n|^{-2}$.


⁹ E. Landau, Über die Nullstellen der Dirichletschen Reihen, Berliner Sitzungsberichte,
vol. 14 (1913), pp. 897-907.














INVERSION PROBLEM OF MOBIUS 561 


Our next remark concerns the existence of a solution of (3.3.1) when g(z) satisfies (3.3.4). It is clear that the inversion formula of Möbius cannot give a solution in this case. It seems very plausible that no solution can exist under these circumstances. In certain simple cases it is possible to verify this surmise. Take, for example,

$\sum_{k=0}^{\infty} f(2^k z) = 1.$

Here $D(s; b_n) = 1 - 2^{-s}$ and $\mathfrak{M}[1, B] = 0$. Consider any domain E of the type described in §3.1, i.e., if $z_0 \in E$, so do $nz_0$ for n = 2, 3, .... If the equation holds for $z = z_0$ and for $z = 2z_0$, then we get by subtraction $f(z_0) = 0$. If it is true for $z = 2^k z_0$ for every integer k, we get by the same argument that $f(2^k z_0) = 0$ for every k. But this clearly contradicts the assumption that the equation holds for $z = z_0$. Thus the equation in question cannot have any solution in E or even in a point set S which contains $2z_0$ whenever it contains $z_0$.

The final remark of this paragraph concerns the solution of (3.3.1) when g(z) is a solution of (3.3.3). It is enough to consider the case $g(z) = z^{-\alpha}$, where $s = \alpha$ is a k-fold root of the equation $D(s; a_n) = 0$ in the half-plane of convergence of the Dirichlet series. Equation (3.3.6) then shows that

$f(z) = z^{-\alpha}\left\{(-1)^k\,[D^{(k)}(\alpha; a_n)]^{-1} (\log z)^k + \sum_{\nu=0}^{k-1} c_\nu (\log z)^\nu\right\}$

is a solution of (3.3.1). This result shows a further analogy between the formal theory of the equations here considered and that of linear differential equations.

3.4. Further equations with the same algorithm. It was known to Möbius¹⁰ that his algorithm entered in the study of other functional equations. The following example is slightly more general than the situation which Möbius had in mind.

We consider two multiplicative systems, i.e., we give two sequences $\{\varepsilon(p)\}$ and $\{\omega(p)\}$, where p runs through the primes, we take $\varepsilon(1) = \omega(1) = 1$ and define $\varepsilon(n)$ and $\omega(n)$ by the equations

(3.4.1) $\varepsilon(mn) = \varepsilon(m)\varepsilon(n), \quad \omega(mn) = \omega(m)\omega(n).$

Let us form the functional equation

(3.4.2) $\sum_{n=1}^{\infty} a_n \varepsilon(n) f(\omega(n)z) = g(z).$

It is not difficult to see that a formal solution is given by

(3.4.3) $f(z) = \sum_{n=1}^{\infty} b_n \varepsilon(n) g(\omega(n)z),$

10 Möbius, loc. cit., Werke, vol. IV, p. 594.










where $\{b_n\}$ as usual is the reciprocal sequence of $\{a_n\}$. The elementary methods of §3.2 can be used to develop sufficient conditions for the validity of this inversion formula.¹¹ The details can be left to the reader.
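The formal inversion (3.4.2)-(3.4.3) can be tried out on a case where every series is a finite sum. The following is a minimal sketch, not the author's procedure: the choices $a_n = 1$, $\varepsilon(n) = 1/n$, $\omega(n) = n^2$ and the test function f are hypothetical and serve only as an illustration, and the helper names are ours.

```python
from fractions import Fraction

def reciprocal_sequence(a, N):
    """Return b[0..N] (b[0] unused) with sum over d | n of a[d] * b[n // d] = [n == 1]."""
    b = [Fraction(0)] * (N + 1)
    b[1] = 1 / Fraction(a[1])
    for n in range(2, N + 1):
        s = sum(a[d] * b[n // d] for d in range(2, n + 1) if n % d == 0)
        b[n] = -s / Fraction(a[1])
    return b

N = 100
a = [Fraction(0)] + [Fraction(1)] * N   # a_n = 1; its reciprocal sequence is the Möbius function
b = reciprocal_sequence(a, N)

def eps(n):
    return Fraction(1, n)               # hypothetical epsilon(n), completely multiplicative

def omega(n):
    return n * n                        # hypothetical omega(n), completely multiplicative

def f(z):
    # Hypothetical test function vanishing for z > N, so that all series below terminate.
    return Fraction(1, 1 + z) if z <= N else Fraction(0)

def g(z):
    # Left-hand member of (3.4.2).
    return sum(a[n] * eps(n) * f(omega(n) * z) for n in range(1, N + 1))

def f_recovered(z):
    # Right-hand member of (3.4.3).
    return sum(b[n] * eps(n) * g(omega(n) * z) for n in range(1, N + 1))

print(all(f_recovered(z) == f(z) for z in (1, 2, 3, 7, 50)))
```

Exact rational arithmetic is used so that the comparison is an identity rather than a numerical approximation; the question of convergence, which is the real content of §3.2, does not arise for such an f.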


4. General algorithms. 
4.1. The Möbius A(u)-transform. We can write the Möbius A-transform as a Stieltjes integral, viz.,

$\mathfrak{M}[f(z), A] = \int_1^{\infty} f(uz)\,dA(u),$

where

$A(u) = \sum_{n<u} a_n.$

This suggests a generalization of the inversion problem to arbitrary functions A(u) of bounded variation.

Let A(u) be given for $u \ge 1$, A(1) = 0, of bounded variation in every finite interval, and such that

(4.1.1) $A(u) = O(u^{\omega+\varepsilon})$

for every $\varepsilon > 0$, where $\omega \ge 0$ is a constant.

Suppose now that f(z) is an analytic function satisfying the following conditions: (i) f(z) is holomorphic in a sectorial domain S such that if z is in S, so is uz for every $u \ge 1$; (ii) the integral

(4.1.2) $\mathfrak{M}[f(z), A(u)] = \int_1^{\infty} f(uz)\,dA(u)$

exists in some domain $S_0 \subset S$.

We shall as a rule use the abbreviated notation $\mathfrak{M}[f, A]$ and refer to this function as the Möbius A(u)-transform of f(z). To every fixed function A(u) satisfying the above conditions there is a class $\mathfrak{F}[A]$ of functions f(z) which admit A(u)-transforms in the sense of the definition.

The effective determination of $\mathfrak{F}[A]$ may be quite laborious except in the simplest cases, but it is easy to find a subclass of $\mathfrak{F}[A]$. Let f(z) be holomorphic in a sectorial domain S and

(4.1.3) $|f(z)| \le M|z|^{-\gamma}, \quad \gamma > \omega.$

It is easy to see that $\mathfrak{M}[f, A]$ exists in S; thus every such function belongs to $\mathfrak{F}[A]$.

4.2. The reciprocation problem for Laplace integrals. In the case of the A-transform the inverse or reciprocal transform was given a priori and had a sense for a sufficiently restricted but not vacuous class of functions. For the

11 Some instances of this inversion formula figured in the papers of E. Hille and O. Szász, On the completeness of Lambert functions, of which the first part appeared in the Bulletin of the American Mathematical Society, vol. 42 (1936), pp. 411-418, and the second in the Annals of Mathematics, vol. 37 (1936), pp. 800-815.
















A(u)-transform the situation is different, and the inverse transform need not exist at all. By analogy with the sequence case we should consider the Laplace-Stieltjes integral

(4.2.1) $D(s) = \int_1^{\infty} u^{-s}\,dA(u).$

The hypothesis (4.1.1) insures the convergence of this integral for $\sigma > \omega$. We should then take the reciprocal of D(s) and find its representation as a Laplace-Stieltjes integral. But it is well known that $[D(s)]^{-1}$ ordinarily is not representable in this manner.

Necessary and sufficient conditions in order that $[D(s)]^{-1}$ shall be representable by a convergent Laplace-Stieltjes integral do not seem to be known. In the following I shall give two sets of two conditions each. One condition is common to the two sets; the first set is necessary, but perhaps not sufficient, the second set is sufficient, but certainly not necessary.¹²

THEOREM 4.2.1. Let D(s) be a function representable by a convergent Laplace-Stieltjes integral. In order that $[D(s)]^{-1}$ shall also admit such a representation, it is necessary that (i) $\lim_{\sigma\to+\infty} D(\sigma) \ne 0$, and (ii) there exist a half-plane $\sigma > \alpha$ in which $D(s) \ne 0$.

Proof. That the conditions are necessary is obvious. Since

$\lim_{\sigma\to+\infty} D(\sigma) = \lim_{u\to 1+} A(u),$

we can replace condition (i) by the equivalent condition (i') A(u) is discontinuous at u = 1.

THEOREM 4.2.2. If A(u) is discontinuous at u = 1, $A(1 + 0) = a \ne 0$, and if the Laplace-Stieltjes integral representing D(s) is absolutely convergent for $\sigma > \alpha$, then $[D(s)]^{-1}$ is representable by a Laplace-Stieltjes integral absolutely convergent for $\sigma > \max(\alpha, \beta)$, where β is the root of the equation

(4.2.3) $|a| = \int_1^{\infty} u^{-\beta}\,dV_1^u[A(v) - a],$¹³

if it exists; otherwise $\beta = -\infty$.

Proof. Let us put

$A_1(v) = A(v) - a, \quad A_1(1) = 0,$

$V_1^u A_1(v) = A_0(u),$

$A_1(u, s) = \int_1^u v^{-s}\,dA_1(v),$

$A_0(u, s) = \int_1^u v^{-s}\,dA_0(v).$

12 It is to be hoped that the investigations by R. H. Cameron and N. Wiener, now in progress, will throw further light on this question.

13 Here and in the following, $V_a^b f(t)$ denotes the total variation of f(t) in $a \le t \le b$.













Then $A_0(u)$, $A_1(u)$, $A_0(u, s)$, and $A_1(u, s)$ are continuous at u = 1 and tend to zero as $u \to 1$. A simple consideration shows that

(4.2.4) $V_1^u A_1(v, s) \le A_0(u, \sigma).$

By assumption

$A_0(u) = O(u^{\alpha+\varepsilon})$

for every $\varepsilon > 0$. It follows that for $\sigma > \alpha$ the increasing function $A_0(u, \sigma)$ tends to the finite limit $A_0(\infty, \sigma)$ as $u \to \infty$. Further, it is obvious that $A_0(\infty, \sigma)$ is monotone decreasing when σ increases and tends to zero as $\sigma \to \infty$. The latter conclusion follows from the fact that $A_0(u) \to 0$ monotonically from above as $u \to 1 + 0$. Hence the equation

(4.2.5) $A_0(\infty, \sigma) = |a|$

has at most one root $\sigma \ge \alpha$. We define β to be equal to this root if it exists; otherwise we take $\beta = -\infty$.


Now let $\sigma > \max(\alpha, \beta)$. Then

$D(s) = a + A_1(\infty, s),$

and

$|A_1(\infty, s)| \le A_0(\infty, \sigma) < |a|.$

Hence

(4.2.6) $[D(s)]^{-1} = \sum_{n=0}^{\infty} (-1)^n a^{-n-1} [A_1(\infty, s)]^n,$

and the series is absolutely convergent. We shall rewrite this series as a Laplace-Stieltjes integral. We have for $\sigma > \gamma > \max(\alpha, \beta)$

(4.2.7) $A_1(\infty, s) = \int_1^{\infty} u^{\gamma-s}\,d_u A_1(u, \gamma).$

Hence

(4.2.8) $[A_1(\infty, s)]^n = \int_1^{\infty} u^{\gamma-s}\,d_u A_n(u, \gamma),$

where

(4.2.9) $A_n(u, \gamma) = \int_1^u A_{n-1}(u/v, \gamma)\,d_v A_1(v, \gamma) \quad (n = 2, 3, \ldots).$

Here (4.2.8) is absolutely convergent, being the product of absolutely convergent Laplace-Stieltjes integrals.¹⁴ This fact also follows from the subsequent estimates of $A_n(u, \gamma)$.

14 For the properties of Laplace-Stieltjes integrals used in this paper, consult D. V. Widder, Trans. Amer. Math. Soc., vol. 31 (1929), pp. 694-743, and E. Hille and J. D. Tamarkin, Proc. Nat. Acad. Sci., vol. 19 (1933), pp. 573-577, 902-912; vol. 20 (1934), pp. 140-144.



























We shall prove the inequality

(4.2.10) $|A_n(u, \gamma)| \le V_1^u A_n(v, \gamma) \le [A_0(u, \gamma)]^n.$

The inequality is obviously true for n = 1. Suppose that it has been proved for n = k. It follows in particular that $A_k(u, \gamma)$ is continuous at u = 1 and tends to zero as $u \to 1$. Using (4.2.9) with n = k + 1, we see that $A_{k+1}(u, \gamma)$ has the same property, whence it follows that it is sufficient to prove the second half of the inequality. But

$V_1^u A_{k+1}(v, \gamma) = \int_1^u \left| d_v \int_1^v A_k(v/t, \gamma)\,d_t A_1(t, \gamma) \right|$

$\le \int_1^u \left| d_v \int_1^v |A_k(v/t, \gamma)| \cdot |d_t A_1(t, \gamma)| \right|$

$\le \int_1^u d_v \int_1^v [A_0(v/t, \gamma)]^k\,d_t A_0(t, \gamma)$

$= \int_1^u [A_0(u/t, \gamma)]^k\,d_t A_0(t, \gamma)$

$\le [A_0(u, \gamma)]^k \int_1^u d_t A_0(t, \gamma)$

$= [A_0(u, \gamma)]^{k+1}.$

This completes the proof of the inequality.
Let us put

(4.2.11) $B(u, \gamma) = a^{-1} + \sum_{n=1}^{\infty} (-1)^n a^{-n-1} A_n(u, \gamma), \quad u > 1.$

Then by (4.2.10)

(4.2.12) $|B(u, \gamma)| \le V_1^u B(v, \gamma) \le \sum_{n=0}^{\infty} |a|^{-n-1} [A_0(u, \gamma)]^n,$

the series being absolutely convergent and uniformly bounded for $1 \le u < \infty$. Hence $B(u, \gamma)$ is of bounded variation in $[1, \infty]$ and

(4.2.13) $[D(s)]^{-1} = \int_1^{\infty} u^{\gamma-s}\,d_u B(u, \gamma) = \int_1^{\infty} u^{-s}\,dB(u),$

where

(4.2.14) $B(u) = \int_1^u v^{\gamma}\,d_v B(v, \gamma).$

The second integral in (4.2.13) is clearly absolutely convergent for $\sigma > \max(\alpha, \beta)$. This completes the proof of the theorem.
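In the sequence case, where A(u) is a step function with jumps $a_n$ at the integers, the expansion (4.2.6) becomes a finite computation: the n-th power of $A_1(\infty, s)$ corresponds to the n-fold Dirichlet convolution of the sequence $a_2, a_3, \ldots$ with itself. The sketch below is an illustration under that assumption, with helper names of our own choosing; it builds the reciprocal sequence from the series and confirms that it inverts the given sequence up to the truncation point N.

```python
from fractions import Fraction

def dirichlet_convolve(x, y, N):
    """Dirichlet convolution of two sequences indexed 1..N (index 0 unused)."""
    z = [Fraction(0)] * (N + 1)
    for i in range(1, N + 1):
        if x[i] == 0:
            continue
        for j in range(1, N // i + 1):
            z[i * j] += x[i] * y[j]
    return z

def reciprocal_by_series(a, N):
    """Reciprocal sequence b from b = sum over n of (-1)^n a_1^(-n-1) (tail)^{*n}."""
    a1 = Fraction(a[1])
    # 'tail' plays the part of A_1: the sequence with its first jump removed.
    tail = [Fraction(0), Fraction(0)] + [Fraction(v) for v in a[2:N + 1]]
    # 'power' starts as the convolution identity and becomes tail^{*n}.
    power = [Fraction(0), Fraction(1)] + [Fraction(0)] * (N - 1)
    b = [Fraction(0)] * (N + 1)
    n = 0
    while any(power):                    # tail^{*n} vanishes below 2^n, so this stops
        coeff = (-1) ** n / a1 ** (n + 1)
        for k in range(1, N + 1):
            b[k] += coeff * power[k]
        power = dirichlet_convolve(power, tail, N)
        n += 1
    return b

N = 60
a = [0] + [n + 1 for n in range(1, N + 1)]   # hypothetical sequence a_n = n + 1
b = reciprocal_by_series(a, N)
check = dirichlet_convolve(a, b, N)
print(check[1] == 1 and all(check[k] == 0 for k in range(2, N + 1)))
```

For a truncated sequence the series terminates, so no condition such as $A_0(\infty, \sigma) < |a|$ is needed here; that condition is what secures convergence in the theorem itself.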
The assumption that D(s) has a half-plane of absolute convergence is obviously 










unnecessarily restrictive. But simple convergence is not enough to insure the 
existence of even a formal Laplace integral for the reciprocal, much less of a 
half-plane of convergence. 

4.3. Inversion of the A(u)-transform. In the present paragraph we shall assume that the reciprocal of D(s) admits a representation by means of a convergent Laplace-Stieltjes integral. The functions A(u) and B(u) are then joined by the relation

(4.3.1) $\int_1^u B(u/v)\,dA(v) = \int_1^u A(u/v)\,dB(v) = 1, \quad u > 1,$

for almost all values of u. This is the transcendental analogue of the Möbius algorithm to which it reduces when A(u) is a step function with jumps at the integers.
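For step functions the relation (4.3.1) can be checked directly. The following sketch assumes $a_n = 1$, so that the reciprocal sequence is the Möbius function, and uses the convention $A(u) = \sum_{n<u} a_n$ of §4.1; half-integer values of u are used so that no point of discontinuity is hit. The function names are ours.

```python
from fractions import Fraction
from math import ceil

def mobius(n):
    """Möbius function by trial division."""
    result, p = 1, 2
    while p * p <= n:
        if n % p == 0:
            n //= p
            if n % p == 0:
                return 0
            result = -result
        p += 1
    return -result if n > 1 else result

def a(n):
    return 1                             # assumed sequence a_n = 1

def step(seq, u):
    """Step function sum of seq(n) over integers n < u; it vanishes at u = 1."""
    return sum(seq(n) for n in range(1, ceil(u)))

def convolution_integral(u):
    # For a step function A the Stieltjes integral of B(u/v) dA(v) over [1, u]
    # collapses to the sum over integers n < u of a_n * B(u / n).
    return sum(a(n) * step(mobius, u / n) for n in range(1, ceil(u)))

print(all(convolution_integral(Fraction(2 * k + 1, 2)) == 1 for k in range(1, 60)))
```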

We can now expect that for a sufficiently restricted class of functions f(z) we shall have

(4.3.2) $\mathfrak{M}\{\mathfrak{M}[f, A], B\} = \mathfrak{M}\{\mathfrak{M}[f, B], A\} = f(z).$

We have the following analogue of Theorem 3.2.1.

THEOREM 4.3.1. A sufficient condition that (4.3.2) shall hold is that the double integral

(4.3.3) $I[f] = \int_1^{\infty}\int_1^{\infty} f(uvz)\,dA(u)\,dB(v)$

be absolutely convergent.


Proof. Let

$V_1^u A(t) = A_0(u), \quad V_1^u B(t) = B_0(u).$

The condition of the theorem is then that the integral

$\int_1^{\infty}\int_1^{\infty} |f(uvz)|\,dA_0(u)\,dB_0(v)$

be convergent. It is then permissible to regard (4.3.3) as a repeated Stieltjes integral and the order in which the integrations are performed is immaterial. Now

$\mathfrak{M}\{\mathfrak{M}[f, A], B\} = \int_1^{\infty}\left\{\int_1^{\infty} f(uvz)\,dA(u)\right\} dB(v),$

$\mathfrak{M}\{\mathfrak{M}[f, B], A\} = \int_1^{\infty}\left\{\int_1^{\infty} f(uvz)\,dB(v)\right\} dA(u).$

Hence these two operations exist and give the same result. On the other hand, going back to the definition of the Stieltjes integral as a double sum and "summing by hyperbolas" uv = const. before passing to the limit, we can show that the double integral can be written in the form

$\int_1^{\infty} f(wz)\,d_w \int_1^w B(w/u)\,dA(u).$
i 1 
















Formula (4.3.1) shows that this expression reduces to f(z). This completes the 
proof of the theorem. 

We can now pass to the question of inversion. 

THEOREM 4.3.2. Let B(u) exist as a function of bounded variation and satisfy (4.3.1). A sufficient condition that

(4.3.4) $f(z) = \mathfrak{M}[g, B]$

be a solution of

(4.3.5) $g(z) = \mathfrak{M}[f, A]$

in the domain S is that the double integral I[g] be absolutely convergent in S. On the other hand, there cannot be more than one solution f(z) of (4.3.5) which renders I[f] absolutely convergent, and whenever it exists this solution is given by (4.3.4).

Proof. The assumption that I[g] is absolutely convergent implies that

$\mathfrak{M}\{\mathfrak{M}[g, B], A\} = g(z)$

by Theorem 4.3.1. Hence (4.3.4) gives a solution of (4.3.5) and the integral is obviously absolutely convergent.

Conversely, if f(z) is a solution of (4.3.5) such that I[f] is absolutely convergent, then for the same reason

$f(z) = I[f] = \mathfrak{M}\{\mathfrak{M}[f, A], B\} = \mathfrak{M}[g, B],$

so that the solution in question is uniquely determined and given by (4.3.4).

Again it is necessary to remark that the mere existence of $\mathfrak{M}[g, B]$ in a domain S is not sufficient to insure that this function be a solution of (4.3.5). Indeed, suppose that the Laplace-Stieltjes integral

$[D(s)]^{-1} = \int_1^{\infty} u^{-s}\,dB(u)$

has a zero $s = \alpha$ in the half-plane of convergence. Then

$\mathfrak{M}[z^{-\alpha}, B] = z^{-\alpha}\int_1^{\infty} u^{-\alpha}\,dB(u) = 0$

and is certainly not a solution of the equation

$\mathfrak{M}[f(z), A] = z^{-\alpha}.$


4.4. Concluding remarks. In the previous discussion I have perhaps overemphasized the rôle of the associated Laplace-Stieltjes integral

$D(s) = \int_1^{\infty} u^{-s}\,dA(u).$


It should be observed that this function and its reciprocal are mainly tools in 
the discussion, and the decisive rôle is really played by the reciprocal functions
A(u) and B(u) which are supposed to satisfy the algorithm (4.3.1). We know 
from the sequence case that these functions may very well exist without the 










associated Laplace-Stieltjes integrals having any domain of convergence or even 
any a priori obvious significance. The theorems of §4.3 really presuppose 
merely the existence of a pair of functions satisfying (4.3.1) and not the existence 
of the associated Laplace-Stieltjes integrals. 

But it must be admitted that when the integrals do not exist, the problem of finding the function B(u) reciprocal to a given function A(u) is in general not a very promising one. Sometimes we may circumvent the difficulties by a preliminary application of a suitable method of summation. Thus it may happen that $s^{-n}[D(s)]^{-1}$ is representable by a Laplace-Stieltjes integral even though $[D(s)]^{-1}$ is not. In this case the corresponding function $B_n(u)$ can be used to find an n-fold integral of the formal solution of (4.3.5) from which the solution itself may be found by solving an integral equation of the Abel type.

There are of course other functional equations which may be treated by the methods of this paper; for instance, the equation

$\int_0^{\infty} F(z - u)\,d\alpha(u) = G(z),$

which is associated with Laplace-Stieltjes integrals having 0 instead of 1 as the lower limit of integration.

Finally, it should be remarked that there are various relations, some obvious, 
others less so, between the functional equations treated in this paper on one 
hand, and the theory of Watson transforms and the Karamata-Wiener Tauberian 
theory on the other. The author hopes to return to these questions at a later 
opportunity. 


YALE UNIVERSITY.














ON BERNOULLI’S NUMBERS AND FERMAT’S LAST THEOREM 
By H. S. VANDIVER 


1. Introduction. In the present article a report will be given on the work 
which has been carried out under a grant made to the writer from the Penrose 
Fund of the American Philosophical Society. 

One of the objects of the work was to extend the known tables of Bernoulli numbers expressed as rational fractions in their lowest terms and to check all previous tables. This was carried out by D. H. Lehmer,¹ who tabulated $B_n$ for n = 91 to 110, inclusive. The previous tables had given the values $B_n$ (n = 1, 2, ..., 92). The first ninety of these were reproduced in the tables of H. T. Davis.² Here

$B_a = (-1)^{a-1} b_{2a}$

and

$(b + 1)^n = b_n \quad (n \ne 1),$

where the expression on the left means that (b + 1) is taken to the n-th power by means of the binomial theorem, and $b_k$ is substituted for $b^k$ (k = 1, 2, ..., n). All the above mentioned B's were employed by Lehmer in checking the regularity of primes, a prime p being defined as regular if it does not divide the numerators of any of the first (p − 3)/2 B's. The details of these latter computations will be treated below.
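The symbolic rule $(b + 1)^n = b_n$ and the definition of a regular prime lend themselves to a short machine check. The sketch below is a modern illustration and not the method of the paper: it works with the residues of $b_0, \ldots, b_{p-3}$ modulo p (their denominators are prime to p) instead of with the rational fractions themselves. The function names are ours.

```python
from math import comb

def bernoulli_mod(p):
    """Residues of b_0, ..., b_{p-3} modulo the odd prime p, from (b + 1)^n = b_n."""
    b = [1]
    for n in range(1, p - 2):
        # Expanding (b + 1)^(n+1) = b_(n+1) gives sum_{k<=n} C(n+1, k) b_k = 0.
        s = sum(comb(n + 1, k) * b[k] for k in range(n)) % p
        b.append(-s * pow(n + 1, -1, p) % p)
    return b

def irregular_indices(p):
    """Subscripts n <= (p - 3)/2 with B_n = (-1)^(n-1) b_{2n} divisible by p."""
    b = bernoulli_mod(p)
    return [n for n in range(1, (p - 3) // 2 + 1) if b[2 * n] == 0]

def is_regular(p):
    return not irregular_indices(p)

print(irregular_indices(37))   # the classical case: 37 divides the numerator of B_16
```

Working modulo p keeps every intermediate quantity below p, which is the main design choice here; a table of exact fractions such as Lehmer's requires arithmetic with very large integers.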

Another object of the work under the grant was to extend the results of the 
writer on Fermat’s Last Theorem for special exponents. 

In other papers³ it was established that

(I) $x^l + y^l + z^l = 0$

is impossible for x, y, and z non-zero rational integers and l a given integer, 2 < l < 307. In the present article we shall describe the work which established this result for 306 < l < 617, excepting l = 587, which exponent has not yet been tested (cf. note, p. 584). As the methods employed for these large exponents are quite elaborate and complicated, we shall explain many details.


Received May 24, 1937. 

1 This Journal, vol. 2 (1936), pp. 460-464. This article includes references to previous tables. Lehmer computed the value of $B_{196}$ in addition to those mentioned but not as part of the present project. Cf. Annals of Math., vol. 36 (1935), p. 648.

2 Tables of Higher Mathematical Functions, vol. 2, Bloomington, Indiana, 1935, pp. 230-233.

3 Proc. Natl. Acad. Sci., vol. 17 (1931), pp. 661-673 (referred to later as N. A.) with references there given.










We have persisted in the examination of special exponents in (I) in the hope that, if one of the criteria that we have employed throughout (N. A., p. 670, Theorem IV) for irregular primes l breaks down for a particular l, we shall find such an l in the range of our computations. So far no exponent to which we have applied the criterion has belonged to this class.


2. Congruence properties of the Bernoulli numbers. In a previous paper a number of congruences of this character were derived,⁴ and we shall here generalize and simplify some of these results and derive others to be employed later. If p is prime, consider the identity

(1) $(x^{pk} - 1)\,\frac{x^{pm} - 1}{x - 1} = (x^{pm} - 1)\,\frac{x^{pk} - 1}{x - 1}$

and write

$f_a^{(m)}(x) = x + 2^{a-1}x^2 + \cdots + (pm - 1)^{a-1}x^{pm-1}$

for a > 1 and

$f_1^{(m)}(x) = 1 + x + x^2 + \cdots + x^{pm-1}.$

These functions will be called Mirimanoff polynomials. If e is the Napierian base, we see that

(2) $\left[\frac{d^s f_a^{(m)}(e^v x)}{dv^s}\right]_{v=0} = f_{a+s}^{(m)}(x).$

In lieu of this we may employ the formal operation

$x\,\frac{d f_a^{(m)}(x)}{dx},$

but we shall use the exponential function as most of the papers which have been written along these lines employ it.

Setting $x = e^v x$ in (1), differentiating a times, and setting v = 0, we have, employing Leibniz' theorem,

(3) $(x^{pk} - 1) f_{a+1}^{(m)}(x) + apk\,x^{pk} f_a^{(m)}(x) \equiv (x^{pm} - 1) f_{a+1}^{(k)}(x) + apm\,x^{pm} f_a^{(k)}(x) \pmod{p^2}.$

Let ρ be an m-th root of unity ≠ 1; then the last relation gives, if $m \not\equiv 0 \pmod p$,

$\sum_{k=1}^{m-1} (\rho^{pk} - 1) f_{a+1}^{(m)}(\rho) + ap \sum_{k=1}^{m-1} k\rho^{pk} f_a^{(m)}(\rho) \equiv apm \sum_{k=1}^{m-1} f_a^{(k)}(\rho) \pmod{p^2}.$

Now

$f_a^{(m)}(\rho) \equiv (1 + \rho^p + \cdots + \rho^{p(m-1)}) f_a^{(1)}(\rho) \equiv 0 \pmod p,$

4 Proc. Natl. Acad. Sci., vol. 16 (1930), pp. 139-150.













whence

(4) $-\sum{}' m f_{a+1}^{(m)}(\rho) \equiv apm \sum_{k=1}^{m-1} \sum{}' f_a^{(k)}(\rho) \pmod{p^2},$

where $\sum'$ indicates summation over all the distinct values of ρ. Consider the summation

$\sum{}' f_{a+1}^{(m)}(\rho).$

If we carry the summation over each term, then the term of the form $\sum' t^a \rho^t = -t^a$ for t not divisible by m, whereas if t is divisible by m the term is $(m - 1)t^a$. Noting that if $a \not\equiv 1 \pmod{p - 1}$,

$\sum_{i=1}^{pm-1} i^a = \sum_{h=0}^{m-1}\sum_{j=0}^{p-1} (j + ph)^a \equiv mS_a + \sum_{h=0}^{m-1} aph\,S_{a-1} \pmod{p^2}; \quad S_a = \sum_{i=1}^{p-1} i^a,$

we obtain, using $S_{a-1} \equiv 0 \pmod p$, the relation

(5) $\sum{}' f_{a+1}^{(m)}(\rho) \equiv (m^{a+1} - m) S_a \pmod{p^2}.$

Hence (4), (5), and the known relation $S_a \equiv pb_a \pmod{p^2}$ give for a even, p > 3,

(6) $\frac{(m - m^{a+1})\,b_a}{a} \equiv \sum_{k=1}^{m-1} \sum{}' f_a^{(k)}(\rho) \pmod p,$

whence

(7) $\frac{1 - m^a}{am^{a-1}}\,b_a \equiv \sum_{v=1}^{m-1}\sum_{j=1}^{[vp/m]} j^{a-1} \pmod p,$

where [z] is the greatest integer in z.
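Relation (7), read as $(1 - m^a)\,b_a/(a\,m^{a-1}) \equiv \sum_{v=1}^{m-1}\sum_{j=1}^{[vp/m]} j^{a-1} \pmod p$ for even a and $m \not\equiv 0 \pmod p$, can be confirmed for small primes by direct summation. The sketch below is only a numerical check under that reading; the restriction to $2 \le a \le p - 3$ and $2 \le m < p$ is our assumption, and the helper names are ours.

```python
from math import comb

def bernoulli_mod(p):
    """Residues of b_0, ..., b_{p-3} modulo the odd prime p, from (b + 1)^n = b_n."""
    b = [1]
    for n in range(1, p - 2):
        s = sum(comb(n + 1, k) * b[k] for k in range(n)) % p
        b.append(-s * pow(n + 1, -1, p) % p)
    return b

def left_member(p, a, m, b):
    # (1 - m^a) * b_a / (a * m^(a-1)), reduced modulo p
    return (1 - m ** a) * pow(a * m ** (a - 1), -1, p) * b[a] % p

def right_member(p, a, m):
    # double sum over v = 1..m-1 and j = 1..[vp/m] of j^(a-1)
    return sum(pow(j, a - 1, p)
               for v in range(1, m)
               for j in range(1, v * p // m + 1)) % p

def congruence_7_holds(p, a, m):
    return left_member(p, a, m, bernoulli_mod(p)) == right_member(p, a, m)

print(congruence_7_holds(7, 2, 3))
```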


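We have

$\left[\frac{(n - k)p}{n}\right] = p - \left[\frac{kp}{n}\right] - 1,$

whence

(8) $\sum_{s=1}^{[(n-k)p/n]} s^{2a-1} \equiv \sum_{s=1}^{[kp/n]} s^{2a-1} \pmod p.$

From this, we may show that, if k < n,

(9) $C_{k+1}^{(n)} + C_{n-k}^{(n)} \equiv 0 \pmod p; \quad C_i^{(m)} = \sum_{s=[(i-1)p/m]+1}^{[ip/m]} s^{2a-1}.$

In $C_{k+1}^{(n)}$ there are

$\left[\frac{(k + 1)p}{n}\right] - \left[\frac{kp}{n}\right]$

terms, and in $C_{n-k}^{(n)}$ there are

$\left[\frac{(n - k)p}{n}\right] - \left[\frac{(n - k - 1)p}{n}\right]$

terms, and these two numbers are equal since

$\left[\frac{(n - k)p}{n}\right] = p - \left[\frac{kp}{n}\right] - 1,$

with a similar relation in which k + 1 replaces k. Now (8) gives

$\sum_{s=[(n-k-1)p/n]+1}^{[(n-k)p/n]} s^{2a-1} \equiv -\sum_{s=[kp/n]+1}^{[(k+1)p/n]} s^{2a-1} \pmod p.$

This is the relation (9). Now consider (7) with 2a in place of a and n in place of m. Its right-hand member contains the terms

$(n - i)C_i^{(n)} + (i - 1)C_{n-i+1}^{(n)}.$

Using (9) we see that this is congruent modulo p to

$(n - 2i + 1)C_i^{(n)}.$

Also for n odd we have

$C_{(n+1)/2}^{(n)} \equiv 0 \pmod p$

from (9). Hence from (9) we have, using (7),

(10) $\frac{1 - n^{2a}}{2an^{2a-1}}\,b_{2a} \equiv \sum_{i=1}^{n-1} (n - i)C_i^{(n)} \equiv \sum_{i=1}^{[n/2]} (n - 2i + 1)C_i^{(n)} \pmod p.$

For an odd $n = n_1$ this reduces to

(10a) $\frac{1 - n_1^{2a}}{4an_1^{2a-1}}\,b_{2a} \equiv \sum_{i=1}^{[n_1/2]} \left(\frac{n_1 + 1}{2} - i\right) C_i^{(n_1)} \pmod p.$

To accord with the notation employed in other papers we write

$C_i^{(4)} = A_i, \quad C_i^{(6)} = C_i,$

defined for p > 6. Now for 2a < (p − 1) we have

$\frac{1 - n^{2a}}{2an^{2a-1}} \equiv \frac{n^{p-2a} - n}{2a} \pmod p.$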


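Hence we obtain from (10) and (10a)

(11) $\frac{(6^{p-2a} - 6)\,b_{2a}}{2a} \equiv 5C_1 + 3C_2 + C_3 \pmod p;$

(12) $\frac{(4^{p-2a} - 4)\,b_{2a}}{2a} \equiv 3A_1 + A_2 \pmod p;$

(13) $\frac{(3^{p-2a} - 3)\,b_{2a}}{4a} \equiv C_1 + C_2 \pmod p,$

observing that

$C_1 + C_2 = \sum_{s=1}^{[p/3]} s^{2a-1}.$

We note also that

$A_1 + A_2 = C_1 + C_2 + C_3$

and (11) gives

$\frac{(6^{p-2a} - 6)\,b_{2a}}{2a} \equiv 2(C_1 + C_2) + 2C_1 + A_1 + A_2 \equiv \frac{(3^{p-2a} - 3) + (4^{p-2a} - 4)}{2a}\,b_{2a} + 2(C_1 - A_1) \pmod p.$

This gives

$A_1 - C_1 \equiv (-1)^a B_a\,\frac{1 - 3^{p-2a} - 4^{p-2a} + 6^{p-2a}}{4a} \equiv (-1)^a B_a\,\frac{(2^{p-2a} - 1)(3^{p-2a} - 2^{p-2a} - 1)}{4a} \pmod p,$

and this may be put in the form

(14) $\sum_{s=[p/6]+1}^{[p/4]} s^{2a-1} \equiv (-1)^a\,\frac{(2^{p-2a} - 1)(3^{p-2a} - 2^{p-2a} - 1)}{4a}\,B_a \pmod p$

for p > 7, 2a < p − 1.

From (10) we have

(15) $\frac{(2^{p-2a} - 2)\,b_{2a}}{2a} \equiv A_1 + A_2 \pmod p,$

which with (13) gives

$\frac{2(2^{p-2a} - 2) - (3^{p-2a} - 3)}{2a}\,b_{2a} \equiv 2(A_1 + A_2) - 2(C_1 + C_2) = 2C_3 \pmod p$

or

(16) $\sum_{s=[p/3]+1}^{[p/2]} s^{2a-1} \equiv \frac{2^{p-2a+1} - 3^{p-2a} - 1}{4a}\,(-1)^{a-1} B_a \pmod p.$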


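Now set

$D_i = C_i^{(5)},$

then (10a) gives

(17) $\frac{1 - 5^{2a}}{4a \cdot 5^{2a-1}}\,b_{2a} \equiv 2D_1 + D_2 \pmod p.$

Now (11) and (15) give

$\frac{(6^{p-2a} - 6)\,b_{2a}}{2a} - \frac{(2^{p-2a} - 2)\,b_{2a}}{2a} \equiv 4C_1 + 2C_2 \pmod p$

or

$\frac{b_{2a}}{4a}\,(6^{p-2a} - 2^{p-2a} - 4) \equiv 2C_1 + C_2 \pmod p.$

Subtraction of this from (17) gives, modulo p,

(18) $2(D_1 - C_1) + D_2 - C_2 \equiv \frac{5^{p-2a} - 6^{p-2a} + 2^{p-2a} - 1}{4a}\,b_{2a}.$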
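Formulas (14), (16) and (18) all express a sum of powers $s^{2a-1}$ over a short range of s as a multiple of $b_{2a}$ modulo p. The sketch below checks them by brute force, reading (14) as the sum over $[p/6] < s \le [p/4]$, (16) as the sum over $[p/3] < s \le [p/2]$, and the left member of (18) as the sum over $[p/6] < s \le [p/5]$ plus the sum over $[p/3] < s \le [2p/5]$. It writes $(-1)^{a-1}B_a$ as $b_{2a}$ throughout; the helper names are ours and the code is an illustration, not the computing scheme of the paper.

```python
from math import comb

def bernoulli_mod(p):
    """Residues of b_0, ..., b_{p-3} modulo the odd prime p, from (b + 1)^n = b_n."""
    b = [1]
    for n in range(1, p - 2):
        s = sum(comb(n + 1, k) * b[k] for k in range(n)) % p
        b.append(-s * pow(n + 1, -1, p) % p)
    return b

def power_sum(lo, hi, e, p):
    """Sum of s^e for lo < s <= hi, reduced modulo p."""
    return sum(pow(s, e, p) for s in range(lo + 1, hi + 1)) % p

def check_14_16_18(p, a, b):
    """Return three booleans: whether (14), (16), (18) hold for this p and a."""
    x, e = p - 2 * a, 2 * a - 1
    scale = b[2 * a] * pow(4 * a, -1, p)            # b_{2a} / (4a) modulo p
    two, three, five, six = (pow(t, x, p) for t in (2, 3, 5, 6))
    ok14 = power_sum(p // 6, p // 4, e, p) == (two - 1) * (two + 1 - three) * scale % p
    ok16 = power_sum(p // 3, p // 2, e, p) == (2 * two - three - 1) * scale % p
    lhs18 = (power_sum(p // 6, p // 5, e, p) + power_sum(p // 3, 2 * p // 5, e, p)) % p
    ok18 = lhs18 == (five - six + two - 1) * scale % p
    return ok14, ok16, ok18

print(check_14_16_18(11, 1, bernoulli_mod(11)))
```

When the factor multiplying $b_{2a}$ is not divisible by p, the vanishing of the left-hand sum is equivalent to $b_{2a} \equiv 0 \pmod p$, which is how these formulas serve as regularity tests.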

3. Examination of primes as to regularity. The first step in our examination of a particular l in (I) was to determine if it was regular, so that we could apply the known result of Kummer to the effect that (I) is impossible if l is a regular prime. For this purpose formulas (14) and (16) were mainly employed heretofore. For the larger primes, however, (18) was found to be more valuable and was mainly employed in the present work, where l = p. The number of values of s used in (14) is approximately [(l − 1)/12]. The number required in (18) is larger, but there exists a relation between the values of s in (18) which does not hold in (14). The range of values for s in the expression $2(D_1 - C_1)$ is from [l/6] + 1 to [l/5], where each value is used twice. The expression $D_2 - C_2$ contains the values from [l/5] + 1 to [2l/5] less the values from [l/6] + 1 to [l/3]. Thus the negative values in the second expression cancel the double values of the first, leaving the values of s in (18) in two ranges, [l/6] + 1 to [l/5] and [l/3] + 1 to [2l/5].

Let n be a number in the range [l/6] + 1 to [l/5]. Then n = [l/6] + i, where i is a positive integer. Let j = [l/6]. Then l/6 = j + k/6, where 0 < k < 6, and l/3 = 2j + k/3. Therefore, 2[l/6] ≥ [l/3] − 1, and 2n ≥ [l/3] + 1. Also n ≤ [l/5], and as above 2[l/5] ≤ [2l/5]. Therefore, 2n ≤ [2l/5]. Thus we have the relation:

(19) If n is a number in the range [l/6] + 1 to [l/5], then 2n is in the range [l/3] + 1 to [2l/5].

It also follows from the above argument that if n is an integer such that 2n is in the range [l/3] + 1 to [2l/5], then n is in the range [l/6] + 1 to [l/5]. Thus for a particular value of a in (18) it is sufficient to compute the powers of the odd values of s in the range [l/3] + 1 to [2l/5]. The sums of the powers of the even values may be found in one operation by computing

(20) $(2^{2a-1} + 1) \sum_{s=[l/6]+1}^{[l/5]} s^{2a-1}.$
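The saving described by (19) and (20) can be seen in a few lines. The sketch below, with names of our own choosing, compares the direct sum over both ranges with the shortcut that uses only the odd values of the upper range together with the factor $2^{2a-1} + 1$ applied to the lower range.

```python
def ranges(l):
    """The two ranges of s occurring in (18) after cancellation."""
    lower = range(l // 6 + 1, l // 5 + 1)
    upper = range(l // 3 + 1, 2 * l // 5 + 1)
    return lower, upper

def sum_direct(l, a):
    lower, upper = ranges(l)
    e = 2 * a - 1
    return (sum(pow(s, e, l) for s in lower) + sum(pow(s, e, l) for s in upper)) % l

def sum_shortcut(l, a):
    lower, upper = ranges(l)
    e = 2 * a - 1
    odd_part = sum(pow(s, e, l) for s in upper if s % 2 == 1)
    # By (19) the even values of the upper range are exactly the doubles 2n of the
    # lower range, so they contribute 2^e * n^e; the lower range itself adds n^e.
    lower_and_even = (pow(2, e, l) + 1) * sum(pow(n, e, l) for n in lower)
    return (odd_part + lower_and_even) % l

l = 541
print(all(sum_direct(l, a) == sum_shortcut(l, a) for a in range(1, (l - 3) // 2 + 1)))
```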
M. M. Abernathy has obtained a result concerning primes of the form 4n + 1. If index s = 2k, then

ind $s^{(l-1)/2} = 2n \cdot 2k = 4nk \equiv 0 \pmod{l - 1}.$

Therefore $s^{(l-1)/2} \equiv 1 \pmod l$ and $s^{(l-1)/2+i} \equiv s^i \pmod l$, where i and k are integers and l is a given prime of the form 4n + 1. If index s = 2k + 1, then

ind $s^{(l-1)/2} = 2n(2k + 1) = 4nk + 2n \equiv 2n \pmod{l - 1}.$

Therefore $s^{(l-1)/2} \equiv -1 \pmod l$ and $s^{(l-1)/2+i} \equiv -s^i \pmod l$. Thus in testing for regularity of primes of the form 4n + 1, powers of s greater than (l − 1)/2 do not have to be computed. Division of values of s into two groups, those of even and those of odd index, permits computation of

$\sum s^{(l-1)/2+i}$

by taking

(21) $\sum s_1^i - \sum s_2^i,$

where $s_1$ ranges over values with even index, $s_2$ over those of odd index.
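Abernathy's remark amounts to the statement that $s^{(l-1)/2}$ is +1 or −1 modulo l according as the index of s is even or odd, so that the high powers follow from the low ones with at most a change of sign. A minimal sketch, in which the parity of the index is read off from $s^{(l-1)/2}$ itself rather than from a table of indices (our substitution for Jacobi's tables), and with function names of our own:

```python
def split_by_index_parity(values, l):
    """Separate s into even-index and odd-index values via s^((l-1)/2) mod l."""
    half = (l - 1) // 2
    even_index = [s for s in values if pow(s, half, l) == 1]
    odd_index = [s for s in values if pow(s, half, l) == l - 1]
    return even_index, odd_index

def high_power_sum(values, i, l):
    """Sum of s^((l-1)/2 + i) over the values, computed as in (21)."""
    even_index, odd_index = split_by_index_parity(values, l)
    return (sum(pow(s, i, l) for s in even_index)
            - sum(pow(s, i, l) for s in odd_index)) % l

l = 541                                   # a prime of the form 4n + 1
values = list(range(91, 109)) + list(range(181, 217))
direct = sum(pow(s, (l - 1) // 2 + 1, l) for s in values) % l
print(direct == high_power_sum(values, 1, l))
```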

In computing the right-hand member of (18) it sometimes happened that this was ≡ 0 (mod l) because the second factor on the right had its numerator divisible by l = p. In these cases (14), (15), and (16) were employed until the numerator of the factor not involving $b_{2a}$ was found to be ≢ 0 (mod l). When a $B_a$ was found with its numerator divisible by l, then this was checked by employing the tables of Bernoulli numbers of Lehmer or of previous writers, provided the Bernoulli number fell within the range of their computations. Further, to aid in this part of the work, Lehmer suggested and carried out the division of each $B_a$ (a ≤ 110) by each of the primes l, where 547 ≤ l < 601, and for all primes $l_1$ of the form 4n + 3, where 601 < $l_1$ < 619. Jacobi's table of indices⁵ was employed to carry out the other part of the computations. We shall now give an example of the method for l = p = 541. For this case, the terms in the right-hand member of (18) are

$2(D_1 - C_1) = 2\sum_{s=91}^{108} s^{2a-1},$

$D_2 - C_2 = \sum_{s=181}^{216} s^{2a-1} - \sum_{s=91}^{108} s^{2a-1}.$

5 Canon Arithmeticus, Berlin, 1839.










For each s appearing in the above ranges, $s^{2a-1}$ was computed employing Jacobi's tables for a = 1, 2, ..., (l − 3)/2; then the computations of $D_1 - C_1$ and $D_2 - C_2$ were each separated into two parts since information about some of the terms $s^{2a-1}$ in the second sum can be obtained immediately from some of those of the first, as we indicated at the beginning of §3. For the primes mentioned above tested by Lehmer we started with a = 111 in lieu of a = 1.

In the computation of $s^{2a-1}$ (a = 1, 2, ..., (l − 3)/2), periodicities may appear, which, since we are using indices, may be determined in advance. If l − 1 = km and index s = kn, then s will begin repeating at $s^{m+1}$; for $s^{m+1} \equiv s \pmod l$ follows from the fact that since $r^{kn} \equiv s$,

kn(m + 1) = kmn + kn ≡ kn (mod l − 1).

At any stage of the work we have another check by the possible use of the formula

$\sum_{a=1}^{k} s^{2a-1} = \frac{s^{2k+1} - s}{s^2 - 1},$

and in particular the convenient relation

$\sum_{a=1}^{(l-1)/2} s^{2a-1} \equiv 0 \pmod l.$
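Both checks mentioned here are elementary and can be verified mechanically. The sketch below tests the closed form for the sum of odd powers in exact integers, the vanishing of the full sum modulo l for the values of s that occur in the example l = 541, and the periodicity $s^{m+1} \equiv s$ for s a k-th power residue with km = l − 1. Note that the vanishing relation presupposes $s^2 \not\equiv 1 \pmod l$, which holds for all s in the ranges used. The function names are ours.

```python
def odd_power_sum(s, k):
    """s + s^3 + ... + s^(2k-1) as an exact integer."""
    return sum(s ** (2 * a - 1) for a in range(1, k + 1))

def closed_form(s, k):
    """(s^(2k+1) - s) / (s^2 - 1); the division is exact for integers s >= 2."""
    return (s ** (2 * k + 1) - s) // (s * s - 1)

def full_sum_mod(s, l):
    """Sum of s^(2a-1) for a = 1..(l-1)/2, reduced modulo l."""
    return sum(pow(s, 2 * a - 1, l) for a in range(1, (l - 1) // 2 + 1)) % l

def repeats_at(t, k, l):
    """For s = t^k with k dividing l - 1 and m = (l - 1)/k, test s^(m+1) = s mod l."""
    s, m = pow(t, k, l), (l - 1) // k
    return pow(s, m + 1, l) == s

l = 541
print(all(full_sum_mod(s, l) == 0 for s in range(91, 217)))
```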

The regular⁶ primes l (306 < l < 619) are as follows: 313, 317, 331, 337, 349, 359, 367, 373, 383, 389, 397, 419, 431, 439, 443, 449, 457, 479, 487, 499, 503, 509, 521, 563, 569, 571, 599, 601, 613, and the irregular primes are 307, 311, 347, 353, 379, 401, 409, 421, 433, 461, 463, 467, 491, 523, 541, 547, 557, 577, 587, 593, 607, 617.

As to the time required for the work on a particular prime l, close to the value 600, in a test for regularity, a person experienced in this work requires on the average about forty hours to complete the test provided the prime is of the form 4n + 3. A prime of the form 4n + 1 requires about two thirds of this time.

Concerning the numbers in the set

(22) $B_1, B_2, \ldots, B_{(l-3)/2}$

which are divisible by l for a particular l we found

$B_n \equiv 0 \pmod l$

in the following cases: l = 307, n = 44; l = 311, n = 146; l = 347, n = 140; l = 353, n = 93, n = 150; l = 379, n = 50, n = 87; l = 401, n = 191; l = 409, n = 63; l = 421, n = 120; l = 433, n = 183; l = 461, n = 98; l = 463, n = 65; l = 467, n = 47, n = 97; l = 491, n = 146, n = 169; l = 523, n = 123, n = 200; l = 541, n = 44; l = 547, n = 135, n = 243; l = 557, n = 111; l = 577, n = 26; l = 587, n = 45, n = 46; l = 593, n = 11; l = 607, n = 296; l = 617, n = 10, n = 87, n = 169. From the above it will be noted that just two of the B's in the set (22) are divisible by l in each of the cases l = 353, 379, 467, 491, 523, 547, 587. For 617, three of the B's in (22) are divisible by 617 and this is the first l of this sort encountered in our work since the beginning.

6 The irregular primes < 211 are listed by the writer in the Transactions of the American Mathematical Society, vol. 31, pp. 614, 615-616, and for 211 ≤ l < 307 in N. A., p. 667.


4. Treatment of the exponents in (I) which are irregular primes. Here we employ the following:⁷

THEOREM 1. Under the assumptions: none of the units $E_a$ ($a = a_1, a_2, \ldots, a_s$) is congruent to the l-th power of an integer in k(ζ) modulo $\mathfrak{p}$, where $\mathfrak{p}$ is a prime ideal divisor of p; p is a prime < (l² − l) of the form 1 + lk; and $a_1, a_2, \ldots, a_s$ are the subscripts of the B's in the set

$B_1, B_2, \ldots, B_{(l-3)/2}$

whose numerators are divisible by l; the relation (I) is impossible in non-zero rational integers x, y, and z.
In the above statement, 


e= (< —~pa-s 2); 
aq—pa— ry)’ 


r being a primitive root of l; $\zeta = e^{2i\pi/l}$. As indicated in N. A., p. 667, to test this for a particular irregular l and a p < (l² − l) of the form 1 + kl, we compute the value of $E_n(d)$ modulo p, where $E_n(d)$ is written in the form


a® “TL” al —™ ') r" Li 
i=0 dt’ — 1 7 


R=! St rt rp ee HY), 
h =1 — 2n. 
d is a rational integer such that

$d^l \equiv 1 \pmod p.$

In N. A., p. 668 a systematic method of carrying out such a computation was explained for the case l = 271, p = 1627. M. E. Tittle has further abbreviated this scheme in general. Since

$d^l \equiv 1 \pmod p,$

all values of $(d^{r^i} - 1)$ are obtained immediately from a table of the first powers of d, modulo p. This renders unnecessary the third row employed in N. A., p. 668 and eliminates the computation involved in obtaining the fourth row. A companion table to the powers of d, with powers of ρ, a primitive root of p, substituted and with blank spaces for the omitted numbers permits calculation of ind $(d^{r^i} - 1)$. If $(d^{r^i} - 1)$ is not one of the numbers appearing in the companion table, multiply $(d^{r^i} - 1)$ by ρ, and examine the table for the appearance of this number. Repetition of this operation will yield a result in at most k − 1 steps, for the numbers n, ρn, ρ²n, ..., $\rho^{k-1}n$ have as indices the numbers m, m + 1, m + 2, ..., m + k − 1, where index n = m, and one of the numbers in this last set is of the form ki, i an integer, and all indices of the form ki are presented. To obtain the actual value of $(d^{r^i} - 1)$, subtract from the observed value in the table the power of ρ required in the multiplication $\rho^j(d^{r^i} - 1)$. Detailed checks on this type of work are described in N. A., p. 669.

7 N. A., p. 670, Theorem IV; Bulletin of the American Mathematical Society, vol. 40 (1934), p. 124, paragraph immediately following statement of Theorem 2.
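The table-lookup procedure just described can be sketched as follows. The code assumes that d is taken as $\rho^k$ with p = 1 + kl, so that the powers of d are exactly the residues whose index is a multiple of k; this agrees with the entry d = 168, p = 1229, ρ = 10 for l = 307, since $10^4 \equiv 168 \pmod{1229}$, but it is our reading of the scheme and not a statement of it. The primitive root is found by search rather than taken from the table, and all function names are ours.

```python
def primitive_root(p):
    """Smallest primitive root of the prime p, by trial."""
    factors, n, q = set(), p - 1, 2
    while q * q <= n:
        while n % q == 0:
            factors.add(q)
            n //= q
        q += 1
    if n > 1:
        factors.add(n)
    for g in range(2, p):
        if all(pow(g, (p - 1) // f, p) != 1 for f in factors):
            return g

def build_table(p, l, rho):
    """Table of the l powers of d = rho^k, each stored with its index k*i."""
    k = (p - 1) // l
    d = pow(rho, k, p)
    table, x = {}, 1
    for i in range(l):
        table[x] = k * i
        x = x * d % p
    return d, table

def index_via_table(x, p, l, rho, table):
    """Index of x to base rho: multiply by rho until the product is in the table."""
    k = (p - 1) // l
    for j in range(k):                     # at most k - 1 multiplications are needed
        y = x * pow(rho, j, p) % p
        if y in table:
            return (table[y] - j) % (p - 1)
    raise ValueError("x must be prime to p")

p, l = 1229, 307
rho = primitive_root(p)
d, table = build_table(p, l, rho)
print(pow(d, l, p) == 1, pow(rho, index_via_table(2, p, l, rho, table), p) == 2)
```

The table holds only l of the p − 1 residues, which is the economy the text points to: the missing indices are recovered by at most k − 1 multiplications by ρ.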

The following table gives the specific results of the computation for each irregular prime. In the table, l is the prime exponent appearing in (I), n is the subscript of a Bernoulli number in the set (22) whose numerator is divisible by l, r is a primitive root of l, d is the integer selected such that

$d^l \equiv 1 \pmod p,$

p is a prime of the form 1 + kl referred to in the statement of Theorem 1, ρ is a primitive root of p, ind is the index, modulo l, found for $E_n(d)$, modulo p. When two values of n are listed for a particular l, then the corresponding indices are listed in the same order in the last column. Thus for l = 353, $B_{93} \equiv 0 \pmod{353}$, ind $E_{93}(2804) \equiv 57 \pmod{353}$; $B_{150} \equiv 0 \pmod{353}$, ind $E_{150}(2804) \equiv 13 \pmod{353}$. The exponents l = 587 and l = 617 have not yet been tested by the criteria of Theorem 1.


  l     n              r      d        p       ρ      ind
 307    44             5      168      1229    10     213
 311    146            308    135      1867    1857   27
 347    140            337    64       2083    2      333
 353    93, 150        3      2804     4943    10     57, 13
 379    50, 87         2      3954     4549    6      101, 200
 401    191            211    3^8      3209    3      365
 409    63             235    2        1637    2      115
 421    120            238    6^10     4211    6      338
 433    183            10     2        1733    2      332
 461    98             10     10^6     2767    10     190
 463    65             174    2^12     5557    2      215
 467    47, 97         10     ?        2803    2      347, 87
 491    146, 169       10     5        983     5      148, 382
 523    123, 200       10     ?        5231    2      85, 228
 541    44             10     3^?      9739    3      458
 547    35, 243        17     ?        5471    3      177, 179
 557    111            41     10^6     3343    10     222
 577    26             10     15       2309    2      556
 587    45, 46
 593    11             10     ?        1187    2      523
 607    206            575    2^?      3643    2      491
 617    10, 87, 169
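
The defining conditions on the table entries can be spot-checked mechanically. The following Python sketch is not part of the original computation; it takes four rows whose d entries are printed as plain integers and verifies that p = 1 + kl and that d^l ≡ 1 (mod p). It also checks that d agrees with ρ^k reduced modulo p, which is an observation about these four rows and not a rule stated in the text. The name check_row is illustrative.

```python
# Rows copied from the table above: (l, d, p, rho).
ROWS = [
    (307, 168, 1229, 10),
    (347, 64, 2083, 2),
    (353, 2804, 4943, 10),
    (379, 3954, 4549, 6),
]


def check_row(l, d, p, rho):
    """True if p = 1 + k*l, d != 1, d**l = 1 (mod p) and d = rho**k (mod p)."""
    k, remainder = divmod(p - 1, l)
    if remainder != 0:
        return False            # p is not of the form 1 + k*l
    if d % p == 1 or pow(d, l, p) != 1:
        return False            # d is not a non-trivial l-th root of unity mod p
    # Observation only: in these rows d is the k-th power of the primitive root.
    return pow(rho, k, p) == d % p


for row in ROWS:
    print(row, check_row(*row))
```

Three-argument pow performs the modular exponentiation directly, so no index table is needed for a check of this kind.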













It is a little curious that the smallest prime p satisfying the conditions in
Theorem 1 gave for each l

ind $E_n(d) \not\equiv 0 \pmod{l}$.

We may now state
THEOREM 2.

$x^{n} + y^{n} = z^{n}$

is impossible in non-zero rational integers x, y, and z if n is a given integer,
2 < n < 617, excepting possibly 587.

The above results have also been established for various prime exponents
l < 700 not included in the above. The computations are proceeding under
other auspices.


5. Application of the data obtained to other parts of the theory of cyclotomic
fields. The numerical data obtained give special information concerning many
other questions in cyclotomic field theory. First, various results are known⁸
concerning regular cyclotomic fields which have not been extended to other
fields, and which have been applied to obtain results in Diophantine Analysis.
By means of our work described here, we have isolated many regular cyclotomic
fields in which all the results just referred to apply.

Let us now consider primary units in a cyclotomic field. A unit η is primary
in the field $k(\zeta)$ defined by a primitive l-th root of unity ζ if it is not the l-th
power of an integer in $k(\zeta)$ and if

$\eta \equiv a^{l} \pmod{\lambda^{l}}$,

where a is an integer in $k(\zeta)$ and λ = (1 − ζ). If l is regular⁹ there is no primary
unit in $k(\zeta)$. If l is irregular, we may show that $E_n(\zeta)$ is primary provided
$B_n \equiv 0 \pmod{l}$. For, we may set


E.(g) = En) + aA) — 9°. 
Then for an indeterminate w 


t 
_ ’ a > w-—l 
(23) w E,(w)' ' = E,(1)'' + &(w)(1 — w)* + 3 w)(’ ") 
v — 
since V(w) is some polynomial in w with rational integral coefficients. In this 
relation set w = e', take the logarithms of each member, differentiate & times, 
and set v = 0. Then" since for i # n 


24 a =. —Dis—n) - 
(24) k log E.,,(¢ | 3 byt B @* — 2), 
vm t 


dy™ ae. | y 4 

⁸ Hilbert, Werke, 1, pp. 278-312. Maillet, Annali di Mat., (3), vol. 12 (1906), pp.
145-178; Acta Math., vol. 24 (1901), pp. 247-256; Vandiver, Proc. Natl. Acad. Sci., vol. 17
(1931), pp. 662-663; Monats. Math. u. Phys., vol. 43 (1936), pp. 317-320.

⁹ Hilbert, loc. cit., p. 287.

¹⁰ Vandiver, Transactions of the American Mathematical Society, vol. 31 (1929), pp.
619-620.













and fort = n 
E log pe) _ (=) BG — 1)G* — 1) 
dv" vel) im ; " 


4n : 


using B, = 0 with 
| — log re | sods 
veel) : 


dy**+1 
d‘ 6(e")(1 — e°)* 
du* 


ad —* =xHa -H™. 


To prove that x(¢) is divisible by 1 — ¢ and hence that £,(¢) is primary we 
may write (23) in the form, d being some integer, 


we obtain 


0 (mod J) (k = 1,2,---,l— 2). 


Hence 


U 
(25) w"* E,(w)'* = E,(1)'* + x(w)(1 — w)'* + Vi(w)(w! — 1) +d © : 


In this relation set w = 1; this givesd = 0. In (25) set w = e’, take logarithms 
of each member, differentiate (1 — 1) times, and set v = 0, and we have, em- 


. “yy — yt 
ploying (24), since k q = ) | = 0 (mod l) (s < 1 — 1), 
v vel) 
d' 'y(e(Q e”)! . - 
| d= * 0 (mod J), 
and 
d YY —_ e")' 1 = 
| x’ ) a he 0 (mod J), 
whence 
Ix(e")|0 = 0 (mod J), 
or 
x(e') = (1 — e’)w(e’) (mod J). 


This gives the result. 

We may now show that $E_n(\zeta)$ is primary when $B_n \equiv 0 \pmod{l}$ if we note
also that $E_n$ is never the l-th power of a unit in $k(\zeta)$ in any of the cases we
have tested, since in each such case we found a prime ideal 𝔓 such that $E_n(\zeta)$
is not congruent to the l-th power of an integer in $k(\zeta)$ modulo 𝔓, where 𝔓
is a prime ideal divisor of p. Hence for any l where we have found $E_n(\zeta)$
primary we have explicitly determined an absolute (Hilbert) class field¹¹ of $k(\zeta)$.


¹¹ Hilbert, loc. cit., pp. 149-156.
















We now consider the connection of our results with the second factor of the 
class number of k(¢). This factor may be written” 


Mine > Ny 


where A is the determinant 


the b’s and n’s being rational integers not all zero such that 
hy => ef! ee? eae ei (s = l, 2. ---,h), 


where y1, Y2,--- , ¥:, is a set of fundamental units of k(¢); « is the unit ob- 
tained from our ¢, previously defined, by the substitution (¢ ee i, = 
(l − 3)/2. In a former paper,¹³ the writer showed that for this second factor
(say $h_2$) to be divisible by l it is necessary and sufficient that at least one of the
units $E_i(\zeta)$ (i = 1, 2, …, h) be the l-th power of a unit in $k(\zeta)$. In view of
what we observed above, concerning the E's not being congruent to the l-th
power of an integer in $k(\zeta)$ modulo 𝔓, we may state the

THEOREM 3. If l is an odd prime and $\zeta = e^{2i\pi/l}$, then the second factor of the
class number of the field $k(\zeta)$ is prime to l for each l < 617 excepting possibly 587.

If h is the class number of $k(\zeta)$, where $k(\zeta)$ is defined by an irregular prime,
then we may write

h = I'j, 

where j is prime to l. If we raise each ideal in $k(\zeta)$ to the j-th power, then the
ideal classes defined by these ideals will form a group called the irregular class
group of $k(\zeta)$. The prime ideal 𝔓, the factor of (p), which we have used in
connection with each $E_n$ which was primary, is such that 𝔓 belongs to this
irregular class group. This follows since $E_n(\zeta)$ is not congruent to the l-th
power of an integer in the field $k(\zeta)$, and, since $E_n(\zeta)$ is primary, this cannot hold
if 𝔓 does not belong to the irregular class group. That is, $\mathfrak{P}^{k}$ is not a principal
ideal unless k is divisible by l. For example, turning to the table we see that
for l = 307 the prime ideal 𝔓, a prime ideal divisor of 1229, belongs to the
irregular class group, etc. Specifically, such ideals can be written in the form


(¢—p”””, p) 
and since they are not principal, they cannot be further reduced. 
¹² Report on the Theory of Algebraic Numbers, Bull. Nat. Res. Coun., No. 2, February,
1928, pp. 34 and 38, with references to Kummer there given; Fueter, Synthetische
Zahlentheorie, Berlin and Leipzig, 1925, 2d ed., p. 228.
¹³ Proc. Natl. Acad. Sci., vol. 16 (1930), pp. 743-749.













In connection with some previous work on Fermat's Last Theorem we employed
for particular exponents the following (N. A., p. 670)

THEOREM 4. Under the following assumptions

(1) the second factor of the class number of the field $k(\zeta)$ is prime to l;

(2) none of the Bernoulli numbers $B_{nl}$ (n = 1, 2, …, (l − 3)/2) is divisible
by $l^{3}$,
the equation (1) is impossible in rational non-zero integers x, y, and z.

The above theorem was applied to prove Fermat's Last Theorem for all
primes l < 307. We are enabled to remove the restriction to case II mentioned
in the theorem in N. A. because of the results in another paper of the writer's.¹⁴
The question involved in the second assumption in the theorem is intimately
connected with other questions concerning the divisibility of certain other
Bernoulli numbers by l. We have the known relation


Bi., — 2B... + B. =0 (mod?P); uz = (l — 1)/2, 
where 
Bi = = By 


From this we obtain easily by induction” 
(26) | a) rn ee (mod [’). 


If we now select l and n so that $B_n \equiv 0 \pmod{l}$ with $B'_{n+\mu} \not\equiv B'_n \pmod{l^{2}}$,
it follows that there exists an integer y which yields $B_{n+y\mu} \equiv 0 \pmod{l^{2}}$. Pollaczek
took l = 37, n = 16, which gave y = 7; hence $B_{142} \equiv 0 \pmod{37^{2}}$. He
also found two other examples of this type. Hence the numerator of a Bernoulli
number may be divisible by the square of a proper divisor, where a proper
divisor of the numerator of a Bernoulli number $B_i$ is one which is prime to i.
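
Pollaczek's example (l = 37, n = 16, y = 7) can be checked by exact arithmetic. The sketch below is only an illustration and rests on two assumptions that the text does not spell out here: that the Bernoulli number written $B_n$ in this paper agrees, up to sign, with the number written $B_{2n}$ in the now-common convention, and that μ = (l − 1)/2 = 18, so that the index reached is 16 + 7·18 = 142, i.e. $B_{284}$ in the common convention. The function name bernoulli_numbers is hypothetical.

```python
from fractions import Fraction
from math import comb


def bernoulli_numbers(m):
    """Return [B_0, ..., B_m] in the common convention (B_1 = -1/2), exactly."""
    b = [Fraction(1)]
    for n in range(1, m + 1):
        # Recurrence: sum_{k=0}^{n} C(n+1, k) * B_k = 0 for n >= 1.
        s = sum(comb(n + 1, k) * b[k] for k in range(n))
        b.append(-s / (n + 1))
    return b


l = 37
B = bernoulli_numbers(284)
num_32 = abs(B[32].numerator)     # corresponds to n = 16 in the paper's indexing
num_284 = abs(B[284].numerator)   # corresponds to n + 7*mu = 142

print("37 divides numerator of B_32:      ", num_32 % l == 0)
print("37^2 divides numerator of B_32:    ", num_32 % l**2 == 0)
print("37^2 divides numerator of B_284:   ", num_284 % l**2 == 0)
```

Exact fractions are used instead of floating point because the numerators involved run to several hundred digits.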

Pollaezek showed that a necessary condition that k(¢) contain an ideal be- 
longing to the exponent [ was that one of the (J — 3)/2 Bernoulli numbers 
Besar Ct . eee, 4) be divisible by l. In view of the above, we 
consider the number of Bernoulli numbers in the set 


B, ‘ B,, ae * ss Basa l)y 


which are divisible by [The relation (26) shows that if two of these numbers 
are divisible by ©, then all of them are. Here we are assuming B, = 0 (mod lL). 
The work of Pollaezek mentioned above for the case | = 37 shows that 


B.., # B. (mod [). 

Hence it follows that, since 
Bia = 0 (mod 37°), 
- ZV (mod 0’) 


Houlletin of the American Mathematical Society, vol. 40 (1934), p. 118, Theorem | 
*Pollaezek, Math. Zeitschrift, vol. 21 (1924), p. 46 










or 

B,. F# 0 (mod 1°) 
for n = 16,1 = 37 likewise 

Buy # 90 (mod I). 
Here we note that 

ee (2n — I+ z 
2 

and if 

Bu, = 0 (mod /°), 


we may have an ideal k(¢) belonging to the exponent [. Hence Pollaczek’s 
computations verify independently for | = 37 that the class number of k(¢) is 
divisible by 1 but not by T° and also that 


Nie.s7 # O (mod 37°). 


His work (loc. cit.) in connection with the primes 59 and 67 furnishes similar 
checks. 

The data obtained concerning the examination of primes as to regularity
furnish much other information concerning the divisibility properties of the
Bernoulli numbers. For example, we may find the least positive residue of the
numerator of any Bernoulli number modulo l, where l is any prime < 619.
For, in each case we have computed the left-hand members of one of the
formulas (14), (15), (16), or (18), finding the least positive residue modulo l for
the p used in these formulas. Thus in (14) we may conveniently reduce the
factor on the right not involving B, employing Jacobi's table of indices and
also using the explicit known form of the denominator of $B_n$. For a > (l − 3)/2
we employ the known formula


» &. B 
(. $7 —=—2 a (mod 1) 
a— up a 
except when a is a multiple of uw, a = (J — 1) 2. In the latter case we have 
pBy, - } (mod 2) 


which enables us to reduce the numerator of B,, . modulo / 

We may note that arithmetical machines were not employed in the computations
described in this paper except in connection with addition. In my opinion,
there are probably many other types of number theoretic computation in which,
in the future, it will be very convenient to employ, if they exist, tables of indices
for primes beyond 1000. In connection with the second type of computation
involving the treatment of irregular exponents in (1) we constructed special
tables of indices as previously described. That is, for the value of p which we
used, we set up partial companion tables from which we can quickly obtain
the smallest residue of any integer raised to any power modulo p and, having
given an integer, we can find what power of ρ is congruent to that integer modulo
p, where ρ is a primitive root of p. For the primes < 211 these companion
tables are complete for each p selected in connection with each irregular prime l.
For the convenience of anyone who may wish to make use of these particular
tables, I shall list values of p > 1000 that we employed in this connection.

1187, 1229, 1543, 1579, 1627, 1637, 1699, 1733, 1867, 2083, 2309, 2767, 2803, 
3209, 3343, 3643, 4211, 4549, 4943, 5231, 5471, 5557, 9739. 
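
For a reader who wishes to reproduce tables of this kind, a table of indices and its companion table of powers can be generated as in the following sketch. It is a modern illustration, not the procedure used for the paper; the function names are hypothetical, and the pair p = 1229, ρ = 10 is taken from the row l = 307 of the table of results.

```python
def build_index_tables(p, rho):
    """Return (power, index) with power[i] = rho**i mod p and index[a] = i."""
    power = [1] * (p - 1)
    for i in range(1, p - 1):
        power[i] = power[i - 1] * rho % p
    index = {a: i for i, a in enumerate(power)}
    if len(index) != p - 1:
        # Some residue repeated, so rho does not generate all residues mod p.
        raise ValueError("rho is not a primitive root of p")
    return power, index


def power_residue(a, e, p, power, index):
    """Least positive residue of a**e modulo p, read off from the two tables."""
    return power[index[a % p] * e % (p - 1)]


power, index = build_index_tables(1229, 10)
print(index[168])                                    # index of d = 168 to the base 10
print(power_residue(168, 307, 1229, power, index))   # d**l mod p
```

Raising to a power becomes a multiplication of indices modulo p − 1, which is the reason such tables make the computations described above practicable by hand.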

All the numerical data obtained in the course of this work will ultimately be 
deposited in the library of the American Mathematical Society. 


UNIVERSITY OF TEXAS. 


NOTE 
Since the above was written, Fermat's Last Theorem has been proved for the
case l = 587. Hence, in view of Theorem 2, the theorem is true for all prime
exponents less than 617.














ON THE NECESSARY CONDITIONS FOR THE MINIMUM OF A 
DOUBLE INTEGRAL 


By Max Coral


1. Introduction. The purpose of the present paper is to exhibit an intimate
connection which exists between the theory of the calculus of variations for a
double integral and the corresponding theory for a simple integral. The variation
problem which will be discussed is that of minimizing an integral

(1.1)    $\iint_A f(x, y, z_1, \dots, z_n, z_{1x}, \dots, z_{nx}, z_{1y}, \dots, z_{ny})\,dx\,dy$

in a certain class of sets of functions $z_i(x, y)$ (i = 1, 2, …, n) all of which take
on the same values on the boundary of the region A of integration. When it is
not assumed that the minimizing set $Z_i(x, y)$ (i = 1, 2, …, n) have continuous
partial derivatives of order greater than the first, the differential equations
which must be satisfied by the minimizing set were first derived (at least for
the case n = 1) by Haar [1],¹ [2], who made use of his so-called Fundamental
Lemma of the calculus of variations for double integrals. A survey of the
literature concerning this Fundamental Lemma will be found in the Chicago
dissertation of Miss Huke [3]; the proof of the lemma was simplified considerably
by Haar in his last paper [4] on the subject.

Of the further necessary conditions for a minimum of the integral (1.1), the
analogue for double integrals of the Legendre condition for the minimum of a
simple integral was first established by Mason [5] for the case n = 1. The
analogue of the Weierstrass condition was first proved by E. E. Levi [6] for the
case n = 1 and by McShane [7] for the general case. We shall not be concerned
in this paper with the analogues of the Jacobi condition.

The Fundamental Lemma of Haar is unnecessary for the development of the
theory of the calculus of variations for the integral (1.1) and the well-known
Du Bois-Reymond lemma of the theory for simple integrals suffices. Indeed,
it will be shown below that there is associated with the problem for the integral
(1.1) an auxiliary minimum problem for a simple integral and that the necessary
conditions of Haar, Weierstrass, and Legendre for the problem involving the
integral (1.1) can be respectively deduced as simple corollaries of the necessary
conditions of Euler, Weierstrass, and Legendre for the auxiliary problem. The
condition of Haar will be derived below in a modified form involving integral
equations.


Received May 24, 1937. The results contained in this paper were obtained in part by
the author during his tenure of a National Research Fellowship.
¹ Numbers in square brackets refer to the bibliography at the end of the paper.













2. Hypotheses. Let A be a region of points (x, y) and let

(2.1)    $z_i = Z_i(x, y)$    [(x, y) in A; i = 1, 2, …, n]

be functions which are continuous and which have continuous partial derivatives
of the first order in A. The function f(x, y, z, p, q) will be supposed to be
defined and continuous for sets

$(x, y, z, p, q) = (x, y, z_1, \dots, z_n, p_1, \dots, p_n, q_1, \dots, q_n)$

in a neighborhood G of the sets $(x, y, Z, Z_x, Z_y)$ belonging to (2.1), and in G
the function f shall have continuous derivatives of the first and second orders
with respect to the arguments $z_i$, $p_i$, $q_i$ (i = 1, 2, …, n).

We shall denote by 𝔐 the class of sets of functions

(2.2)    $z_i = z_i(x, y)$    [(x, y) in A; i = 1, 2, …, n]

which have the following properties:

(a) the functions $z_i(x, y)$ are continuous in A and the region A may be decomposed
into a finite number of subregions, each bounded by a simply closed
regular curve, such that on each subregion the functions $z_i(x, y)$ have continuous
partial derivatives of the first order;

(b) the elements $(x, y, z, z_x, z_y)$ belonging to a set $z_i(x, y)$ are all in G;

(c) the functions $z_i(x, y)$ take on the same values on the boundary of A as do
the functions of the set (2.1).

Under these hypotheses the integral

$J = \iint_A f(x, y, z, z_x, z_y)\,dx\,dy$

has a finite real value when computed for any set $z_i(x, y)$ of the class 𝔐. We
shall suppose that the set (2.1) furnishes a minimum to the integral J in the
class 𝔐. The necessary conditions of Haar, Weierstrass, and Legendre which
must then be satisfied by the minimizing set (2.1) will be derived below by
consideration of an auxiliary minimum problem for a simple integral.


3. The auxiliary minimum problem. Consider any point $(x_0, y_0)$ interior
to A and let Γ represent a ring of points (x, y) defined by the equations

(3.1)    $x = x_0 + \rho\cos\theta, \quad y = y_0 + \rho\sin\theta \qquad (0 < r_1 \le \rho \le r;\ 0 \le \theta \le 2\pi)$,

where r is chosen so small that Γ lies interior to A. Let T(θ) be any function
which is continuous on 0 ≤ θ ≤ 2π and such that T(0) = T(2π) and which
has the further property that the interval 0 ≤ θ ≤ 2π may be decomposed into
a finite number of subintervals on each of which T(θ) has a continuous first
derivative, and let 𝔑 represent the class of sets of functions $R_i(\rho)$ ($r_1 \le \rho \le r$;
i = 1, 2, …, n) which with their derivatives $R_i'(\rho)$ are continuous on
$r_1 \le \rho \le r$ and have all their elements $[\rho, R(\rho), R'(\rho)]$ sufficiently near the sets
(ρ, 0, 0) and for which $R_i(r_1) = R_i(r) = 0$ (i = 1, 2, …, n). For any set
$R_i$ of 𝔑 put

(3.2)    $\zeta_i = Z_i(x_0 + \rho\cos\theta,\ y_0 + \rho\sin\theta) + R_i(\rho)\,T(\theta)$
             $(r_1 \le \rho \le r;\ 0 \le \theta \le 2\pi;\ i = 1, 2, \dots, n)$.

Then in Γ we have

(3.3)    $\zeta_{ix} = Z_{ix} + R_i'\,T\cos\theta - R_i\,T'\,(\sin\theta/\rho)$,
         $\zeta_{iy} = Z_{iy} + R_i'\,T\sin\theta + R_i\,T'\,(\cos\theta/\rho)$    (i = 1, 2, …, n).

If the definition of the functions $\zeta_i$ is extended so that $\zeta_i = Z_i$ in A − Γ
(i = 1, 2, …, n) the resulting set of functions clearly belongs to 𝔐. With
T(θ) held fixed, $f(x_0 + \rho\cos\theta,\ y_0 + \rho\sin\theta,\ \zeta, \zeta_x, \zeta_y)$ becomes a function of
(ρ, θ, R, R') as is clear from (3.2) and (3.3). If now we put

(3.4)    $g(\rho, R, R') = \int_0^{2\pi} f(x_0 + \rho\cos\theta,\ y_0 + \rho\sin\theta,\ \zeta, \zeta_x, \zeta_y)\,\rho\,d\theta$,

then g(ρ, R, R') is defined for all R and R' sufficiently small and for all ρ such
that the circle $x = x_0 + \rho\cos\theta$, $y = y_0 + \rho\sin\theta$ lies in A. Then the minimizing
property of the set (2.1) implies that the set $R_i(\rho) \equiv 0$ ($r_1 \le \rho \le r$;
i = 1, 2, …, n) furnishes a minimum to the integral²

$J = \int_{r_1}^{r} g[\rho, R(\rho), R'(\rho)]\,d\rho$

in the class 𝔑.


4. The necessary condition of Haar. A first necessary condition which must
be satisfied by the minimizing set (2.1) is contained in the following theorem:
THEOREM 1. For every simply closed regular curve C lying interior to A the
equations

(4.1)    $\oint_C f_{p_i}\,dy - f_{q_i}\,dx = \iint f_{z_i}\,dx\,dy \qquad (i = 1, 2, \dots, n)$

are satisfied, the arguments of $f_{p_i}$, $f_{q_i}$ and $f_{z_i}$ being the set $(x, y, Z, Z_x, Z_y)$ belonging
to the minimizing set (2.1).
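
As an illustration of condition (4.1), not taken from the paper, consider n = 1 and the Dirichlet integrand f = p² + q², for which $f_p = 2Z_x$, $f_q = 2Z_y$ and $f_z = 0$, and read the double integral in (4.1) as taken over the region bounded by C. The Python sketch below evaluates the left member of (4.1) on a circle by the midpoint rule. For the harmonic function Z = x² − y² it vanishes, in agreement with (4.1); for Z = x² + y², which is not harmonic, it equals 8πr² and (4.1) fails. The function names and the choice of test functions are assumptions made only for this example.

```python
import math


def haar_left_side(grad_Z, x0, y0, radius, steps=2000):
    """Midpoint-rule value of the line integral of f_p dy - f_q dx over a
    circle, for the illustrative integrand f = p**2 + q**2."""
    h = 2.0 * math.pi / steps
    total = 0.0
    for j in range(steps):
        theta = (j + 0.5) * h
        x = x0 + radius * math.cos(theta)
        y = y0 + radius * math.sin(theta)
        Zx, Zy = grad_Z(x, y)
        f_p, f_q = 2.0 * Zx, 2.0 * Zy
        dx = -radius * math.sin(theta) * h
        dy = radius * math.cos(theta) * h
        total += f_p * dy - f_q * dx
    return total


def grad_harmonic(x, y):
    return 2.0 * x, -2.0 * y      # Z = x**2 - y**2


def grad_paraboloid(x, y):
    return 2.0 * x, 2.0 * y       # Z = x**2 + y**2


print(haar_left_side(grad_harmonic, 0.3, -0.2, 0.5))     # close to 0
print(haar_left_side(grad_paraboloid, 0.3, -0.2, 0.5))   # close to 8*pi*0.25
```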

The condition (4.1), while not in the form given by Haar, is equivalent to his
system of differential equations. The theorem will be proved first for the case
when C is a circle. Let C have its center at $(x_0, y_0)$ and radius r. Let $r_1$ be
any positive number less than r and consider the ring Γ with center at $(x_0, y_0)$
and radii $r_1$ and r. For the corresponding auxiliary minimum problem, constructed
as described in the preceding section, the minimizing set $R_i(\rho) \equiv 0$
($r_1 \le \rho \le r$; i = 1, 2, …, n) must satisfy the following equations, which are
the Du Bois-Reymond form of the Euler equations for the problem:

$g_{R_i'}(r, 0, 0) = \int_{r_1}^{r} g_{R_i}(\rho, 0, 0)\,d\rho + g_{R_i'}(r_1, 0, 0) \qquad (i = 1, 2, \dots, n)$

or, by virtue of (3.4), with T(θ) ≡ 1,

(4.2)    $\int_0^{2\pi} \big[(f_{p_i}\cos\theta + f_{q_i}\sin\theta)\,\rho\big]_{\rho=r_1}^{\rho=r}\,d\theta = \int_{r_1}^{r}\!\int_0^{2\pi} f_{z_i}\,\rho\,d\rho\,d\theta \qquad (i = 1, 2, \dots, n)$,

the arguments of $f_{p_i}$, $f_{q_i}$, $f_{z_i}$ in (4.2) being the set $[x, y, Z(x, y), Z_x(x, y), Z_y(x, y)]$
with x and y replaced by the values given in (3.1). Taking the limit as $r_1$
approaches zero in (4.2) one secures the desired equations (4.1) for the case of
the circle C.

² The integrand g(ρ, R, R') possesses all the continuity properties needed to establish
the first three necessary conditions for the auxiliary minimum problem. See Carathéodory,
Variationsrechnung und partielle Differentialgleichungen, Leipzig, 1935, p. 190, footnote.

Now let C be any simply closed regular curve interior to A. Consider a
closed region A' interior to A and containing C in its interior, and let r > 0
represent the minimum distance between the boundaries of A and A'. If
$C_{x,y,\rho}$ is a circle with center at (x, y) and radius ρ, then the functions

(4.3)    $M_{i\rho}(x, y) = \iint_{\xi^2+\eta^2 \le \rho^2} f_{p_i}\,d\xi\,d\eta$,
         $N_{i\rho}(x, y) = \iint_{\xi^2+\eta^2 \le \rho^2} f_{q_i}\,d\xi\,d\eta$,    (i = 1, 2, …, n)
         $P_{i\rho}(x, y) = \iint_{\xi^2+\eta^2 \le \rho^2} f_{z_i}\,d\xi\,d\eta$,

in which the arguments of $f_{p_i}$, $f_{q_i}$ and $f_{z_i}$ are the set $[x + \xi,\ y + \eta,\ Z(x + \xi, y + \eta),\ Z_x(x + \xi, y + \eta),\ Z_y(x + \xi, y + \eta)]$, have the following properties:³
(a) for each value of ρ on the interval 0 < ρ ≤ r the functions (4.3) are single-valued
and continuous for (x, y) in A';
(b) for each value of ρ on 0 < ρ ≤ r the functions (4.3) have continuous first
partial derivatives with respect to x and y for (x, y) interior to A'; these derivatives
are given by

(4.4)    $\dfrac{\partial}{\partial x} M_{i\rho}(x, y) = \oint_{C_{x,y,\rho}} f_{p_i}\,d\eta, \qquad \dfrac{\partial}{\partial y} M_{i\rho}(x, y) = -\oint_{C_{x,y,\rho}} f_{p_i}\,d\xi \qquad (i = 1, 2, \dots, n)$

with similar formulas for the partial derivatives of $N_{i\rho}$ and $P_{i\rho}$;
(c) the functions (4.3) satisfy the conditions

$\lim_{\rho \to 0}\,(1/\pi\rho^2)\,M_{i\rho}(x, y) = f_{p_i}[x, y, Z(x, y), Z_x(x, y), Z_y(x, y)]$,
$\lim_{\rho \to 0}\,(1/\pi\rho^2)\,N_{i\rho}(x, y) = f_{q_i}[x, y, Z(x, y), Z_x(x, y), Z_y(x, y)]$,    (i = 1, 2, …, n)
$\lim_{\rho \to 0}\,(1/\pi\rho^2)\,P_{i\rho}(x, y) = f_{z_i}[x, y, Z(x, y), Z_x(x, y), Z_y(x, y)]$,

uniformly for (x, y) in A'.

³ Property (c) is readily proved by means of the mean value theorem. For a proof of
the analogue of property (b) for triple integrals see O. D. Kellogg, Foundations of Potential
Theory, Berlin, 1929, p. 224.

Consider now any value ρ of the interval 0 < ρ < r. Since for each point
(x, y) of C the equations (4.1) hold on the circle $C_{x,y,\rho}$, we have

$(1/\pi\rho^2) \iint \Big[ \iint f_{z_i}\,d\xi\,d\eta \Big] dx\,dy = (1/\pi\rho^2) \iint \Big[ \oint_{C_{x,y,\rho}} f_{p_i}\,d\eta - f_{q_i}\,d\xi \Big] dx\,dy$

$\qquad = (1/\pi\rho^2) \iint \Big[ \dfrac{\partial}{\partial x} M_{i\rho}(x, y) + \dfrac{\partial}{\partial y} N_{i\rho}(x, y) \Big] dx\,dy$

$\qquad = (1/\pi\rho^2) \oint_C M_{i\rho}(x, y)\,dy - N_{i\rho}(x, y)\,dx \qquad (i = 1, 2, \dots, n)$,

by virtue of Green's Theorem and property (b) above. Using property (c) we
secure the equations (4.1) by taking the limit as ρ approaches zero.

5. The necessary condition of Weierstrass. The Weierstrass E-function is
defined to be

$E(x, y, z, p, q, P, Q) = f(x, y, z, P, Q) - f(x, y, z, p, q)$
$\qquad - (P_i - p_i)\,f_{p_i}(x, y, z, p, q) - (Q_i - q_i)\,f_{q_i}(x, y, z, p, q)$,

the repeated subscript i indicating summation with respect to i over the
range i = 1, 2, …, n. We have then the following result:
THEOREM 2. At each point $(x_0, y_0)$ of A the inequality

(5.1)    $E[x_0, y_0, Z(x_0, y_0), Z_x(x_0, y_0), Z_y(x_0, y_0), P, Q] \ge 0$

holds for every set (P, Q) for which $[x_0, y_0, Z(x_0, y_0), P, Q]$ is in G and for which
the matrix

(5.2)    $\|\, P_i - Z_{ix}(x_0, y_0), \quad Q_i - Z_{iy}(x_0, y_0) \,\|$

has rank one.
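
The role of the rank-one restriction can be seen on a simple integrand. The following Python sketch is an illustration only and is not drawn from the paper: it takes n = 2 and the hypothetical integrand $f = p_1 q_2 - p_2 q_1$, for which the E-function reduces to the determinant of the differences $(P_i - p_i,\ Q_i - q_i)$. The E-function then vanishes whenever the matrix of differences has rank one, but it can be negative when the rank is two. No claim is made that this f admits a minimizing set.

```python
def f(p, q):
    """Illustrative integrand for n = 2, independent of x, y and z."""
    return p[0] * q[1] - p[1] * q[0]


def grad_f(p, q):
    """Return ((f_p1, f_p2), (f_q1, f_q2)) for the integrand above."""
    return (q[1], -q[0]), (-p[1], p[0])


def weierstrass_E(p, q, P, Q):
    """E = f(P, Q) - f(p, q) - (P_i - p_i) f_{p_i} - (Q_i - q_i) f_{q_i}."""
    f_p, f_q = grad_f(p, q)
    value = f(P, Q) - f(p, q)
    for i in range(2):
        value -= (P[i] - p[i]) * f_p[i] + (Q[i] - q[i]) * f_q[i]
    return value


p, q = (1, 2), (3, 4)

# Rank-one difference: (P_i - p_i, Q_i - q_i) = R_i * (3, 4) with R = (2, -1).
P1, Q1 = (1 + 6, 2 - 3), (3 + 8, 4 - 4)
print(weierstrass_E(p, q, P1, Q1))   # 0

# Rank-two difference: (P - p, Q - q) = ((1, 0), (0, -1)).
P2, Q2 = (2, 2), (3, 3)
print(weierstrass_E(p, q, P2, Q2))   # -1
```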
Since the matrix (5.2) has rank one there exists a pair of constants (a, b) ≠
(0, 0) such that

(5.3)    $a[P_i - Z_{ix}(x_0, y_0)] + b[Q_i - Z_{iy}(x_0, y_0)] = 0 \qquad (i = 1, 2, \dots, n)$.

Let α (0 ≤ α ≤ 2π) be such that $\sin\alpha = a/(a^2 + b^2)^{1/2}$ and $\cos\alpha = -b/(a^2 + b^2)^{1/2}$
and determine constants $R_i'$ (i = 1, 2, …, n) by the equations

(5.4)    $R_i' = [P_i - Z_{ix}(x_0, y_0)]\cos\alpha + [Q_i - Z_{iy}(x_0, y_0)]\sin\alpha \qquad (i = 1, 2, \dots, n)$.

It follows from (5.3) and (5.4) that

$R_i'\cos\alpha = P_i - Z_{ix}(x_0, y_0), \qquad R_i'\sin\alpha = Q_i - Z_{iy}(x_0, y_0) \qquad (i = 1, 2, \dots, n)$.

The desired inequality (5.1) now becomes

(5.5)    $E(x_0, y_0, Z_0, Z_{0x}, Z_{0y}, Z_{0x} + R'\cos\alpha, Z_{0y} + R'\sin\alpha) \ge 0$,

where we have put $Z_{i0} = Z_i(x_0, y_0)$, $Z_{i0x} = Z_{ix}(x_0, y_0)$, $Z_{i0y} = Z_{iy}(x_0, y_0)$
(i = 1, 2, …, n).

Now consider again the auxiliary minimum problem constructed in §3. The
minimizing set $R_i(\rho) \equiv 0$ ($r_1 \le \rho \le r$; i = 1, 2, …, n) must satisfy the Weierstrass
necessary condition for that problem. Hence for $\rho = r_1$

(5.6)    $g(r_1, 0, R') - g(r_1, 0, 0) - R_i'\,g_{R_i'}(r_1, 0, 0) \ge 0$.

After division by $r_1$ and the use of (3.4) and (3.3) we secure, by letting $r_1$
approach zero,

(5.7)    $\int_0^{2\pi} E[x_0, y_0, Z_0, Z_{0x}, Z_{0y}, Z_{0x} + R'\,T(\theta)\cos\theta, Z_{0y} + R'\,T(\theta)\sin\theta]\,d\theta \ge 0$.

Suppose that the inequality (5.5) is false. Then there exists an interval
$0 < \alpha_1 \le \theta \le \alpha_2 < 2\pi$ to which α is interior, such that

(5.8)    $E(x_0, y_0, Z_0, Z_{0x}, Z_{0y}, Z_{0x} + R'\cos\theta, Z_{0y} + R'\sin\theta) < 0 \qquad (\alpha_1 \le \theta \le \alpha_2)$.

(Here and in what follows the modifications which are necessary in case α = 0
or α = 2π will be obvious to the reader.) Choose a positive number ε so small
that $\alpha_1 - \varepsilon$ and $\alpha_2 + \varepsilon$ are interior to 0 ≤ θ ≤ 2π and define $T_\varepsilon(\theta)$ as follows:

(5.9)    $T_\varepsilon(\theta) = (\varepsilon + \theta - \alpha_1)/\varepsilon \qquad (\alpha_1 - \varepsilon \le \theta \le \alpha_1)$,
         $T_\varepsilon(\theta) = 1 \qquad (\alpha_1 \le \theta \le \alpha_2)$,
         $T_\varepsilon(\theta) = (\varepsilon - \theta + \alpha_2)/\varepsilon \qquad (\alpha_2 \le \theta \le \alpha_2 + \varepsilon)$,

$T_\varepsilon(\theta) = 0$ elsewhere on 0 ≤ θ ≤ 2π. For this function $T_\varepsilon(\theta)$ the integral
in (5.7) becomes

(5.10)    $\int_{\alpha_1 - \varepsilon}^{\alpha_2 + \varepsilon} E[x_0, y_0, Z_0, Z_{0x}, Z_{0y}, Z_{0x} + R'\,T_\varepsilon(\theta)\cos\theta, Z_{0y} + R'\,T_\varepsilon(\theta)\sin\theta]\,d\theta$,

and this integral may be broken up as follows:

(5.11)    $\int_{\alpha_1 - \varepsilon}^{\alpha_2 + \varepsilon} E\,d\theta = \int_{\alpha_1 - \varepsilon}^{\alpha_1} E\,d\theta + \int_{\alpha_1}^{\alpha_2} E\,d\theta + \int_{\alpha_2}^{\alpha_2 + \varepsilon} E\,d\theta$,

the arguments of E being the same as in (5.10).

The second term on the right of (5.11) is negative by virtue of (5.8) and
dominates the right side of (5.11) for sufficiently small ε, since the first and
third terms clearly approach zero with ε. For our present choice of T(θ), then,
we have a contradiction with the inequality (5.7), which must hold for all
functions T(θ) of the type described in §3. Hence the inequality (5.5) is true.


6. The Legendre condition. The necessary condition of Legendre for the
minimizing set (2.1) can be secured by well known means out of the Weierstrass
condition. We shall derive it here, however, from a consideration of the
auxiliary minimum problem.

THEOREM 3. At each point $(x_0, y_0)$ of A the inequality

(6.1)    $f_{p_i p_k} A_i A_k + 2 f_{p_i q_k} A_i B_k + f_{q_i q_k} B_i B_k \ge 0$

holds for every set (A, B) for which the matrix

(6.2)    $\|\, A_i, \ B_i \,\|$

has rank one. In the left member of the inequality (6.1) the arguments of the
derivatives of f are the set $[x_0, y_0, Z(x_0, y_0), Z_x(x_0, y_0), Z_y(x_0, y_0)]$ belonging to the
minimizing set (2.1).

As in the previous proof, determine a pair of constants (a, b) ≠ (0, 0) as solutions
of the equations

(6.3)    $aA_i + bB_i = 0 \qquad (i = 1, 2, \dots, n)$

and define α by the equations

$\sin\alpha = a/(a^2 + b^2)^{1/2}, \qquad \cos\alpha = -b/(a^2 + b^2)^{1/2} \qquad (0 \le \alpha \le 2\pi)$.

Determine constants $C_i$ (i = 1, 2, …, n) by the equations

(6.4)    $C_i = A_i\cos\alpha + B_i\sin\alpha \qquad (i = 1, 2, \dots, n)$.

Then in consequence of (6.3) and (6.4) we have

$C_i\cos\alpha = A_i, \qquad C_i\sin\alpha = B_i \qquad (i = 1, 2, \dots, n)$,

and the inequality (6.1) becomes

(6.5)    $(f_{p_i p_k}\cos^2\alpha + 2 f_{p_i q_k}\cos\alpha\sin\alpha + f_{q_i q_k}\sin^2\alpha)\,C_i C_k \ge 0$.

Suppose the inequality (6.5) is false. An argument similar to that in the
preceding section would lead to the existence of an interval $\alpha_1 \le \theta \le \alpha_2$ interior
to 0 ≤ θ ≤ 2π and to a function $T_\varepsilon(\theta)$, defined as in (5.9) for ε sufficiently small,
such that

(6.6)    $\int_0^{2\pi} (f_{p_i p_k}\cos^2\theta + 2 f_{p_i q_k}\cos\theta\sin\theta + f_{q_i q_k}\sin^2\theta)\,T_\varepsilon(\theta)^2\,C_i C_k\,d\theta < 0$,

the arguments of the derivatives of f being the set
$[x_0, y_0, Z(x_0, y_0), Z_x(x_0, y_0), Z_y(x_0, y_0)]$.

On the other hand, the Legendre condition for the auxiliary minimum problem,
which must be satisfied along the minimizing set $R_i(\rho) \equiv 0$ ($r_1 \le \rho \le r$;
i = 1, 2, …, n), requires that

$g_{R_i' R_k'}(r_1, 0, 0)\,C_i C_k \ge 0$.

If this inequality is expressed in terms of the derivatives of the function f by
means of (3.4), and the result is divided by $r_1$, the inequality which results in
the limit as $r_1$ approaches zero is found to be in contradiction with (6.6). Hence
the supposition that (6.5) is false is incorrect.


BIBLIOGRAPHY


1. A. Haar, Über die Variation der Doppelintegrale, Journal für die reine und angewandte
Mathematik, vol. 149 (1919), p. 1.

2. A. Haar, Über eine Verallgemeinerung des Du Bois-Reymondschen Lemmas, Acta
Litterarum ac Scientiarum, vol. 1 (1922), p. 33.

3. A. Huke, An historical and critical study of the fundamental lemma of the calculus of
variations, in Contributions to the Calculus of Variations, 1930, Chicago, 1931, pp.
45-160.

4. A. Haar, Zur Variationsrechnung, Abhandlungen aus dem Mathematischen Seminar des
Hamburgischen Univ., vol. 8 (1930), p. 1.

5. M. Mason, A necessary condition for an extremum of a double integral, Bulletin of the
American Mathematical Society, vol. 13 (1907), p. 293.

6. E. E. Levi, Sulla necessità della condizione di Weierstrass per l'estremo degli integrali
doppi, R. Accademia dei Lincei, Atti, vol. 24 (1915), p. 353.

7. E. J. McShane, On the necessary condition of Weierstrass in the multiple integral problem
of the calculus of variations, Annals of Mathematics, vol. 32 (1931), p. 578 and p. 723.


WAYNE UNIVERSITY.














CONCERNING APPELL SETS AND ASSOCIATED LINEAR 
FUNCTIONAL EQUATIONS 


By I. M. SHEFFER 


Introduction. We have elsewhere considered the linear difference equation
with constant coefficients¹

(1)    $\sum_{j=1}^{k} a_j\,y(x + \omega_j) = F(x)$,

from the point of view of a local solution. That is, assuming only that F(x)
is analytic about a point, we have shown that y(x) exists satisfying (1) in the
neighborhood of this point. Now (1) is a particular case of the general linear
differential equation of infinite order²

(2)    $L[y(x)] \equiv \sum_{n=0}^{\infty} a_n\,y^{(n)}(x) = F(x)$,

where if we set

(3)    $L(t) \sim \sum_{n=0}^{\infty} a_n t^n$

and call L(t) the generating function for the operator L[y], then the generating
function for (1) is

$L(t) = \sum_{j=1}^{k} a_j\,e^{\omega_j t}$.

This suggests the possibility of developing a local theory for equation (2), at
least when L(t) is suitably restricted.

The solubility of equation (2) is linked with the problem of expanding F(x)
in a series of Appell polynomials $\{P_n(x)\}$ generated by L(t); i.e., where the
sequence $\{P_n(x)\}$ is defined by

(4)    $L(t)\,e^{tx} \sim \sum_{n=0}^{\infty} P_n(x)\,t^n$.

We see this formally from the fact that

(5)    $L\!\left[\dfrac{x^n}{n!}\right] = P_n(x)$,

and therefore if F(x) has the expansion

(6)    $F(x) = \sum c_n P_n(x)$,

a formal solution of (2) is given by

(7)    $y(x) = \sum c_n \dfrac{x^n}{n!}$.

We are thus led to examine those functions L(t) whose Appell polynomials can
be used to expand the general analytic function.


Received June 1, 1937; presented to the American Mathematical Society, April, 1937.
¹ Sheffer, Transactions of the American Mathematical Society, vol. 39 (1936), pp. 345-379,
and vol. 41 (1937), pp. 153-159.
² For an investigation of equation (2) from another point of view see H. T. Davis,
American Journal of Mathematics, vol. 52 (1930), pp. 97-108.
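
Relation (5) can be verified mechanically for any finite list of coefficients. The Python sketch below is an illustration under stated assumptions: the coefficient list a is an arbitrary hypothetical choice, polynomials are stored as lists of exact coefficients in ascending powers of x, and $P_n(x)$ is computed from the coefficient of $t^n$ in (4), namely $\sum_{k=0}^{n} a_k x^{n-k}/(n-k)!$.

```python
from fractions import Fraction
from math import factorial


def appell_polynomial(a, n):
    """Coefficients (ascending powers of x) of P_n(x) = sum_k a[k] x^(n-k)/(n-k)!."""
    coeffs = [Fraction(0)] * (n + 1)
    for k in range(n + 1):
        coeffs[n - k] = Fraction(a[k]) / factorial(n - k)
    return coeffs


def derivative(coeffs):
    """Coefficients of the derivative of a polynomial."""
    return [j * c for j, c in enumerate(coeffs)][1:]


def apply_operator(a, coeffs):
    """Apply L[y] = sum_k a[k] * y^(k) to the polynomial y (finitely many terms act)."""
    result = [Fraction(0)] * len(coeffs)
    current = list(coeffs)
    for k in range(len(coeffs)):
        for j, c in enumerate(current):
            result[j] += Fraction(a[k]) * c
        current = derivative(current)
    return result


a = [1, -2, 3, 5, 7]      # hypothetical first coefficients a_0, ..., a_4 of L(t)
n = 4
monomial = [Fraction(0)] * n + [Fraction(1, factorial(n))]   # x^n / n!
print(apply_operator(a, monomial) == appell_polynomial(a, n))
```

Only the first n + 1 coefficients of L(t) enter, since derivatives of order greater than n annihilate $x^n/n!$.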

For this purpose the following classification of functions L(t) seems to be
significant:

(i) $L(t) \sim \sum a_n t^n$ has a zero radius of convergence.

(ii) $L(t) = \sum a_n t^n$ has a finite, non-zero radius of convergence.

(iii) L(t) is an entire function, not of finite exponential type.³

(iv) L(t) is of finite exponential type.

The functions of classes (i) to (iii) appear to be inadequate to expand the
general analytic function; we accordingly restrict our attention to functions of
class (iv). We shall not prove the inadequacy of classes⁴ (i) to (iii); rather, we
shall give some examples to suggest the truth of the statement.

Suppose L(t) = 1/(t − a) (a ≠ 0), so that L is of class (ii). One finds from
(4) that

$P_n(x) = -\dfrac{1}{a}\left[\dfrac{x^n}{n!} + \dfrac{x^{n-1}}{(n-1)!\,a} + \dfrac{x^{n-2}}{(n-2)!\,a^2} + \cdots + \dfrac{1}{a^n}\right]$,

or:

$P_n(x) = -\dfrac{1}{a^{n+1}}\left[1 + \dfrac{(ax)}{1!} + \cdots + \dfrac{(ax)^n}{n!}\right]$.

Now the expression in brackets can be written $e^{ax} - r_n(x)$, where

$r_n(x) = \dfrac{(ax)^{n+1}}{(n+1)!}\left[1 + \dfrac{(ax)}{n+2} + \dfrac{(ax)^2}{(n+2)(n+3)} + \cdots\right]$,

and

$|r_n(x)| \le 2\,|a|^{n+1} R^{n+1}/(n+1)! \qquad (n \ge N \text{ sufficiently large})$,

where |x| ≤ R. Again,

$\sum c_n P_n(x) = -e^{ax} \sum \dfrac{c_n}{a^{n+1}} + \sum \dfrac{c_n\,r_n(x)}{a^{n+1}}$.

³ $L(t) \equiv \sum a_n t^n$ is of finite exp. type ρ if $\limsup |n!\,a_n|^{1/n} = \rho$.
⁴ It would be of interest to have classes (i) to (iii) investigated, in order to determine
the set of functions possessing a convergent expansion in the Appell polynomials generated
by L(t).

We see, therefore, that $\sum c_n P_n(x)$ converges for at least one value of x if and
only if $\sum c_n/a^n$ converges; and if convergent for one x it is convergent for all x.
That is, when the series converges the sum is an entire function. Even so, not
all entire functions can be expanded in this manner. We are thus far from
having an expansion for the general analytic function.
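
The two forms of $P_n(x)$ for L(t) = 1/(t − a) can be compared exactly, and the approach of the bracketed sum to $e^{ax}$ can be observed numerically. The following Python sketch is an illustration only; it assumes integer values of a and x so that exact rational arithmetic applies, uses the expansion $1/(t-a) = -\sum_k t^k/a^{k+1}$ for the coefficients, and its function names are hypothetical.

```python
from fractions import Fraction
from math import exp, factorial


def P(n, x, a):
    """P_n(x) from the coefficients a_k = -1/a^(k+1) of L(t) = 1/(t - a)."""
    return sum(Fraction(-1, a ** (k + 1)) * Fraction(x) ** (n - k) / factorial(n - k)
               for k in range(n + 1))


def P_closed(n, x, a):
    """Second form: -(1/a^(n+1)) * (1 + ax/1! + ... + (ax)^n/n!)."""
    partial = sum(Fraction(a * x) ** k / factorial(k) for k in range(n + 1))
    return -partial / Fraction(a) ** (n + 1)


a, x = 2, 3
print(all(P(n, x, a) == P_closed(n, x, a) for n in range(9)))

# The bracket equals -a^(n+1) * P_n(x) and tends to exp(a*x) as n grows.
bracket = -P(30, 1, 2) * 2 ** 31
print(float(bracket), exp(2))
```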

As a second example, choose L(t) = e~*'*, which is of class (iii). The asso- 
ciated Appell set is the set of Hermite polynomials, and for these it is known 
that the expansion converges in a horizontal strip of which the real axis is the 
bisector. Here again the general analytic function possesses no (convergent) 
expansion. 

The generating functions L(t) of class (iv) come closer to our aim; and it is 
the Appell sets generated by this class that we study in the present paper. 
In Part I we determine the character of the regions of convergence of Appell 
expansions. They are bounded by simple closed convex curves, and their 
position depends on the singularities of a function related to L(t). In Part II
we obtain a solution of equation (2); not, indeed, for all analytic functions, but 
rather for all functions that have a radius of convergence exceeding a suitably 
chosen number (depending on L(t)). This then permits us to find a class of 
functions possessing a convergent Appell expansion. 


Part I. Convergence regions for series of Appell polynomials of class (iv) 
Let 


(1.1)  $L(t) = \sum_{n=0}^{\infty} a_n t^n$


be an entire function of finite exponential type (see footnote in Introduction), 
and let {P,(x)} be the set of Appell polynomials generated by L: 


(1.2)  $e^{tx} L(t) = \sum_{n=0}^{\infty} P_n(x)\, t^n.$


Definition. Let $h(t) = \sum_{n=0}^{\infty} c_n t^n$ be of exp. type $\sigma < \infty$. Then the series
$H(t) = \sum_{n=0}^{\infty} n!\, c_n t^n$ has a radius of convergence $1/\sigma$, and we say that $h(t)$ is the
Borel entire function associated with $H(t)$. We shall denote this relation by
$h(t) = \mathrm{BEF}\{H(t)\}$.

LEMMA 1.1. For $n = 0, 1, 2, \ldots$,

(1.3)  $t^n e^{tx} = \mathrm{BEF}\left\{\frac{n!\, t^n}{(1 - tx)^{n+1}}\right\}.$

The proof is straightforward, and need not be set down here.
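
The omitted verification of (1.3) can be sketched in one line, using only the Definition (the coefficient of $t^m$ is multiplied by $m!$) and the binomial series. The summation index $k$ is introduced here purely for illustration.

```latex
t^n e^{tx} = \sum_{k=0}^{\infty} \frac{x^k}{k!}\, t^{n+k}
\quad\longmapsto\quad
\sum_{k=0}^{\infty} \frac{(n+k)!}{k!}\, x^k t^{n+k}
= n!\, t^n \sum_{k=0}^{\infty} \binom{n+k}{k} (tx)^k
= \frac{n!\, t^n}{(1 - tx)^{n+1}}, \qquad |tx| < 1 .
```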


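Now $e^{tx} L(t) = \sum_{n=0}^{\infty} a_n (e^{tx} t^n)$, so that formally we have

(1.4)  $\sum_{n=0}^{\infty} P_n(x)\, t^n = \mathrm{BEF}\left\{\sum_{n=0}^{\infty} \frac{n!\, a_n t^n}{(1 - tx)^{n+1}}\right\}.$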











596 I. M. SHEFFER 


Relation (1.4) is however more than formal; it is valid for $x$ in any bounded
region $R$ and $t$ sufficiently near to the origin (how near depending on $R$). For
the expression in braces is uniformly convergent in $x$ and $t$ for $x$ in $R$ and $|t|$
sufficiently small, and may therefore be expanded in a convergent power series
in $t$ by Weierstrass' theorem. On doing so one finds for the coefficient of $t^n$
the expression

    $a_0 x^n + n a_1 x^{n-1} + n(n-1) a_2 x^{n-2} + \cdots + n!\, a_n,$

so that (1.4) holds, since from (1.2) we obtain

(1.5)  $P_n(x) = a_0 \frac{x^n}{n!} + a_1 \frac{x^{n-1}}{(n-1)!} + \cdots + a_n.$
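
Formula (1.5) translates directly into a small computation. The sketch below builds the coefficient lists of $P_0, \ldots, P_N$ from given $a_0, \ldots, a_N$ with exact rational arithmetic and checks the standard Appell property $P_n' = P_{n-1}$. The function names and the sample coefficients are hypothetical, chosen only to make the example runnable.

```python
from fractions import Fraction
from math import factorial

def appell_polys(a, N):
    """Coefficient lists (lowest degree first) of P_0..P_N, using (1.5):
    P_n(x) = sum_k a_k x^(n-k) / (n-k)!."""
    polys = []
    for n in range(N + 1):
        coeffs = [Fraction(0)] * (n + 1)
        for k in range(n + 1):
            coeffs[n - k] = Fraction(a[k]) / factorial(n - k)
        polys.append(coeffs)
    return polys

def derivative(p):
    """Derivative of a polynomial given as a coefficient list."""
    return [j * c for j, c in enumerate(p)][1:]

a = [1, 2, Fraction(1, 2), 0, -3, 5]   # hypothetical sample values of a_0..a_5
P = appell_polys(a, 5)
```

Exact fractions are used so that the identity $P_n' = P_{n-1}$ can be tested by plain equality rather than with a floating-point tolerance.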
If in (1.4) we replace $t$ by $1/t$ and divide through by $t$, we obtain

THEOREM 1.1. Let $L(t)$ of (1.1) be of exponential type $\sigma < \infty$. Then

(1.6)  $\sum_{n=0}^{\infty} \frac{n!\, a_n}{(t - x)^{n+1}} = \sum_{n=0}^{\infty} \frac{n!\, P_n(x)}{t^{n+1}}$

whenever both series converge. The right-hand series converges uniformly in $x$ and $t$
for $x$ in any bounded region $R$ and $t$ in $|t| \ge \rho > \sigma + d$, where $d$ is the maximum
value (or least upper bound) of $|x|$ in $R$. The left-hand series converges uniformly
for $x$ and $t$ such that $|t - x| \ge r > \sigma$. For $x$ in any bounded region, (1.6) holds
for all $|t|$ sufficiently large.

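Consider now the function $L^*(t)$ of which the BEF is $L(t)$:

(1.7)  $L^*(t) = \sum_{n=0}^{\infty} n!\, a_n t^n.$

Then

(1.8)  $\frac{1}{t}\, L^*\!\left(\frac{1}{t}\right) = \sum_{n=0}^{\infty} \frac{n!\, a_n}{t^{n+1}},$

(1.9)  $\frac{1}{t - x}\, L^*\!\left(\frac{1}{t - x}\right) = \sum_{n=0}^{\infty} \frac{n!\, P_n(x)}{t^{n+1}},$

valid for $x$ in any bounded region and $|t|$ sufficiently large.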

Equation (1.9) is an important relation, for by means of it we can acquire
information concerning the behavior of $|n!\, P_n(x)|^{1/n}$ for large $n$. This is a
consequence of the simple observation that the circle of convergence for the
right-hand member of (1.9) is determined by the most distant singularity, of
the left-hand member, from the origin.

Define $A(u)$ by

(1.10)  $A(u) = u L^*(u),$










and suppose for the moment that $A(u)$ is single-valued throughout its domain
of existence. Let $G = \{\alpha\}$ be the set of all the singularities (including $\infty$ if
necessary) of $A(u)$, and $E = \{\beta\}$ be the set of points defined by

(1.11)  $\beta = -1/\alpha$

as $\alpha$ runs through $G$. To each $\alpha$ (in $G$) corresponds a singularity $t = x + 1/\alpha =
x - \beta$ of (1.9), and conversely. If then we define $D(x)$ by⁶

(1.12)  $D(x) = \max_{\alpha} |x + 1/\alpha| = \max_{\beta} |x - \beta|,$

we have

THEOREM 1.2. For every $x$,

(1.13)  $\limsup_{n \to \infty} |n!\, P_n(x)|^{1/n} = D(x).$
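
Theorem 1.2 can be tried out on a simple case. The sketch below takes $L(t) = \cosh t$ (an assumed example, not one treated in the text), for which $A(u) = u/(1 - u^2)$ has the singularities $\alpha = \pm 1$, so that $E = \{-1, 1\}$ and $D(x) = \max(|x - 1|, |x + 1|)$. It then compares $|n!\, P_n(x)|^{1/n}$, computed from (1.5), with $D(x)$. The function names and the test point are hypothetical.

```python
from math import comb

def n_fact_P(n, x):
    """n! P_n(x) for L(t) = cosh t, from (1.5) with a_k = 1/k! (k even), 0 (k odd)."""
    return sum(comb(n, k) * x**(n - k) for k in range(0, n + 1, 2))

def D(x, E=(-1.0, 1.0)):
    """D(x) = max |x - beta| over the finite set E, as in (1.12)."""
    return max(abs(x - b) for b in E)

x = 0.3 + 0.4j                                   # hypothetical test point
roots = [abs(n_fact_P(n, x)) ** (1.0 / n) for n in (50, 100, 200)]
```

The successive values in `roots` increase slowly toward `D(x)`; the slow approach comes from the factor $2^{-1/n}$ in $n!\, P_n(x) = \tfrac{1}{2}[(x+1)^n + (x-1)^n]$.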
This theorem is also true when $A(u)$ is a branch of a multiple-valued function,
but in this case it is necessary to make precise those singularities of the
complete analytic function "$A(u)$" (as we shall denote it) that are to be regarded
as singularities of the particular branch $A(u)$ given by

(1.14)  $A(u) = \sum_{n=0}^{\infty} n!\, a_n u^{n+1}.$

From (1.4) we have

(1.15)  $A\!\left(\frac{t}{1 - tx}\right) = \sum_{n=0}^{\infty} n!\, P_n(x)\, t^{n+1},$

valid for $x$ in any bounded region on taking $|t|$ sufficiently small. Let $x =
\delta e^{i\theta} = \omega + i\eta$ be fixed, and consider the transformation

(1.16)  $u = \frac{t}{1 - tx}, \qquad t = \frac{u}{1 + ux}.$

When $t$ describes the circle $|t| = r$, $u$ describes a circle $C_r$ with the following
radius and center:

    $R = r/|r^2\delta^2 - 1|; \qquad \left(-\omega r^2/(r^2\delta^2 - 1),\ \eta r^2/(r^2\delta^2 - 1)\right).$

Let $B$ be the point $-\omega + i\eta$, and let $l$ be the ray issuing from $O$ (the origin)
through $B$, and $l'$ the ray in the opposite direction. Three cases are to be
distinguished:

(i) $r < 1/\delta$: the center of $C_r$ is on ray $l'$.

(ii) $r > 1/\delta$: the center of $C_r$ is on ray $l$.

(iii) $r = 1/\delta$: $C_r$ is a straight line (hereafter denoted by $L_x$) cutting $l$ at right
angles at a distance $1/(2\delta)$ from $O$. The distance from the center of $C_r$ to $O$ is
$r^2\delta/|r^2\delta^2 - 1|$.
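
The radius and center of $C_r$ stated above can be confirmed numerically. The following sketch maps sample points of $|t| = r$ through $u = t/(1 - tx)$ and checks that they lie on the stated circle, once for $r < 1/\delta$ and once for $r > 1/\delta$. The function name and the sample value of $x$ are assumptions for illustration; the line case $r = 1/\delta$ is deliberately excluded because the formulas degenerate there.

```python
import cmath
import math

def circle_image(x, r):
    """Center and radius of C_r, the image of |t| = r under u = t / (1 - t x)."""
    delta2 = abs(x) ** 2
    denom = r * r * delta2 - 1.0          # vanishes when r = 1/delta (line case)
    center = complex(-x.real, x.imag) * r * r / denom
    radius = r / abs(denom)
    return center, radius

x = 1.5 - 0.5j                            # hypothetical fixed x, 1/delta is about 0.632
for r in (0.2, 2.0):                      # one radius below 1/delta, one above
    center, radius = circle_image(x, r)
    for k in range(12):
        t = r * cmath.exp(2j * math.pi * k / 12)
        u = t / (1 - t * x)
        assert abs(abs(u - center) - radius) < 1e-9
```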


⁶ Point $\alpha$ runs over $G$ and $\beta$ over $E$. The set $E$ is clearly bounded and closed, so that
$\max |x - \beta|$ exists.










Let $I_r$ denote the region in the $u$-plane into which $|t| < r$ maps under the
transformation (1.16). It is then found that, according as we have cases (i),
(ii) or (iii), $I_r$ is respectively the interior of $C_r$, the exterior of $C_r$, the half-plane
(determined by $L_x$) that contains ray $l'$. In every case the origin ($O$) is in $I_r$.

In case (i), $R$ regarded as a function of $r$ is an increasing function, varying
from $0$ at $r = 0$ to $\infty$ as $r \to 1/\delta$. On the other hand, for case (ii), the radius
decreases to $0$ as $r \to \infty$, the limiting center being the point $H$: $(-\omega + i\eta)/\delta^2$
(on $l$).

Let $r$ increase from $0$ to $\infty$. From $0$ to $1/\delta$ the transformed curves are circles,
$C_{r_1}$ inside $C_{r_2}$ for $r_1 < r_2$, all lying on the origin side of $L_x$, and filling out this
half-plane. For $r = 1/\delta$ we get the line $L_x$. Then for $r$ from $1/\delta$ to $\infty$ the
circles form a decreasing set, $C_{r_1}$ surrounding $C_{r_2}$ for $r_1 < r_2$, all lying on the
other side of $L_x$, and shrinking down to the point $H$ as $r \to \infty$; and they fill
out this other half-plane.

Now consider the complete analytic function "$A(u)$". For the given fixed $x$
there is a smallest $r = r_1$ $(> 0)$ such that $C_{r_1}$ passes through a singularity⁷
"$\alpha$" of "$A(u)$" (i.e., at least one branch of "$A$" is singular at "$\alpha$"). We are
to decide whether or not "$\alpha$" is to be regarded as a singularity of $A(u)$ relative⁸
to the fixed $x$. What we mean by this phrase will appear shortly. We consider
three cases.

(i) $r_1 < 1/\delta$. It is clear that series (1.14) can be continued analytically
throughout $I_{r_1}$, so that branch $A(u)$ is single-valued and analytic in $I_{r_1}$. If
it is possible to continue this branch beyond $C_{r_1}$ across "$\alpha$"; i.e., if there is a
circle containing "$\alpha$" and therefore overlapping with $I_{r_1}$, such that series (1.14)
can be continued from $I_{r_1}$ into this circle; then "$\alpha$" is not to be regarded as a
singularity of $A(u)$ (relative to $x$). We shall say that the singularity "$\alpha$" of
"$A$" is passed over. If there is at least one singularity "$\alpha$" on $C_{r_1}$ that cannot
be passed over, such a point is a singularity of $A(u)$ (relative to $x$), and the
radius of convergence of (1.15) is then precisely $r_1$. If all singularities "$\alpha$"
on $C_{r_1}$ are passed over, there will nevertheless exist a smallest value $r = r_2$ $(> 0)$
such that on $C_{r_2}$ there is at least one "$\alpha$" that cannot be passed over. Such an
"$\alpha$" is a singularity of the branch $A(u)$ (relative to $x$), and series (1.15) has the
radius of convergence $r_2$.

(ii) $r_1 > 1/\delta$. Here series (1.14) can be continued from $O$ throughout the
whole exterior of $C_{r_1}$ (including $\infty$), and in this region $I_{r_1}$ this branch is single-valued
and analytic. The argument now follows the lines of case (i).

(iii) $r_1 = 1/\delta$. $C_{r_1}$ is now the line $L_x$. If "$\alpha$" is a finite singularity (of
"$A$") on $L_x$, the method of case (i) tells us whether or not "$\alpha$" is a singularity
of branch $A(u)$ (relative to $x$). If there is no finite "$\alpha$" on $L_x$, or if every such
finite "$\alpha$" is passed over, there is yet the possibility that "$\alpha$" $= \infty$. (If $\infty$ is a


⁷ We can ignore those singularities "$\alpha$" of "$A$" that lie interior to the circle of convergence
of (1.14), since these cannot be singularities of $A(u)$. Hence $r_1 > 0$.

⁸ We shall presently see that if "$\alpha$" is regarded as a singularity relative to one $x$ then it
can be regarded as a singular point relative to all $x$; i.e., it is independent of $x$.
















singularity of "$A$" we consider it to lie on $L_x$.) It may not be possible to
determine directly if branch $A(u)$ can be continued across "$\infty$". We then
proceed as follows:

If there exists an $r > 1/\delta$ such that $A(u)$ is single-valued and analytic in $I_r$
(and therefore at $\infty$), then "$\infty$" is passed over. But if for every $r > 1/\delta$,
$A(u)$ does not remain single-valued and analytic in $I_r$, then $\infty$ is a singularity
of $A(u)$ (relative to $x$).

To sum up: For each fixed $x$ there is as described above a smallest positive
number $r = r_x$ such that the branch $A(u)$ given by (1.14) can be continued
analytically (and single-valuedly) throughout $I_{r_x}$ but not throughout $I_r$ for any
$r > r_x$. On $C_{r_x}$ there is at least one singularity "$\alpha$" of "$A$" that is a singularity
of $A(u)$ (relative to $x$). The radius of convergence of series (1.15) is $r_x$.

Let $G_x$ be the set of those singularities of "$A$" on $C_{r_x}$ that are not passed over;
i.e., that are to be regarded as singularities of $A(u)$ (rel. to $x$). And let $H_x$
be the set of those "$\alpha$'s" lying in and on $C_{r_x}$ that are passed over, and $E_x$ the
set related to $G_x$ by the transformation (1.11).

As $x$ varies these sets can vary in their membership. Denote by $G$ and $E$
the respective logical sums of all the sets $G_x$ and all the sets $E_x$. It is then seen
that Theorem 1.2 is valid with this choice of $G$ and $E$ provided we show that a
point "$\alpha$" which is a singularity of $A(u)$ relative to an $x = x_1$ can never be a
passed-over point relative to some other $x = x_2$; i.e., that the logical product
$G_{x_1} \cdot H_{x_2}$ is⁹ null for all $x_1$ and $x_2$.

Suppose it were possible to have an "$\alpha$" so that $G_{x_1} \cdot H_{x_2} \neq 0$, and let $C^{(1)}$,
$C^{(2)}$ be the circles corresponding to $x_1$ and $x_2$ in the description above. Then
"$\alpha$" is interior to $I^{(2)}$ but is on $C^{(1)}$. The closed segment joining $O$ to "$\alpha$" lies
wholly in $I^{(2)}$ and (except for "$\alpha$") wholly in $I^{(1)}$. If we continue $A(u)$ (as
given by (1.14)) from $O$ along this segment, we can pass over "$\alpha$" (relative
to $x_2$). But this same continuation obviously applies relative to $x_1$, so that
"$\alpha$" is passed over relative to $x_1$. Hence "$\alpha$" is not in $G_{x_1}$, and this contradiction
proves that (1.13) remains valid.

We shall call the locus $D(x) = c$ a level curve for the sequence $\{P_n(x)\}$. Concerning
these level curves (to be denoted by $J_c$) we have the following properties.

(i) $D(x)$ is a continuous function of $x$ (for all finite $x$).

This follows from the fact that $D(x)$ is a distance function, the set $E$ being
bounded and closed. Now $D(x) \to \infty$ as $|x| \to \infty$; hence $D(x)$ has a minimum
value¹⁰ $D_m$. We shall see presently that the value $D_m$ is taken on at only one
point.


⁹ This amounts to saying that the set of singularities of $A(u)$ relative to a given $x$ is
independent of $x$ in the sense that no two $x$'s will yield contradictory information. According
to our definition of $G$, we can regard $G$ as the absolute set of singularities of the
branch $A(u)$ given by (1.14). This is so even though there may be singularities "$\alpha$" of
"$A$" which never present themselves for test (by our above method) as to their passed-over
or singular character relative to an $x$.

¹⁰ If $E$ consists of a single point, $D_m = 0$. In all other cases, $D_m > 0$.













From what has already been said, $J_c$ $(c \ge D_m)$ is a bounded, closed set.

(ii) For $c > D_m$, $J_c$ consists of more than one point.

If not, suppose $J_c$ is the point $x_0$. Let $x_1$ be a point on $J_{D_m}$, and $x_2$ on $J_a$
where $a > c$. (Clearly, $J_a$ is not null for every $a \ge D_m$.) Then an arc joining
$x_1$ to $x_2$, and not passing through $x_0$, must contain at least one point of $J_c$.
This provides a contradiction.

(iii) For $c > D_m$, $J_c$ has no isolated points.

If otherwise, let $x_0$ on $J_c$ be isolated. Then there is a neighborhood of $x_0$
(e.g., a sufficiently small circle $K$) such that for no point $x \neq x_0$ in $K$ is $D(x) = c$.
Therefore either $D(x) < c$ for all $x \neq x_0$ in $K$ or $D(x) > c$ for all $x \neq x_0$ in $K$.
Suppose $D(x) < c$. Let $\beta_0$ in $E$ be such that $d(x_0, \beta_0) = |x_0 - \beta_0| = D(x_0)$.
If the segment joining $\beta_0$ to $x_0$ be extended beyond $x_0$, then for $x$ on this extension
(and in $K$) we have $D(x) \ge d(x, \beta_0) > D(x_0) = c$; a contradiction.

Now assume it possible to have $D(x) > c$. By (ii) there is a second point
$x_1$ on $J_c$, and we assume $x_1$ to be chosen the closest of all points of $J_c - x_0$
to $x_0$. ($J_c - x_0$ is a closed set.) Let $C$, $C'$ be circles of radius $c$ and centers
$x_0$, $x_1$. All points of $E$ lie in or on $C$ and in or on $C'$, and therefore in the
closed zone $Z$ common to $C$ and $C'$. But for every $x$ on the open segment
joining $x_0$ to $x_1$, the circle $C_x$, with center at $x$ and passing through the intersections
of $C$ and $C'$, contains all of $Z$ in or on it; and its radius is easily found
to be less than $c$, so that $D(x) < c$. Since $x$ can be chosen to lie in $K$, we
thereby arrive at a contradiction.

(iv) If $x_1$, $x_2$ are any points on $J_c$ $(c \ge D_m)$, then every point $x$ of the (open)
segment joining $x_1$ to $x_2$ satisfies the relation $D(x) < c$.

The proof follows from the latter part of the proof of (iii). Since $c = D_m$
is not excluded we arrive at the result:

(v) The locus $J_{D_m}$ consists of a single point (denoted by $x^*$).

This is important when we come to consider expansions in $\{P_n(x)\}$-series,
for it tells us that all regions of convergence of such series have the unique
point $x^*$ in common. Another corollary of (iv) is

(vi) The locus $J_c$ cannot contain three or more collinear points.

Let $l_\theta$ be any ray (i.e., a half-line) issuing from $x^*$ at the angle $\theta$, $0 < \theta \le 2\pi$.

(vii) The ray $l_\theta$ meets each locus $J_c$ $(c > D_m)$ once and only once.

First we observe that since $D(x) \to \infty$ as $|x| \to \infty$, every locus $J_c$ $(c > D_m)$
is met at least once by $l_\theta$. Suppose $l_\theta$ meets $J_c$ in two or more points, and
let $x_1$, $x_2$ be two such. By (iv), if $x_3$ lies on $l_\theta$ between $x_1$ and $x_2$, then $D(x_3) =
c' < c$. Then by continuity there is an $x_4$ between $x^*$ and $x_1$ (supposing $x_1$
closer to $x^*$ than is $x_2$) for which $D(x_4) = c'$; so that $x_1$, lying between $x_4$ and $x_3$,
must satisfy the relation $D(x_1) < c' < c$, a contradiction.

Consider the locus $J_c$, $c > D_m$. If we introduce a polar coördinate system
$(r, \theta)$, using $x^*$ as the pole $(r = 0)$, we have shown that for each $\theta$ there is one
and only one $r$. That is, if we write the locus $J_c$ as $r = F_c(\theta)$, then $F_c(\theta)$ is a
single-valued function of $\theta$, $0 \le \theta \le 2\pi$. We can however say more:

(viii) The function $r = F_c(\theta)$ is continuous for all $\theta$.













For suppose $\theta = \theta_0$ is a point of discontinuity, and let $r_0 = F_c(\theta_0)$. Since
$F_c(\theta)$ is a bounded function, there exists a sequence $\{\theta_n\} \to \theta_0$ such that
$\{r_n = F_c(\theta_n)\}$ has a limit $r_0' \neq r_0$. Now $x_n = (r_n, \theta_n)$ is a point of $J_c$. Hence
by continuity of $D(x)$, $\lim (r_n, \theta_n) = (r_0', \theta_0) = x_0'$ is a point of $J_c$. Now $x_0'$
is on ray $l_{\theta_0}$, and $x_0' \neq x_0$. We thus contradict (vii), so the assumed discontinuity
cannot exist.

Combining these results we can state the 

THEOREM 1.3. The locus $J_{D_m}$ is a single point $x^*$. For every $c > D_m$, $J_c$ is a
simple closed convex curve, containing in its interior the point $x^*$ and the locus $J_{c'}$
for every $c' < c$.

If the set of singularities of $A(u)$ is finite in number,¹¹ the loci $J_c$ consist of
circular arcs, each $J_c$ being made up of a finite number of such arcs, of radius $c$.
There can be arcs of circles even in more complicated cases, but if $A(u)$ has,
for example, curves of singularities or regions of singularities, then the loci
may not have this simple character.
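
For a finite set $E$ the function $D(x)$ is simply the largest of finitely many distances, and the convexity behind property (iv) and Theorem 1.3 can be spot-checked numerically. The sketch below draws random chords and verifies that no point of a chord lies on a higher level curve than both endpoints. The particular set `E`, the sampling box and the random seed are hypothetical choices made for illustration.

```python
import random

def D(x, E):
    """D(x) = max over beta in E of |x - beta|, as in (1.12) with finite E."""
    return max(abs(x - b) for b in E)

E = [1 + 0j, -1 + 0.5j, -0.5 - 1j]          # hypothetical finite set of points beta
random.seed(0)
for _ in range(1000):
    x1 = complex(random.uniform(-3, 3), random.uniform(-3, 3))
    x2 = complex(random.uniform(-3, 3), random.uniform(-3, 3))
    s = random.uniform(0.0, 1.0)
    x = s * x1 + (1 - s) * x2               # a point of the chord from x2 to x1
    # D is a maximum of convex functions, hence convex along every chord
    assert D(x, E) <= max(D(x1, E), D(x2, E)) + 1e-12
```

This is only a sanity check of the convexity of $D$, not a proof of the strict inequality asserted in (iv).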

From Theorem 1.2 we immediately have 

THEOREM 1.4. Let $\limsup |c_n/n!|^{1/n} = \rho < 1/D_m$ and set $c = 1/\rho$. The
series

(1.17)  $F(x) = \sum_{n=0}^{\infty} c_n P_n(x)$

converges absolutely for every $x$ interior to $J_c$, and converges uniformly in every
closed region interior to $J_c$. The sum function $F(x)$ is analytic within $J_c$.

One naturally inquires if there can exist points of convergence beyond $J_c$.
This is sometimes the case, and was indeed encountered in the treatment of
equation (1) of the Introduction.¹²

Before going on to the solution of the linear differential equation (2) of the
Introduction, we wish to examine the exponential type of the function $e^{tx} L(t)$.
Let $L(t)$ be of exp. type $\rho$ $(< \infty)$, and let $e^{tx} L(t)$ be of exp. type $\tau = \tau(x)$. $e^{tx}$
is itself of exp. type $|x|$. We have

LEMMA 1.2. For all $x$,

(1.18)  $\tau(x) = D(x)$.

For the radius of convergence of a series $\sum b_n t^n$ is the reciprocal of the exp.
type of its BEF, $\sum b_n t^n/n!$. Now $e^{tx} L(t) = \mathrm{BEF}\{\sum n!\, P_n(x)\, t^n\}$, so that from
(1.13), $\tau(x) = D(x)$.

From this follows 

LEMMA 1.3. For all $x$,

(1.19)  $|\rho - |x|| \le \tau = D(x) \le \rho + |x|$.

For, the radius of convergence of $A(t) = \sum_{n=0}^{\infty} n!\, a_n t^{n+1}$ is $1/\rho$, so that the $\alpha$
point nearest the origin satisfies the relation $|\alpha| = 1/\rho$; therefore the $\beta$ point


¹¹ This was the case in our Transactions paper (already cited).
¹² Loc. cit., Transactions, vol. 39 (1936), p. 355, footnote.













farthest from the origin is such that $|\beta| = \rho$. Since $D(x) = \max |x - \beta|$,
we obtain $\tau(x) = D(x) \le |x| + \rho$. Again, the point $\beta$ nearest to $x$ satisfies
the condition $|x - \beta| \ge |\rho - |x||$, as a diagram readily shows. Hence
$\tau \ge |\beta - x| \ge |\rho - |x||$.

Both limits in (1.19) can be attained. Consider, for example, $L(t) = e^{\rho t}$.
Then $\tau(\rho) = 2\rho = \rho + |x|$, and $\tau(-\rho) = 0 = \rho - |x|$.


Part II. The associated linear functional equation: a semi-local solution 


Let

(2.1)  $L(t) = \sum_{n=0}^{\infty} a_n t^n$

again be of finite exp. type $\rho$, so that

(2.2)  $\limsup |n!\, a_n|^{1/n} = \rho < \infty.$

We consider the linear differential equation of infinite order generated by $L$:

(2.3)  $L[y(x)] = \sum_{n=0}^{\infty} a_n y^{(n)}(x) = F(x).$


It is no restriction to suppose¹³ that $a_0 \neq 0$. Then $K(t) = 1/L(t)$ exists as a
convergent power series in $t$, and if we define $\{Q_n(x)\}$ as the Appell set for $K(t)$,
we have

(2.4)  $K\!\left[\frac{x^n}{n!}\right] = Q_n(x),$

or, what is equivalent (as is readily established),

(2.5)  $L[Q_n(x)] = \frac{x^n}{n!}.$

From the relation between Appell set and generating function (see (4) of Introduction)
we also have

(2.6)  $K(t)\, e^{tx} = \sum_{n=0}^{\infty} Q_n(x)\, t^n,$


so that if $C$ is a contour around $t = 0$, lying within the circle of convergence of
$K(t)$, then

(2.7)  $Q_n(x) = \frac{1}{2\pi i} \int_C \frac{e^{tx}}{L(t)\, t^{n+1}}\, dt.$
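
Relation (2.5) is easy to verify on a small case. The sketch below assumes the simple generating function $L(t) = 1 + t$ (so that $L[y] = y + y'$ and $K(t) = 1/(1 + t) = \sum (-1)^k t^k$), builds $Q_n$ from the power series of $K$, and checks $L[Q_n(x)] = x^n/n!$ exactly. The choice of $L$ and the helper names are illustrative assumptions, not taken from the text.

```python
from fractions import Fraction
from math import factorial

def Q(n):
    """Coefficients (lowest degree first) of Q_n for K(t) = 1/(1 + t) = sum (-1)^k t^k."""
    return [Fraction((-1) ** (n - j), factorial(j)) for j in range(n + 1)]

def apply_L(p):
    """L[y] = y + y' for the assumed L(t) = 1 + t, acting on a coefficient list."""
    dp = [j * c for j, c in enumerate(p)][1:] + [Fraction(0)]
    return [c + d for c, d in zip(p, dp)]

for n in range(8):
    expected = [Fraction(0)] * n + [Fraction(1, factorial(n))]
    assert apply_L(Q(n)) == expected      # relation (2.5): L[Q_n(x)] = x^n / n!
```

Here $Q_n$ corresponds to a small contour $C$ in (2.7), one that does not enclose the pole of $K$ at $t = -1$.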


The set $\{Q_n\}$ is less suitable for expansion purposes than is $\{P_n\}$ of Part I,
since the generating function for $\{Q_n\}$ is only of class (ii). But by altering

¹³ For suppose $a_i = 0$, $i = 0, 1, \ldots, p - 1$, $a_p \neq 0$, and let $L_1(t) = L(t)/t^p =
a_p + a_{p+1} t + \cdots$. If we can solve $L_1[y(x)] = G(x)$, then on differentiating $p$ times we
get $L[y(x)] = G^{(p)}(x)$, so that (2.3) is solved on choosing $G$ to satisfy $G^{(p)}(x) = F(x)$.



















the contour $C$ we are led to a sequence of functions much more serviceable
than $\{Q_n\}$. The method used is due initially to Hurwitz¹⁴ (who was concerned
with the simple difference equation $\Delta y(x) = F(x)$, $F(x)$ entire).

Since $K(t) = 1/L(t)$ is a meromorphic function the right-hand member of
(2.7) will define a function no matter how distant the contour $C$ is from the
origin, provided that $C$ does not pass through any singularity of $K(t)$. Choose
such a contour $C = C_n$, of radius $r_n$, center at the origin, and denote the
corresponding function by $Q_{n, r_n}$:

(2.8)  $Q_{n, r_n}(x) = \frac{1}{2\pi i} \int_{C_n} \frac{e^{tx}}{L(t)\, t^{n+1}}\, dt.$

On applying operator $L$ to both sides of (2.8), and taking $L$ under the integral
sign (an operation which is easily shown to be valid), we obtain

(2.9)  $L[Q_{n, r_n}(x)] = \frac{x^n}{n!}.$


We now examine the modulus of (2.8). From (2.1) and (2.2) we see that for
every $\epsilon > 0$ there is an $N = N_\epsilon$ such that

(2.10)  $M(r) = \max_{|t| = r} |L(t)| < N e^{(\rho + \epsilon) r}.$

Again, there is the following theorem on entire functions:¹⁵

Let $f(z)$ be of finite order and let $k$ be any number exceeding one. There is a
number $H = H(k)$ such that for every $R > 0$ the closed interval $(R, kR)$ contains
at least one number $r$ for which

    $|f(z)| > [M(kr)]^{-H}$

for all $z$ on $|z| = r$.
Here again $M(r)$ is the maximum modulus of $f(z)$ on $|z| = r$.
Set $k = 1 + \delta$, $\delta > 0$, and apply the theorem to $L(t)$. It gives

    $|L(t)| > [M(r(1 + \delta))]^{-H},$

or

(2.11)  $\left|\frac{1}{L(t)}\right| < [M(r(1 + \delta))]^{H},$

where, $r'$ being any number, the number $r$ exists somewhere in $r' \le r \le r'(1 + \delta)$.
As we are going to vary our contours $C_n$, let us replace $r'$, $r$ by $r_n'$, $r_n$. We
take $r_n$ (which depends on $r_n'$) as the radius of $C_n$. Then from (2.8):

    $|Q_{n, r_n}(x)| \le r_n'^{\,-n}\, e^{r_n'(1 + \delta)|x|}\, [M(r_n'(1 + \delta)^2)]^{H};$

and from (2.10):

(2.12)  $|Q_{n, r_n}(x)| \le N^{H} r_n'^{\,-n}\, e^{r_n'(1 + \delta)[\,|x| + H(\rho + \epsilon)(1 + \delta)]}.$


¹⁴ Acta Mathematica, vol. 20 (1896-97), pp. 285-312.
¹⁵ G. Valiron, Lectures on the General Theory of Integral Functions, Toulouse, 1923, p. 89.










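Now $r_n'$ is arbitrary. Let us choose

(2.13)  $r_n' = dn,$

$d$ being independent of $n$, and make use of the asymptotic relation

    $n! \sim (2\pi n)^{1/2} \left(\frac{n}{e}\right)^{n}.$

Then

(2.14)  $n!\, |Q_{n, r_n}(x)| < N' n^{1/2}\, d^{-n}\, e^{n\{-1 + d(1 + \delta)[\,|x| + H(\rho + \epsilon)(1 + \delta)]\}}$

and

(2.15)  $\limsup |n!\, Q_{n, r_n}(x)|^{1/n} \le \frac{1}{d}\, e^{-1 + d(1 + \delta)[\,|x| + H(\rho + \epsilon)(1 + \delta)]}.$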


Consider the right-hand member as a function of $d$. It is found that there
is a minimum for $d = \{(1 + \delta)[\,|x| + H(\rho + \epsilon)(1 + \delta)]\}^{-1}$. In order to have
$d$ independent of $x$, we set $x = 0$. Furthermore, since $\epsilon > 0$ is arbitrary, it is
suggested that we choose

(2.16)  $d = \{H\rho(1 + \delta)^2\}^{-1}.$

The right-hand member of (2.15) becomes

    $H\rho(1 + \delta)^2 \cdot e^{[\,|x| + H\epsilon(1 + \delta)]/[H\rho(1 + \delta)]}.$

Now the left side of (2.15) is independent of $\epsilon$. We therefore obtain¹⁶

(2.17)  $\limsup |n!\, Q_{n, r_n}(x)|^{1/n} \le H\rho(1 + \delta)^2 \cdot e^{|x|/[H\rho(1 + \delta)]}.$

Let $\{c_n\}$ be a sequence with $\limsup |c_n|^{1/n} = \sigma$, so that

(2.18)  $\limsup |n!\, c_n Q_{n, r_n}(x)|^{1/n} \le \sigma H\rho(1 + \delta)^2 \cdot e^{|x|/[H\rho(1 + \delta)]}.$

The right-hand side will be less than one for $|x|$ sufficiently small if
$\sigma H\rho(1 + \delta)^2 < 1$. We can therefore state

THEOREM 2.1. Let $L(t)$ be of exponential type $\rho$ $(< \infty)$. There is a number
$\Lambda > 0$, depending only¹⁷ on $L(t)$, such that if

(2.19)  $\limsup |c_n|^{1/n} = \sigma < \frac{1}{\rho\Lambda},$

then series

(2.20)  $\sum_{n=0}^{\infty} n!\, c_n Q_{n, r_n}(x)$

converges absolutely and uniformly in some neighborhood of $x = 0$.


¹⁶ Since $\delta > 0$ is arbitrary it might be thought that we can let $\delta \to 0$. But $H$ depends on
$\delta$ in a manner that is not specified in the theorem (in which $H$ and $k = 1 + \delta$ first appear);
we do not therefore know what value of $\delta$ will make the right-hand member of (2.17) a
minimum.

¹⁷ $\Lambda$ can be taken as the minimum (or greatest lower bound) of $H(1 + \delta)^2$ for all $\delta > 0$.










COROLLARY. There is a number $\omega > 0$, depending¹⁸ only on $L(t)$ and $\sigma$, such
that (2.20) converges absolutely for all $|x| < \omega$, and converges uniformly in every
closed region in $|x| < \omega$.

When the exponential type $\rho$ is zero, there is a marked simplification in the
above theorem. For consider (2.15) where now we have set $\rho = 0$. Since
$\epsilon > 0$ is arbitrary, we can let $\epsilon \to 0$. This eliminates the term $H\epsilon(1 + \delta)$. Again,
since $\delta > 0$ is arbitrary, we can let $\delta \to 0$. The left-hand member is independent
of $\epsilon$ and $\delta$, so we obtain

(2.21)  $\limsup |n!\, Q_{n, r_n}(x)|^{1/n} \le \frac{1}{ed}\, e^{d|x|},$

(2.22)  $\limsup |n!\, c_n Q_{n, r_n}(x)|^{1/n} \le \frac{\sigma}{ed}\, e^{d|x|}.$

Now choose¹⁹ $d = \sigma$ (where for the moment we assume that $\sigma \neq 0$). Then

(2.23)  $\limsup |n!\, c_n Q_{n, r_n}(x)|^{1/n} \le e^{\sigma|x| - 1}.$


From this follows 

THEOREM 2.2. Let $L(t)$ be of exponential type zero. To²⁰ every $\sigma < \infty$ there is a
sequence of contours $C_n$ such that if $\limsup |c_n|^{1/n} = \sigma$ then series (2.20) converges
absolutely for all $x$ in $|x| < 1/\sigma$ and converges uniformly in every closed
region in $|x| < 1/\sigma$.

LEMMA 2.1. Let $L(t)$ be of exponential type $\rho$ $(< \infty)$. Then

(2.24)  $L[y(x)] = \sum_{n=0}^{\infty} a_n y^{(n)}(x)$

converges absolutely in $|x - x_0| < \lambda - \rho$ if $y(x)$ is analytic in the circle
$|x - x_0| < \lambda$, $(\lambda > \rho)$; and the convergence is uniform in every closed region in
$|x - x_0| < \lambda - \rho$.

To show this, let $C$ be a circle with center at $x_0$ and radius $\lambda'$, where $\rho < \lambda' < \lambda$.
For $x$ in $|x - x_0| < \lambda' - \rho$ we have

    $y^{(n)}(x) = \frac{n!}{2\pi i} \int_C \frac{y(t)\, dt}{(t - x)^{n+1}}$


¹⁸ If $\delta'$, $H'$ are the values giving rise to $\Lambda$ (see previous footnote), then $\omega$ is defined by

    $\omega = -\rho H'(1 + \delta') \log [\sigma\rho\Lambda].$

This is seen by finding for what values of $x$ the right-hand member of (2.18) is less than
unity. If we set

    $A = -\rho H'(1 + \delta'), \qquad B = A \log (\rho\Lambda),$

then we can write

    $\omega = B + A \log \sigma,$

where $A$ and $B$ depend only on $L(t)$, not on $\sigma$.

¹⁹ This means that the contours $C_n$, which serve to define the functions $Q_{n, r_n}(x)$, change
with $\sigma$, so that for different $\sigma$'s we may be dealing with different sequences $\{Q_{n, r_n}(x)\}$.

²⁰ The preceding relation $d = \sigma$ which furnishes the proof requires that $\sigma \neq 0$, but Theorem
2.2 is readily seen to hold even if $\sigma = 0$.










and therefore

    $L[y] = \frac{1}{2\pi i} \int_C y(t) \sum_{n=0}^{\infty} \frac{n!\, a_n}{(t - x)^{n+1}}\, dt,$

since the series converges uniformly (and absolutely) for $t$ on $C$ and $x$ in any
closed region in $|x - x_0| < \lambda' - \rho$. On letting $\lambda'$ increase toward $\lambda$ we obtain
the desired conclusion.

COROLLARY. If $\rho = 0$, then $L[y(x)]$ converges in the neighborhood of every
point $x_0$ at which $y(x)$ is analytic, and the convergence reaches out to the circle of
convergence of the power series of $y(x)$ about $x_0$.

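LEMMA 2.2. With the hypotheses of Lemma 2.1, if

(2.25)  $y(x) = \sum_{n=0}^{\infty} y_n (x - x_0)^n,$

the radius of convergence being $\lambda$, then

(2.26)  $L[y(x)] = \sum_{n=0}^{\infty} y_n L[(x - x_0)^n],$

the latter series converging uniformly in every closed region in $|x - x_0| < \lambda - \rho$.

Since $\limsup |y_n|^{1/n} = 1/\lambda$, to every $\epsilon > 0$ corresponds an $a = a_\epsilon$ such that

    $|y_n| < a\left(\frac{1}{\lambda} + \epsilon\right)^n;$

hence

    $y(x) \ll a \sum_{n=0}^{\infty} \left[\left(\frac{1}{\lambda} + \epsilon\right)(x - x_0)\right]^n = \frac{a}{1 - \left(\frac{1}{\lambda} + \epsilon\right)(x - x_0)}.$

Therefore on setting $u = (1/\lambda + \epsilon)\,|x - x_0|$,

(2.27)  $L[y] \ll a \sum_{n=0}^{\infty} \frac{n!\, |a_n| \left(\frac{1}{\lambda} + \epsilon\right)^n}{(1 - u)^{n+1}}.$

This is valid provided that $|u| < 1$ (a condition which is true if $x$ lies in any
closed region in $|x - x_0| < \lambda$, when $\epsilon$ is chosen sufficiently small), and that
$|1 - u| > \rho(1/\lambda + \epsilon)$. This latter condition is fulfilled if

    $|x - x_0| < \frac{\lambda - \rho - \rho\lambda\epsilon}{1 + \lambda\epsilon},$

and from the arbitrariness of $\epsilon$ we need only have $|x - x_0| < \lambda - \rho$. Absolute
and uniform convergence of (2.27) in any closed region in $|x - x_0| < \lambda - \rho$
is immediate. From the absolute convergence it follows that the double series

    $L[y] = L\left[\sum_{n=0}^{\infty} y_n (x - x_0)^n\right] = \sum_{k=0}^{\infty} a_k \left\{\sum_{n=0}^{\infty} y_n (x - x_0)^n\right\}^{(k)}$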















converges absolutely, and may therefore be summed in any way. But one
way of summing is to take $L$ under the $\sum$-sign: $\sum_{n=0}^{\infty} y_n L[(x - x_0)^n]$. This establishes
the lemma.

COROLLARY. If $\rho = 0$, then (2.26) is valid in $|x - x_0| < \lambda$.

LEMMA 2.3. Let $\{u_n(x) = \sum_{k=0}^{\infty} b_{nk} (x - x_0)^k\}$, $n = 0, 1, \ldots$, be analytic in
$|x - x_0| < \lambda$, $\lambda > \rho$, and let the double series $\sum_{n,k=0}^{\infty} |b_{nk} (x - x_0)^k|$ converge for
$|x - x_0| < \lambda$. Then

(2.28)  $L\left[\sum_{n=0}^{\infty} u_n(x)\right] = \sum_{n=0}^{\infty} L[u_n(x)],$

both series being uniformly convergent for $x$ in any closed region in $|x - x_0| <
\lambda - \rho$.
The proof is patterned after that of Lemma 2.2, and need not be given in
detail. Essentially it consists in showing that the double series

    $\sum_{n=0}^{\infty} a_n \{u_0^{(n)} + u_1^{(n)} + u_2^{(n)} + \cdots\}$

is absolutely convergent in $|x - x_0| < \lambda - \rho$ (uniformly in any closed region
therein), and may therefore be summed in any way. One such summing
yields the right-hand member of (2.28).

Now the series (2.20), with $x$ replaced by $x - x_0$, fulfills the conditions of
Lemma 2.3 when $\sigma$ is so small that $\omega > \rho$. For, (2.8) gives us the series

    $Q_{n, r_n}(x - x_0) = \sum_{k=0}^{\infty} \frac{(x - x_0)^k}{k!} \left(\frac{1}{2\pi i} \int_{C_n} \frac{t^{k-n-1}}{L(t)}\, dt\right),$

convergent for all $x$. We must therefore show that series

(a)  $\sum_{n=0}^{\infty} n!\, |c_n| \sum_{k=0}^{\infty} \frac{|x - x_0|^k}{k!} \left|\frac{1}{2\pi i} \int_{C_n} \frac{t^{k-n-1}}{L(t)}\, dt\right|$

converges for $|x - x_0| < \omega$. On using inequalities (2.10) and (2.11) in the
manner already considered, and choosing $r_n' = dn$, series (a) is found to be less
than

(b)  $N' \sum_{n=0}^{\infty} \frac{n^{1/2}\, |c_n|}{(ed)^n}\, e^{nd(1 + \delta)\{H(\rho + \epsilon)(1 + \delta) + |x - x_0|\}}.$

We can let $\epsilon \to 0$. On again choosing $d = \{H\rho(1 + \delta)^2\}^{-1}$, we see that (b)
converges provided $x$ is such that $\sigma H\rho(1 + \delta)^2 \cdot e^{|x - x_0|/[H\rho(1 + \delta)]} < 1$. Since
$\Lambda = H(1 + \delta)^2$, it now follows that (a) converges for all $x$ in $|x - x_0| < \omega$,
as was to be shown.













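This gives us the relation

    $L\left[\sum_{n=0}^{\infty} n!\, c_n Q_{n, r_n}(x - x_0)\right] = \sum_{n=0}^{\infty} n!\, c_n L[Q_{n, r_n}(x - x_0)];$

and on using (2.9) (with $x$ replaced by $x - x_0$), we have

THEOREM 2.3. Let $L(t)$ be of exponential type $\rho$ $(< \infty)$. Let $\limsup |c_n|^{1/n} = \sigma$
be so small that²¹ $\omega > \rho$. Then

(2.29)  $L\left[\sum_{n=0}^{\infty} n!\, c_n Q_{n, r_n}(x - x_0)\right] = \sum_{n=0}^{\infty} c_n (x - x_0)^n,$

the left side being convergent in $|x - x_0| < \omega - \rho$ and the right side convergent in
$|x - x_0| < 1/\sigma$.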
Otherwise stated, Theorem 2.3 gives us a semi-local existence theorem for
the equation (2.3):

THEOREM 2.4. Let $L(t)$ be of exponential type $\rho$ $(< \infty)$. If $F(x)$ is analytic
about $x = x_0$ in a circle of radius $r > 1/\sigma^*$, then equation (2.3) possesses an
analytic solution $y(x)$ in the neighborhood of $x = x_0$. Moreover, if

(2.30)  $F(x) = \sum_{n=0}^{\infty} c_n (x - x_0)^n,$

then $y(x)$ has the form

(2.31)  $y(x) = \sum_{n=0}^{\infty} n!\, c_n Q_{n, r_n}(x - x_0),$

this latter series converging in $|x - x_0| < \omega - \rho$, where $\omega$ is determined for the
value²² $\sigma = 1/r$.

We refer to this as a semi-local solution because our method does not permit
the circle of analyticity of $F(x)$ about $x = x_0$ to become smaller than $1/\sigma^*$.
There is however no reason to believe that this restriction, which results from
the method used, is inherent in the problem. The question remains open.
If $\rho = 0$ this restriction automatically disappears, so that we have

THEOREM 2.5. Let $L(t)$ be of exponential type zero. To every function $F(x)$
analytic about $x = x_0$ there is a function $y(x)$, also analytic about $x = x_0$, which
satisfies equation (2.3); and if (2.30) has the radius of convergence $r$, then (2.31)
also converges²³ for $|x - x_0| < r$.
These solutions, semi-local or local as the case may be, permit us to return
to the problem of Appell expansions. Let $F(x)$ be analytic about $x = 0$ in a


²¹ We saw that $\omega = B + A \log \sigma$ where $A$, $B$ are independent of $\sigma$ and $A < 0$. Hence
$\lim \omega = +\infty$ as $\sigma \to +0$, so that values of $\sigma$ exist for which $\omega > \rho$. Let us denote by $\sigma^*$ that
value such that $\sigma < \sigma^*$ implies $\omega > \rho$. Then

    $\sigma^* = e^{(\rho - B)/A}.$

²² In particular, if $\sigma = 0$ then $\omega = \infty$, so that equation (2.3) possesses an entire function
solution when $F(x)$ is entire.

²³ In particular, if $F(x)$ is an entire function, so is $y(x)$.













circle of radius $r > 1/\sigma^*$, so that Theorem 2.4 holds, and let $y(x) = \sum y_n x^n$
be the solution. Its radius of convergence is at least $\lambda = \omega - \rho$. Suppose
$\lambda > \rho$, so that $\omega > 2\rho$. Then by Lemma 2.2,

    $L[y(x)] = \sum y_n L[x^n] = \sum n!\, y_n P_n(x),$

where $\{P_n(x)\}$ is the Appell set generated by $L(t)$. But $L[y(x)] = F(x)$.
Accordingly we have

THEOREM 2.6. Let $L(t)$ be of exponential type $\rho$ $(< \infty)$ and $\{P_n(x)\}$ the corresponding
Appell set. If $F(x)$ is analytic about $x = 0$ in a circle of radius $r = 1/\sigma$,
where $\sigma$ is so small that $\omega > 2\rho$, then $F(x)$ possesses the Appell expansion

(2.32)  $F(x) = \sum_{n=0}^{\infty} n!\, y_n P_n(x),$

valid in the neighborhood of $x = 0$. Here $\{y_n\}$ is the set of coefficients of the solution
$y(x) = \sum y_n x^n$ of (2.3) given by Theorem 2.4.

This theorem is not as satisfactory as one might wish. It refers $F(x)$ to the
origin, whereas we know from Part I that the central point in $P_n$-expansions is
the point $x^*$. The difficulty resides in the semi-local character of the solution
of (2.3).

When $\rho = 0$ this difficulty disappears. For then $A(t)$ is an entire function.
Its only singularity is $\alpha = \infty$, so that $\beta = 0$. Hence $x^* = 0$, and the level
curves are concentric circles with center at the origin. Application of Theorem
2.5 and Lemma 2.2 gives

THEOREM 2.7. Let $L(t)$ be of exponential type zero. Then every function $F(x)$
analytic about the origin possesses a $P_n$-expansion that is valid in the same circle
as is the power series for $F(x)$.

This shows that the Appell expansions corresponding to a function $L(t)$ of
exp. type zero are markedly like ordinary power series in their convergence
properties.
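
A small worked instance may make the passage from a solution of (2.3) to an Appell expansion concrete. The sketch assumes $L(t) = 1 + t$ (exponential type zero, $L[y] = y + y'$) and $F(x) = x^3$; then $y(x) = x^3 - 3x^2 + 6x - 6$ satisfies $y + y' = x^3$, and the sum $\sum n!\, y_n P_n(x)$ should reproduce $F$. The choice of $L$, of $F$, and the helper names are illustrative assumptions.

```python
from fractions import Fraction
from math import factorial

def P(n):
    """P_n for the assumed L(t) = 1 + t, from (1.5): x^n/n! + x^(n-1)/(n-1)!."""
    c = [Fraction(0)] * (n + 1)
    c[n] = Fraction(1, factorial(n))
    if n >= 1:
        c[n - 1] = Fraction(1, factorial(n - 1))
    return c

def appell_sum(y):
    """Coefficients of sum_n n! y_n P_n(x) for a polynomial y given as [y_0, ..., y_m]."""
    total = [Fraction(0)] * len(y)
    for n, yn in enumerate(y):
        for j, c in enumerate(P(n)):
            total[j] += factorial(n) * yn * c
    return total

# y = x^3 - 3x^2 + 6x - 6 solves y + y' = x^3
y = [Fraction(v) for v in (-6, 6, -3, 1)]
F = appell_sum(y)          # coefficient list of the Appell expansion (2.32)
```

Because $F$ is a polynomial the expansion terminates, so no question of convergence arises in this toy case.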


PENNSYLVANIA STATE COLLEGE. 








UNIFORMITY PROPERTIES IN TOPOLOGICAL SPACE SATISFYING 
THE FIRST DENUMERABILITY POSTULATE 


By L. W. CoHEeN 


Theorems involving convergence, completeness and uniform continuity are 
consequences of what may be called the uniformity properties which inhere in 
metric spaces. It is perhaps of some interest to seek in spaces which are not 
metrizable similarly effective uniformity properties, in terms of which the 
theorems referred to may be reformulated.¹ It is the purpose of this note to do
this. The spaces considered are the topological spaces of Hausdorff satisfying
the first denumerability postulate, namely, that to each $p \in S$ the topology
at $p$ is determined by a sequence of neighborhoods $U_n(p)$. The uniformizing
entity is the class $[U_n]$ of all $U_n(p)$ for fixed $n$ and all $p \in S$. We make the
following definitions:

1. A sequence $p_k \in S$ is a Cauchy sequence if for each $n$ there is a $k_n$ and a
$q_n \in S$ such that $p_k \in U_n(q_n)$ if $k > k_n$.

2. A space S is complete if every Cauchy sequence has a limit. 

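3. A set $M \subset S$ is totally bounded if for each $n$ there are points
$p_{n,1}, p_{n,2}, \dots, p_{n,m_n}$ such that

$M \subset \sum_{i=1}^{m_n} U_n(p_{n,i}).$

(Here and below a sum of sets denotes their union and a product their common part.)

4. A function $f$ on $M \subset S$ to $S'$ is uniformly continuous on $M$ if for each $n$
there is an $m(n)$ such that if $p \in M$

$f[M\,U_{m(n)}(p)] \subset U'_n(f(p)).$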


It is clear that these definitions become the usual ones for metric space with 
spherical neighborhoods. 
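The metric special case can be made concrete with a small numerical sketch. Everything in it is an illustrative assumption rather than part of the note: $S$ is taken to be the real line, $U_n(p)$ the open interval of radius $1/n$ about $p$, $M$ a finite sample of $[0, 1]$, and the helper names are hypothetical.

```python
from fractions import Fraction

def in_neighborhood(q, p, n):
    """True if q lies in U_n(p), assumed here to be the open ball of radius 1/n."""
    return abs(q - p) < Fraction(1, n)

def covering_centers(n):
    """Assumed centers p_{n,1}, ..., p_{n,m_n} for a set M inside [0, 1]: the grid k/n."""
    return [Fraction(k, n) for k in range(n + 1)]

def is_covered(sample, n):
    """Check Definition 3 at level n on a finite sample of M."""
    centers = covering_centers(n)
    return all(any(in_neighborhood(q, p, n) for p in centers) for q in sample)

def is_cauchy_tail(seq, n, k_n, q_n):
    """Check Definition 1 at level n on the stored terms after list position k_n."""
    return all(in_neighborhood(p, q_n, n) for p in seq[k_n + 1:])

sample = [Fraction(i, 97) for i in range(98)]        # finite sample of [0, 1]
harmonic = [Fraction(1, k) for k in range(1, 200)]   # the sequence 1/k, with limit 0
```

With these choices each level $n$ needs only the $n + 1$ grid centers, and the sequence $1/k$ meets the Cauchy condition at level 10 with $q_{10} = 0$ once the first ten terms are discarded.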

The justification for these generalizations is to be sought in the theorems
that a set $M$ in a complete space is compact if and only if it is totally bounded,
and that if $f$ is uniformly continuous on $M \subset S$ to a complete space $S'$, then
there is a function $f^*$ on $\bar{M}$, the closure of $M$, to $S'$ identical with $f$ on $M$
and continuous on $\bar{M}$.


Received June 2, 1937; presented to the American Mathematical Society, November, 
1935, under the title Cauchy convergence in non-metric space (preliminary report), and 
April, 1937, under the title Uniform continuity in topological space. 

1 The relation between completeness and compactness is discussed by J. von Neumann, 
Annals of Math., vol. 36 (1935). Recent notes on the subject have been published by 
Garrett Birkhoff, Annals of Math., vol. 38 (1937), pp. 57-60, and by L. M. Graves, Annals 
of Math., vol. 38 (1937), pp. 61-64. 



It will appear that for certain of these results a further restriction must be 
placed on the spaces, but it does not imply metrizability. 
THEOREM 1. If $S$ is complete and $M \subset S$ is totally bounded, then $M$ is compact.

Proof. Let $p_k$ be a sequence of points of $M$. Then

$(p_k) \subset M \subset \sum_{i=1}^{m_1} U_1(q_{1,i})$

for some points $q_{1,1}, q_{1,2}, \dots, q_{1,m_1}$ in $S$. Denote by $U_1(q_1)$ one of the
$U_1(q_{1,i})$ containing a subsequence $p_k^{(1)}$ of $p_k$. Assume that $U_n(q_n)$ contains a
subsequence $p_k^{(n)}$ of $p_k$. Then

$(p_k^{(n)}) \subset M\,U_n(q_n) \subset \sum_{i=1}^{m_{n+1}} U_{n+1}(q_{n+1,i})$

for some points $q_{n+1,1}, q_{n+1,2}, \dots, q_{n+1,m_{n+1}}$. Denote by $U_{n+1}(q_{n+1})$ one of
the $U_{n+1}(q_{n+1,i})$ which contains a subsequence $p_k^{(n+1)}$ of $p_k^{(n)}$. We now have a
sequence of neighborhoods $U_n(q_n)$ and a sequence of sequences $p_k^{(n)}$ such that

$U_n(q_n) \supset (p_k^{(n)}) \supset (p_k^{(n+1)}).$

The diagonal sequence $p'_k = p_k^{(k)}$ satisfies the condition

$p'_k \in U_n(q_n), \quad k > n.$

Thus $p'_k$ is a Cauchy sequence and, $S$ being complete, it has a limit; hence $M$ is compact.

It is to be noted that the notion of Cauchy sequence is not topologically 
invariant. This is the case in metric space also. Under uniformly continuous 
mapping, however, we do have invariance. 

THEOREM 2. If $f$ is uniformly continuous on $S$ to $S'$ and $p_k$ is a Cauchy sequence
in $S$, then $f(p_k)$ is a Cauchy sequence in $S'$.

Proof. Consider $n$ and $m(n)$. Then $p_k \in U_{m(n)}(p)$ for some $p \in S$ and
$k > k_{m(n)}$. Hence $f(p_k) \in f[U_{m(n)}(p)] \subset U'_n(f(p))$ for $k > k_{m(n)}$. This being so for
all $n$, $f(p_k)$ is a Cauchy sequence.

We impose on S the 

POSTULATE. To each $p \in S$ and positive integer $n$ there is an integer $m(n, p) > 0$
such that if $q \in U_{m(n,p)}(p)$ then $p \in U_n(q)$.

THEOREM 3. If $M \subset S$ is compact, then $M$ is totally bounded.

Proof. Assume that $M$ is not totally bounded. Then for some $n$, $M$ is not
contained in the sum of a finite number of $U_n(p)$. For each $k$ and

$p_1, p_2, \dots, p_k \in M$

there is

$p_{k+1} \in M - M \sum_{i=1}^{k} U_n(p_i).$

$M$ being compact, a subsequence $p_{k_t}$ of $p_k$ has a limit $p \in S$ so that

$p_{k_t} \in U_j(p), \quad t > t_j.$

Choosing $j = m(n, p)$ and $t > t_{m(n,p)}$, we have $p \in U_n(p_{k_t})$ and

$p_k \in U_n(p_{k_t})$

for infinitely many $k$. Let $p_{k_t} = p_s$ and $r$ be the smallest integer greater than $s$
for which $p_r \in U_n(p_{k_t})$. We have

$p_r \in M - M \sum_{i=1}^{r-1} U_n(p_i) \subset M - M \sum_{i=1}^{s} U_n(p_i) \subset M - M\,U_n(p_s)$

and a contradiction.

With S subject to the postulate given above and S’ regular and complete, 
we have 

THEOREM 4. If $f$ is uniformly continuous on $M \subset S$ to $S'$, then there is a
function $f^*$ on $\bar{M}$, the closure of $M$, such that $f = f^*$ on $M$ and $f^*$ is continuous
on $\bar{M}$.

Proof. To each $p \in \bar{M}$ there is a sequence $p_k \in M$ such that $\lim p_k = p$.
Consider, for $p$ and $n$, the number $m(n, p)$ of the postulate. Since

$p_k \in U_{m(n,p)}(p)$

for $k > k_{m(n,p)}$, $p \in U_n(p_{k_n})$ for each $n$ and some $k_n$.
Since $f$ is uniformly continuous on $M$ and $p_k \in M$, we have

$f(p_k) \in f[M\,U_{m(n)}(p_{k_{m(n)}})] \subset U'_n(f(p_{k_{m(n)}})), \quad k > k_{m(n)},$

so that $f(p_k)$ is a Cauchy sequence in $S'$. $S'$ being complete, there is $p^* \in S'$
such that $\lim f(p_k) = p^*$. If $\lim q_k = p$ and $q_k \in M$, there is the sequence
$r_k \in M$ whose terms are alternately $p_k$ and $q_k$ which has $p$ as limit. Hence
$\lim f(r_k) = \lim f(q_k) = \lim f(p_k) = p^*$. Thus for each $p \in \bar{M}$ and every sequence
$p_k \in M$ with limit $p$, $\lim f(p_k) = p^*$. This defines a single-valued mapping
$f^*(p) = p^*$ on $\bar{M}$ to $S'$.

If $p \in M$, then $p = p_k \in M$ and $f^*(p) = \lim f(p_k) = f(p)$ so that $f^* = f$
for $p \in M$. If $f^*$ is not continuous on $\bar{M}$, there is a $p \in \bar{M}$ and an $n$ such
that for every $m$, $\bar{M}\,U_m(p) \ni p_m$ such that $f^*(p_m) \in S' - U'_n(f^*(p))$. We may
choose $p_m$ so that

$p_m \in \bar{M}\,U_{\nu(m)}(p) \subset \bar{M} \prod_{\mu=1}^{m} U_\mu(p).$

Since $S'$ is regular, there is a $\nu$ such that $\bar{U}'_\nu(f^*(p)) \subset U'_n(f^*(p))$. Thus for
all $m$

$f^*(p_m) \in S' - U'_n(f^*(p)) \subset S' - \bar{U}'_\nu(f^*(p)) \subset S' - U'_\nu(f^*(p)).$

Now $p_m \in \bar{M}$ so that there is a sequence $q_{m,k} \in M$ such that $\lim_k q_{m,k} = p_m$.
Hence $q_{m,k} \in \prod_{\mu=1}^{m} U_\mu(p)$ if $k > k_m$. Further $\lim_k f(q_{m,k}) = f^*(p_m)$ implies

$f(q_{m,k}) \in S' - \bar{U}'_\nu(f^*(p))$ if $k > k'_m$.

Hence for $k(m) > \max(k_m, k'_m)$ we have

$q_{m,k(m)} \in \prod_{\mu=1}^{m} U_\mu(p), \quad f(q_{m,k(m)}) \in S' - \bar{U}'_\nu(f^*(p)) \subset S' - U'_\nu(f^*(p)).$

Now $\lim_m q_{m,k(m)} = p$, $q_{m,k(m)} \in M$ and the closure of $S' - U'_\nu(f^*(p))$ yield

$\lim f(q_{m,k(m)}) = f^*(p) \in S' - U'_\nu(f^*(p)),$

which is a contradiction. Thus $f^*$ is continuous on $\bar{M}$.

We give an example of a non-metrizable topological space $S$ satisfying the
first denumerability axiom and our postulate. The points of $S$ are the points
$p(x, y)$ of the Cartesian plane. Let $S(p, r)$ be the interior of the circle with
center $p$ and radius $r$. The neighborhoods of $p_0(0, 0)$ are defined by

$U_n(p_0) = S(p_0, 1/n) - \{p\},$

where $p(x, y)$ has $x > 0$, $y = 0$. If $p(x, y)$ has $y \neq 0$, $U_n(p) = S(p, n^{-1})$. If
$p(x, 0)$ has $x > 0$, then $U_n(p) = S[p, (k + n)^{-1}]$, where $k$ is the smallest integer
such that the distance from $p$ to $p_0$ is greater than $k^{-1}$. The space fails to be
metrizable because it is not regular at $p_0$.

By adding the restriction of regularity to the space we can continue the 
theory to include Baire’s theorems on category and on the distribution of 
points of continuity of the limit function of a sequence of continuous func- 
tions. These theorems are of considerable importance in existence proofs in 
functional analysis. In the subsequent discussion it will be assumed that the 
space S is regular as well as complete in the sense defined earlier. 

THEOREM 5. If $A_n$ is a sequence of sets in $S$ such that $A_{n+1}$ is dense in $A_n$ and
$G_n$ is an open set containing $A_n$, then $\prod G_n$ is dense in $A_1$.

Proof. Consider $p_1 \in A_1$ and any $U_n(p_1)$. Since $S$ is regular, there is a
$U_{n_1}(p_1)$ such that $\bar{U}_{n_1}(p_1) \subset U_n(p_1) G_1$.
Then $U_1(p_1) U_{n_1}(p_1) A_2 \ni p_2$ and there is a neighborhood $U_{n_2}(p_2)$ such that

(2) $\bar{U}_{n_2}(p_2) \subset U_1(p_1) U_{n_1}(p_1) G_2.$

If $p_k \in A_k$ and $U_{n_k}(p_k)$ are defined, then $U_k(p_k) U_{n_k}(p_k) A_{k+1} \ni p_{k+1}$ and there is
a $U_{n_{k+1}}(p_{k+1})$ such that

(k + 1) $\bar{U}_{n_{k+1}}(p_{k+1}) \subset U_k(p_k) U_{n_k}(p_k) G_{k+1}.$

From the relations (k), we have

$p_{k+m} \in \bar{U}_{n_{k+m}}(p_{k+m}) \subset U_k(p_k), \quad m > 0.$

Thus $p_k$ is a Cauchy sequence and, $S$ being complete, $\lim p_k = p$ exists. Also
from (k) it follows that

$p_{k+m} \in \bar{U}_{n_k}(p_k) \subset G_k, \quad m \geq 0,$

so that $p \in \bar{U}_{n_k}(p_k) \subset G_k$ for all $k$. Thus $p \in \prod_k G_k\, U_n(p_1)$ and $\prod_k G_k$ is dense in $A_1$.










THEOREM 6. If $A_n$ is a sequence of $G_\delta$ sets in $S$ each dense in every other one
then $\prod_n A_n$ is dense in each $A_n$.

Proof. Each $A_n = \prod_m G_{n,m}$, where $G_{n,m}$ is open. Let

$G_1 = G_{1,1}, \; G_2 = G_{2,1}, \; \dots, \; G_k = G_{n,m}, \; \dots$

be the enumeration of the $G_{n,m}$ by diagonals. If we set $B_k = A_n$ when $G_k = G_{n,m}$,
then $G_k \supset B_k$ and $B_{k+1}$ is dense in $B_k$. Hence $\prod_k G_k = \prod_{m,n} G_{n,m} = \prod_n A_n$
is dense in $A_1$ by Theorem 5 and consequently in every $A_n$.

We have a consequence in 

THEOREM 7. If $M$ is a $G_\delta$ in $S$ and $A_n$ is a sequence of sets open in $M$ and
dense in $M$, then $\prod_n A_n$ is dense in $M$.

Proof. Each $A_n$, being open in $M$, is a $G_\delta$ set in $S$. Each $A_n$, being dense
in $M$, is dense in every other $A_n$. Hence by Theorem 6 $\prod_n A_n$ is dense in $A_1$
and also in $M$.

From this it follows that S is of the second category. In fact we have 

THEOREM 8. If $M$ is a $G_\delta$ in $S$ and $A_n$ is a sequence of subsets of $M$ each
nowhere dense in $M$, then $M - \sum A_n$ is dense in $M$.

Proof. If $F_n = M \bar{A}_n$ and $G_n = M - F_n$, then $G_n$ is open in $M$ and dense
in $M$. By Theorem 7, $\prod G_n = M - \sum F_n$ is dense in $M$, hence $M - \sum A_n \supset
M - \sum F_n$ is dense in $M$.


This leads to the theorem of Baire on the continuity properties of the limit 
function of a sequence of continuous functions. 

THEOREM 9. If $A$ is a $G_\delta$ in $S$ and $f_n(p)$ is a sequence of functions continuous
on $A$ to a metric space $R$, having the limit function $f(p)$, then the set of points of
continuity of $f(p)$ is dense in $A$.

Proof. For a given $\eta > 0$, let $A_k$ be the subset of $A$ on which

$d[f(p), f_n(p)] < \eta$ if $n > k$.

Then $A = \sum_k A_k$ since $f$ is the limit of $f_n$ on $A$. By Theorem 8, at least one
of the $A_k$ is not nowhere dense in $A$. Hence for some $k$ and some non-empty
$A^*$ open in $A$, $\bar{A}_k \supset A^*$.
Let $q$ be any point of $A^*$. Then for some $m > k$

$d[f(q), f_m(q)] < \eta,$

since $f(q) = \lim f_n(q)$. Now every $U_\nu(q)$ contains a point $p \in A_k$ and for such $p$

$d[f(p), f_n(p)] < \eta$, if $n > k$.

For the given $m$ and each $n > k$, there is a $U_\mu(q)$ such that

$d[f_m(p), f_m(q)] < \eta, \quad d[f_n(p), f_n(q)] < \eta$ if $p \in U_\mu(q) A$,

because $f_n$ and $f_m$ are continuous at $q$. Further, since $m > k$,

$d[f(p), f_m(p)] < \eta$, if $p \in A_k$.

Hence for the given $m$ and each $n > k$ and an existing $p \in U_\nu(q) U_\mu(q) A_k$,
we have

$d[f(q), f_n(q)] \leq d[f(q), f_m(q)] + d[f_m(q), f_m(p)] + d[f_m(p), f(p)] + d[f(p), f_n(p)] + d[f_n(p), f_n(q)] < 5\eta.$

Now consider any $\varepsilon > 0$, $\eta < \frac{1}{5}\varepsilon$ and the corresponding $A^*$. Since $A^*$ is
open in $A$, there is, for each $q \in A^*$, a neighborhood $U_\nu(q)$ such that

$d[f(p), f_n(p)] < \varepsilon$, if $n > k$, $p \in U_\nu(q) A$.

Let $A(\varepsilon)$ be the set of all $q$ in $A$ satisfying this condition. Then $A(\varepsilon)$ is open
in $A$ and not empty. The function $f(p)$ is continuous at each point of $\prod_n A(n^{-1})$.
These statements remain true if the set $A$ is replaced by any set $B$ open in $A$.
Hence $A(n^{-1})$ is dense in $A$ for each $n$ and, by Theorem 7, $\prod_n A(n^{-1})$ is dense
in $A$. Thus the set of points of $A$ at which $f(p)$ is continuous is dense in $A$.²

As an example of a regular Hausdorff space which satisfies the first denumerability
axiom and is complete but not metrizable we have the space $T$ whose
points are the ordinals of the first and second classes and whose neighborhoods
are open segments of the ordered set $T$.


UNIVERSITY OF KENTUCKY. 


² It is to be noted that the proofs of Theorems 5-9 are parallel to those for metric spaces.
Cf. Hausdorff, Grundzüge der Mengenlehre.








SOLUTIONS OF SYSTEMS OF DIFFERENTIAL EQUATIONS IN TERMS 
OF INFINITE SERIES OF DEFINITE INTEGRALS 


By JESSE PIERCE


Introduction. The general solution of a differential equation of the first order 
and first degree can be found in terms of an infinite series of definite integrals.¹
The definite integrals appearing in the solution are solutions of linear differen- 
tial equations. 

The same method is applicable to finding the general solution of a system of 
differential equations. The functions giving the solution, however, are ex- 
pressed in terms of infinite series of solutions of systems of linear differential 
equations. The general solution of each system of linear differential equations 
can be found in terms of infinite series of definite integrals.²

Hence the solution of the original system of differential equations is ex- 
pressed in terms of infinite series of infinite series of definite integrals except in 
the case where the original system is linear or the system comprises just one 
differential equation of the first order. 

In the present paper the general solution of a system of differential equations 
will be found in terms of infinite series of definite integrals in which every 
integrand consists of a finite number of terms. 

The system of differential equations to be considered has the form

(1) $\frac{dy_i}{dt} = \varphi_i(t, y_1, \dots, y_n) \qquad (i = 1, \dots, n),$

where the $\varphi_i(t, y_1, \dots, y_n)$ are analytic in the $y_j$ $(j = 1, \dots, n)$ and have as
coefficients $a_{i\mu_1 \cdots \mu_n}$ functions of $t$ which are integrable (Riemann) and satisfy
the inequalities

(2) $|a_{i\mu_1 \cdots \mu_n}| \leq f(u),$

$f(u)$ being a positive integrable function of $u$, the arc length of a rectifiable curve
drawn from the origin to the point $t$. The exponent of $y_j$ $(j = 1, \dots, n)$ in
the expanded form of the $\varphi_i$ is represented by $\mu_j$. The independent variable
$t$ is assumed to be of the form

(3) $t = \varphi(u) + \sqrt{-1}\,\psi(u),$

where $\varphi(u)$, $\psi(u)$ are real functions with continuous first derivatives.


Received June 9, 1937. 

¹ Solutions of a differential equation of the first order and first degree in terms of infinite
series of definite integrals, presented by the author to the Ohio Section of the Mathematical 
Association of America, April, 1937. 

² Solutions of systems of linear differential equations in the vicinity of singular points,
American Mathematical Monthly, vol. 43 (1936), pp. 530-539. 



Each unknown $y_i$ is obtained as the sum of a series $y_{ih}$ $(h = 1, 2, \dots)$,
whose terms are found by integrating, sequentially, certain polynomials in the
$y_{jk}$ $(j = 1, \dots, n;\ k = 1, \dots, h - 1)$.

For convenience, the sequence $\mu_1, \dots, \mu_n$ will be represented by $\mu$ and hence
the coefficient $a_{i\mu_1 \cdots \mu_n}$ will be represented by $a_{i\mu}$.

In §1 the formal solution of the system of differential equations (1) is found 
and the convergence of this solution is proved in §2. In §3 a more general 
system of differential equations is reduced to the form (1) by a simple trans- 
formation. 


1. Formal solution of the system of differential equations (1). Equations 
(1) can be written in the expanded form 


(4) $\frac{dy_i}{dt} = a_i(t) + \sum_{j=1}^{n} a_{ij}(t)\, y_j + \sum_{\nu=2}^{\infty} a_{i\mu}(t)\, y_1^{\mu_1} y_2^{\mu_2} \cdots y_n^{\mu_n} \qquad (i = 1, \dots, n),$

where $\nu = \mu_1 + \cdots + \mu_n$.

When the $y_i$, in equations (4), are replaced by

(5) $y_i = \sum_{h=1}^{\infty} y_{ih} K^h,$

there results

(6) $\sum_{h=1}^{\infty} K^h \frac{dy_{ih}}{dt} = a_i(t) + \sum_{j=1}^{n} a_{ij}(t) \sum_{h=1}^{\infty} K^h y_{jh} + \sum_{h=2}^{\infty} K^h F_{ih}(t, y_{jk}) \qquad (j = 1, \dots, n;\ k = 1, \dots, h - 1),$

where the $F_{ih}(t, y_{jk})$ are polynomials in the $y_{jk}$ of degree $h$ and each coefficient
is one of the $a_{i\mu}(t)$. The parameter $K$ is introduced in order that a convenient
arrangement of the terms in the right-hand members of equations (6)
can be made.

A formal solution of equations (6) can be found by replacing $K$ by unity and
then solving the following system of differential equations:

(7)
$\frac{dy_{i1}}{dt} = a_i(t),$
$\frac{dy_{i2}}{dt} = \sum_{j=1}^{n} a_{ij}(t)\, y_{j1},$
$\frac{dy_{ih}}{dt} = \sum_{j=1}^{n} a_{ij}(t)\, y_{j,h-1} + F_{i,h-1}(t, y_{jk}) \qquad (h = 3, 4, \dots).$

Equations (7) are obtained by equating the coefficient of $K^h$ in the left-hand
member of (6) to the coefficient of $K^{h-1}$ in the right-hand member.








Equations (7) have the formal solution

(8)
$y_{i1} = \int_0^t a_i(t)\, dt + c_i = \eta_{i1}(t),$
$y_{i2} = \int_0^t \sum_{j=1}^{n} a_{ij}(t)\, \eta_{j1}(t)\, dt = \eta_{i2}(t),$
$y_{ih} = \int_0^t \Big[ \sum_{j=1}^{n} a_{ij}(t)\, \eta_{j,h-1}(t) + F_{i,h-1}(t, \eta_{jk}) \Big] dt = \eta_{ih}(t) \qquad (k = 1, \dots, h - 2;\ h = 3, 4, \dots),$

where the $c_i$ are arbitrary parameters.
The set of functions defined by the infinite series

(9) $y_i = \sum_{h=1}^{\infty} \eta_{ih}(t) \qquad (i = 1, \dots, n)$

is a formal solution of the system of differential equations (4).
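The successive integrations can be traced on a small case. The sketch below is an illustration under assumptions not made in the paper: it takes the single equation $dy/dt = 1 + y^2$ (so $n = 1$, $a_1 = 1$, no linear term, and one quadratic coefficient equal to 1), stores polynomials in $t$ as coefficient lists, and uses hypothetical helper names. Each term is obtained by integrating the part of the right-hand member that carries the factor $K^{h-1}$.

```python
from fractions import Fraction

def poly_mul(p, q):
    """Multiply two polynomials in t given as coefficient lists (index = power)."""
    out = [Fraction(0)] * (len(p) + len(q) - 1)
    for i, a in enumerate(p):
        for j, b in enumerate(q):
            out[i + j] += a * b
    return out

def poly_add(p, q):
    """Add two coefficient lists, padding the shorter one with zeros."""
    size = max(len(p), len(q))
    p = p + [Fraction(0)] * (size - len(p))
    q = q + [Fraction(0)] * (size - len(q))
    return [a + b for a, b in zip(p, q)]

def poly_integrate(p, constant=Fraction(0)):
    """Integrate from 0 to t and add the constant of integration."""
    return [constant] + [a / (k + 1) for k, a in enumerate(p)]

def formal_terms(count, c=Fraction(0)):
    """Terms eta_1, ..., eta_count of the series for dy/dt = 1 + y**2, y(0) = c."""
    eta = [None, poly_integrate([Fraction(1)], c)]  # eta[1]; index 0 is unused
    for h in range(2, count + 1):
        # K**(h-1) part of y**2: the sum of eta_j * eta_k over j + k = h - 1
        rhs = [Fraction(0)]
        for j in range(1, h - 1):
            rhs = poly_add(rhs, poly_mul(eta[j], eta[h - 1 - j]))
        eta.append(poly_integrate(rhs))
    return eta[1:]

def partial_sum(terms):
    """Add the terms of the series into one polynomial."""
    total = [Fraction(0)]
    for term in terms:
        total = poly_add(total, term)
    return total

series = partial_sum(formal_terms(7))
```

With $c = 0$ the first seven terms add up to $t + t^3/3 + 2t^5/15 + 17t^7/315$, the beginning of the power series of $\tan t$, which is the solution of this equation vanishing at the origin.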


2. Proof of the convergence of the series (9). A particular case of the system 
of differential equations (1) is the system 


(10) $\frac{dY_i}{du} = \frac{f(u)}{1 - \sum_{j=1}^{n} Y_j},$

whose right-hand members dominate those of (1). The system (10) has a
solution in which all of the $Y_i$ are equal, that is, the $Y_i$ all satisfy the equation

(11) $\frac{dY}{du} = \frac{f(u)}{1 - nY}.$

The general solution of the differential equation (11) is

(12) $Y = \frac{1 - \{1 - 2n[G(u) + c]\}^{1/2}}{n},$

where

(13) $G(u) = \int_0^u f(u)\, du$

and $c$ is an arbitrary parameter. We shall consider $c$ real and non-negative.
The minus sign is used before the radical in order that all of the terms in the
right-hand member of (12), when expanded in a power series in $[G(u) + c]$,
be positive. This expansion has the form

(14) $Y = [G(u) + c] + \frac{n}{2}[G(u) + c]^2 + \frac{n^2}{2}[G(u) + c]^3 + \cdots + b_h n^{h-1}[G(u) + c]^h + \cdots,$




















where the $b_h$ are real positive constants. The series (14) will converge when

(15) $2n[G(u) + c] < 1.$

The power series (14) can be found directly from the differential equation
(11) by expanding the right-hand member of (11) in a power series in $Y$ and
then making the substitution

(16) $Y = \sum_{h=1}^{\infty} Y_h K^h.$

The result can be arranged in the form

(17) $\sum_{h=1}^{\infty} \frac{dY_h}{du} K^h = f(u) + f(u) \sum_{h=1}^{\infty} n Y_h K^h + f(u) \sum_{h=2}^{\infty} K^h \big[ n^2 (Y_1 Y_{h-1} + \cdots + Y_{h-1} Y_1) + \cdots + n^h Y_1^h \big]$
$\qquad = f(u) + f(u) \sum_{h=1}^{\infty} K^h n Y_h + \sum_{h=2}^{\infty} K^h F_h(u, Y_k) \qquad (k = 1, \dots, h - 1).$

It follows from the inequality (2) that the polynomial $F_h(u, Y_k)$ dominates the
polynomial $F_{ih}(t, y_{jk})$.
The system of differential equations corresponding to (7) is

(18)
$\frac{dY_1}{du} = f(u),$
$\frac{dY_2}{du} = f(u)\, n Y_1,$
$\frac{dY_3}{du} = f(u)\,[n Y_2 + n^2 Y_1^2],$
$\frac{dY_h}{du} = f(u)\, n Y_{h-1} + F_{h-1}(u, Y_k) \qquad (k = 1, \dots, h - 2;\ h = 3, 4, \dots).$

Equations (18) can be solved in terms of indefinite integrals in the form

(19)
$Y_1 = \int f(u)\, du = G(u) + c = H_1(u),$
$Y_2 = \int f(u)\, n H_1(u)\, du = \tfrac{1}{2} n [G(u) + c]^2 = H_2(u),$
$Y_3 = \int f(u)\,[n H_2(u) + n^2 H_1^2(u)]\, du = \tfrac{1}{2} n^2 [G(u) + c]^3 = H_3(u),$
$Y_h = \int [f(u)\, n H_{h-1}(u) + F_{h-1}(u, H_k(u))]\, du = a_h n^{h-1} [G(u) + c]^h = H_h(u),$

where the $a_h$ are positive constants.








The series

(20) $Y = \sum_{h=1}^{\infty} H_h(u)$

is a power series in $[G(u) + c]$ and hence is precisely the same as the series (14).
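The agreement of the two series can be checked term by term. The following sketch rests on assumptions of its own: it writes $s = G(u) + c$, uses the fact that $H_h$ is a constant multiple of $s^h$ so that the power of $K$ and the power of $s$ coincide, and compares the result with the binomial expansion of (12) expressed through Catalan numbers. The function names are hypothetical.

```python
from fractions import Fraction
from math import comb

def majorant_coefficients(n, terms):
    """Coefficient of s**h in H_h(u), with s = G(u) + c, for h = 1..terms."""
    coeff = [Fraction(0)] * (terms + 1)  # coeff[h] multiplies s**h; coeff[0] unused
    recip = [Fraction(1)]                # recip[m]: K**m coefficient of 1 / (1 - n*Y)
    for h in range(1, terms + 1):
        # dY_h/ds is the K**(h-1) coefficient of 1/(1 - n*Y);
        # integrating s**(h-1) divides it by h
        coeff[h] = recip[h - 1] / h
        # next reciprocal coefficient: w_h = n * sum_{j=1..h} coeff[j] * w_{h-j}
        recip.append(n * sum(coeff[j] * recip[h - j] for j in range(1, h + 1)))
    return coeff[1:]

def closed_form_coefficients(n, terms):
    """Coefficient of s**h in (1 - sqrt(1 - 2*n*s)) / n, via Catalan numbers."""
    return [Fraction(comb(2 * (h - 1), h - 1), h) * Fraction(n, 2) ** (h - 1)
            for h in range(1, terms + 1)]
```

For $n = 1$ the first three coefficients are $1$, $1/2$, $1/2$, in agreement with the leading terms of (14).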
When equations (18) are solved as in §1 in terms of definite integrals, one
obtains

(21)
$Y_1 = \int_0^u f(u)\, du + c = G(u) + c = Z_1(u),$
$Y_2 = \int_0^u f(u)\, n Z_1(u)\, du = \tfrac{1}{2} n [G(u) + c]^2 - \tfrac{1}{2} n c^2 = Z_2(u),$
$Y_3 = \int_0^u f(u)\,[n Z_2(u) + n^2 Z_1^2(u)]\, du = \tfrac{1}{2} n^2 [G(u) + c]^3 - \tfrac{1}{2} n^2 c^2 G(u) - \tfrac{1}{2} n^2 c^3 = Z_3(u),$
$\dots$

By comparing equations (21) with (19) it is clear that

(22) $Z_1(u) = H_1(u), \quad Z_h(u) \leq H_h(u) \qquad (h = 2, 3, \dots).$

It follows from equations (21) and (8) that

(23) $|\eta_{ih}(t)| \leq Z_h(u) \qquad (h = 1, 2, \dots),$

provided

(24) $|c_i| \leq c.$

Hence the series (9) will converge when the inequalities (15) and (24) are satisfied.

Since the series (9) are absolutely convergent and the independent variable $t$
satisfies the relation (3), the functions $y_i$ have derivatives of the first order and
satisfy the system of differential equations (4) for all values of $t$ on the path of
integration except at the set of points of measure zero at which the coefficients
$a_{i\mu}$ are discontinuous.


3. Systems of differential equations with more general coefficients than those
of (4). Consider the system of differential equations

(25) $\frac{dx_i}{dt} = \theta_i(t) + \alpha(t)\, x_i + \sum_{j=1}^{n} a_{ij}(t)\, x_j + \sum_{\nu=2}^{\infty} f_{i\mu}(t)\, x_1^{\mu_1} x_2^{\mu_2} \cdots x_n^{\mu_n} \qquad (i = 1, \dots, n),$

where $\mu$ represents the sequence $\mu_1, \dots, \mu_n$ and $\nu = \mu_1 + \cdots + \mu_n$. The
function $\alpha(t)$ is assumed to have the indefinite integral

(26) $\int \alpha(t)\, dt = \beta(t),$

which may become infinite at a set of points of measure zero on the path of
integration defined in §1. The functions $\theta_i(t)$ have the form

(27) $\theta_i(t) = a_i(t)\, e^{\beta(t)},$

where the $a_i(t)$ are integrable (Riemann) and satisfy the inequalities

(28) $|a_i(t)| \leq f(u),$

for all values of $t$ on the path of integration. The functions $a_{ij}(t)$ are integrable
and satisfy the inequalities

(29) $|a_{ij}(t)| \leq f(u),$

for all values of $t$ on the path of integration. The functions $f_{i\mu}(t)$ have the
form

(30) $f_{i\mu}(t) = a_{i\mu}(t)\, e^{(1 - \nu)\beta(t)} \qquad (\nu = 2, 3, \dots),$

where the $a_{i\mu}(t)$ are integrable, and satisfy the inequalities

(31) $|a_{i\mu}(t)| \leq f(u),$

for all values of $t$ on the path of integration.
The transformation

(32) $x_i = e^{\beta(t)} y_i$

reduces the system of differential equations (25) to the form (4) and hence the
system (25) has the solution

(33) $x_i = e^{\beta(t)} \sum_{h=1}^{\infty} \eta_{ih}(t),$

where the $\eta_{ih}(t)$ are defined by equations (8).
When the real part of the function $\beta(t)$ approaches minus infinity as $t$ approaches
$t'$, a point on the path of integration, then

(34) $\lim_{t \to t'} x_i(t) = 0.$

When the real part of the function $\beta(t)$ approaches plus infinity as $t$ approaches
$t'$, then

(35) $\lim_{t \to t'} x_i(t) = \infty,$

provided

(36) $\lim_{t \to t'} \sum_{h=1}^{\infty} \eta_{ih}(t) \neq 0.$


Conclusion. The coefficients of the differential equations (4) and (25) can
satisfy the assumptions of §§1 and 3 without being continuous or bounded with
respect to $t$.

The solutions (9) and (33) are the general solutions of the systems (4) and
(25), respectively, because they contain $n$ parameters $c_i$, which are arbitrary
except for the inequalities (15) and (24).

In the case where the limit of the real part of the function $\beta(t)$ is minus
infinity when $t$ approaches zero, the initial values of the dependent variables
$x_i$ are zero for every set of values of the $c_i$.


HEIDELBERG COLLEGE. 

















NON-n-ALTERNATING TRANSFORMATIONS 


By D. W. HALL AND G. E. SCHWEIGERT


Let $A$ and $B$ be compact metric spaces and $T(A) = B$ a single-valued continuous
transformation. We shall say that $T$ is non-n-alternating provided
that, for any point $x$ of $B$ for which there exists a cutting $K$ of $A - T^{-1}(x)$
consisting of at most $n$ points, there is no point $y$ of $B$ such that $T^{-1}(y)$ intersects
both sets of the separation $A - (T^{-1}(x) + K) = A_1 + A_2$. If $K$ is the
null set, this is the definition of a non-alternating transformation.¹ Consequently,
this type of transformation is non-alternating; in fact, we have the
following characterization:

THEOREM I. A necessary and sufficient condition that a single-valued continuous
transformation $T(A) = B$ be non-n-alternating is that $T$ be non-alternating
on the complement of every subset of $A$ consisting of at most $n$ points.

Proof. Let $x$ and $y$ be points of $B$ and $K$ any subset of $A$ consisting of at
most $n$ points. If $T^{-1}(x) \cdot (A - K)$ separates² $T^{-1}(y) \cdot (A - K)$ in $A - K$,
i.e., if $(A - K) - T^{-1}(x) \cdot (A - K) = A_1 + A_2$, $T^{-1}(y) \cdot (A - K) \cdot A_i \neq 0$
$(i = 1, 2)$, then this separation may be written in the form $(A - T^{-1}(x)) - K =
A_1 + A_2$. Hence $K$ separates $T^{-1}(y)$ in $A - T^{-1}(x)$, contrary to the definition
of non-n-alternating. Thus the condition is necessary.

To establish the sufficiency, we notice that if there exist two points $x$, $y$ in $B$
and a cutting $K$ of $A - T^{-1}(x)$ consisting of at most $n$ points such that $T^{-1}(y)$
intersects both the sets $A_1$ and $A_2$ of the separation $A - (T^{-1}(x) + K) =
A_1 + A_2$, then $(A - K) - T^{-1}(x) \cdot (A - K) = A_1 + A_2$ and therefore
$T^{-1}(y) \cdot (A - K)$ is separated by $T^{-1}(x) \cdot (A - K)$ in $A - K$. Consequently,
$T$ is not non-alternating on $A - K$. This proves the sufficiency.

LEMMA. If $T(A) = B$ is non-n-alternating, $B$ is non-degenerate, $y \in B$, and
two points of $T^{-1}(y)$ are separated in $A$ by a cutting $K$ consisting of $k \leq n + 1$
points, then $k = n + 1$ and $T(K) = y$.

Proof. If $k \leq n$, then $T$ is non-alternating on the complement of $K$, by
Theorem I. But this is impossible since $T^{-1}(y)$ intersects two components of
this complementary set. Thus $k = n + 1$. If $T(K) \neq y$, there exists a point
$p$ in $K$ such that $T(p) \neq y$. Then the set of $n$ points $(K - p)$ separates $T^{-1}(y)$
in $A - T^{-1}(T(p))$, contrary to the fact that $T$ is non-n-alternating. Therefore,
$T(K) = y$.

One consequence of this lemma, namely, the fact that a point of order not 


Received June 12, 1937. 

¹ See G. T. Whyburn, Non-alternating transformations, American Journal of Mathematics,
vol. 56 (1934), pp. 294-302.

² If $L$ and $M$ are subsets of $N$, we say that $L$ "separates" $M$ in $N$ provided $M$ is contained
in $N - L$ and $N - L = N_1 + N_2$, where $\bar{N}_1 \cdot N_2 = 0 = N_1 \cdot \bar{N}_2$ and $M \cdot N_1 \neq 0 \neq M \cdot N_2$.



greater than n in A is equal to the inverse of its image provided B is non- 
degenerate, suggests the following theorem. This theorem covers, as a special 
case, the class of regular curves of order n + 1. 

THEOREM II. If $A$ is a compact metric space every pair of points of which is
separated by a set containing at most $(n + 1)$ points and $T(A) = B$ is non-n-alternating
$(n > 1)$, then $T$ is a homeomorphism on $A$ provided $B$ is non-degenerate.

Proof. We first show that the inverse of every point $b$ of $B$ is an A-set.³
Let $G = T^{-1}(b)$; then $G$ is closed and lies in a single component $H$ of $A$. From
the hypotheses of the theorem it follows that $H$ is a locally connected continuum.
Consequently, if $p$ and $q$ are any two points of $G$, there is a simple arc $pq$ joining
these two points in $H$. If we assume that there is a point $x$ in $pq$ which is
not in $G$, it follows that there is a last point $y$ of $G$ in the arc from $p$ to $x$ and
a first point $z$ of $G$ in the arc from $x$ to $q$. Thus the open arc $yxz$ contains no
points of $G$. By hypothesis, there exists a cutting $K$ consisting of at most
$(n + 1)$ points and separating $y$ and $z$ in $A$, so that some point of $K$ must lie
on the open arc $yxz$. Hence $T(K)$ is not $b$, contrary to the lemma. It follows
that $G$ contains every simple arc joining two of its points, and hence, being
closed, it is an A-set.

It is also a consequence of the lemma that no set of $n$ points separates $G$,
so that this set lies in a true cyclic element $C$ of $A$; therefore $G = C$, since $C$
is a minimal A-set.⁴ The set $C$ now has the property that any pair of its points
can be irreducibly separated by $(n + 1)$ points. It follows⁵ that $C$ cannot
exist except as a single point. Thus $T$ is 1-1 and hence a homeomorphism.
This completes the proof of the theorem.

Under the same hypotheses for $n = 1$ the above proof holds except for the
non-existence of the true cyclic element $C$. If $C$ exists, it must be a simple
closed curve, and the image space $B$ is not only a boundary curve,⁶ but it is
homeomorphic with the original curve, provided that no true cyclic element of $A$
has a degenerate image.

If $n = 0$, the transformation is non-alternating and acts on a space having
dendrites as components; this situation has already been discussed in the
original paper on non-alternating transformations.¹

³ That is, $T^{-1}(b)$ is closed and contains every simple arc joining two of its points in $A$.
See Kuratowski and Whyburn, Fundamenta Mathematicae, vol. 16, p. 309.

⁴ See Kuratowski and Whyburn, loc. cit.

⁵ This follows from the theorem that for $n > 2$ there exists no continuous curve $M$ such
that for every pair of points $P$ and $Q$ of $M$ there are exactly $n$ independent arcs from $P$
to $Q$, but there does not exist for any pair of points $(n + 1)$ such arcs. This theorem has
been proved for the cases $n = 3$ and $n = 4$ by Kusner and published in the Comptes Rendus
des Séances de la Société des Sciences de Varsovie (1932). It has been established for
the general case, but not yet published, by J. R. Kline.

⁶ That is, a compact locally connected continuum each true cyclic element of which is a
simple closed curve. This term has been suggested by G. T. Whyburn, loc. cit., p. 301.
It follows that a boundary curve is a compact continuum every pair of points of which is
separated by at most two points. Here, this characterization is needed rather than the
definition.

















From this same source we learn that the property of being a boundary curve
is invariant under non-alternating transformations. Since this result represents
in a certain sense a sharper form of the present theorem, it might be suspected
that for a compact metric space $A$ the property of being separated
between each pair of its points by at most $(n + 2)$ points is invariant under












































Fig. 1





non-n-alternating transformations. That such is not the case can be shown 
by simple examples. However, the failure of the purest form of the analogy by 
no means denies the possibility of other closely connected results. For example: 
Is the non-n-alternating image of a regular curve of order $(n + 2)$ hereditarily
locally connected? 

The question as to whether or not the property of being a regular curve is 













invariant under non-alternating transformations was raised by G. T. Whyburn 
and offered difficulties which led to this and to at least one other specialized 
type of non-alternating transformation. In answer to this question we may 
now make the following statement: There exists a regular curve A of order three 
and a non-alternating transformation⁷ $T(A) = B$ such that the inverse of any
point of B is at most three points, but B is not hereditarily locally connected, hence 
certainly not a regular curve. 

Example. Let $P_0$ denote a square on (0, 1) and for each positive integer $k$


° ° TT 
let P, be the square on (0, 1) which lies above and makes an angle 4 = 3 


with $P_0$. Consider, for future reference, the intervals with end points
$3^{-m} r + 3^{-m-1}$ and $3^{-m} r + 2 \cdot 3^{-m-1}$ $(m = 0, 1, \dots;\ r = 0, 1, \dots, 3^m - 1)$,
and denote each such interval by $I_{m,r}$. In $P_k$ for $k = 3^m + r$, make the
following construction: (i) on the base $I_{m,r}$ erect a square and denote the
side opposite the base by $s$; (ii) using $s$ as the middle third of the hypotenuse
construct an isosceles right triangle which is disjoint with the interior of this
new square.

The space $A$ is the sum of the boundary of $P_0$ and all the figures constructed
in (i), (ii) above. The transformation $T(A) = B$ will consist of a simple
identification of all the squares $P_k$ with the particular square $P_0$, i.e., let $\theta_k = 0$
for all $k$. The image space $B$ may be constructed as indicated by the figure.
We have at once that $T$ is one-to-one on $A$ except at the ends of the hypotenuse
of each triangle constructed in (ii) and at certain obvious points on the lines
perpendicular to the basic unit interval. If $p$ is any point of $A$ such that
$T^{-1}(T(p)) \neq p$, then $p$ is not a point of the basic unit interval. Consequently,
$p$ lies in a non-degenerate cyclic element of $A \cdot P_k$, where $p \in P_k$. No other
point of $T^{-1}(T(p))$ is in $P_k$, hence $T^{-1}(T(p))$ does not separate $A \cdot P_k$. It
follows easily that no inverse set separates $A$, i.e., $T$ is non-separating, hence
surely non-alternating. It will also be observed that the bases of all the triangles
constructed on squares of the same height are joined in $B$ to form an
interval of unit length, and that the sequence of intervals thus formed has the
unit interval as a continuum of convergence. Thus $B$ is not hereditarily locally
connected. The remaining properties of $T$, $A$, and $B$ are easy consequences of
their respective definitions.


UNIVERSITY OF VIRGINIA. 
⁷ In fact, this transformation is non-separating in the sense of Wardwell, i.e., for no
$b \in B$ does $T^{-1}(b)$ separate $A$. See James F. Wardwell, Non-separating transformations,
this Journal, vol. 2 (1936), pp. 745-750.

















RESIDUATION IN STRUCTURES OVER WHICH A MULTIPLICATION 
IS DEFINED 


By MORGAN WARD


I. Introduction 


1. Consider a set of elements A, B, C, --- forming an abstract structure’ = 
over which there is defined a commutative and associative multiplication XY. 
The multiplication operation is connected with the structure by assuming that 
it is distributive with respect to union: 


(3.3) A(B, C) = (AB, AC) for A, B, Cin ¥. 


This condition is satisfied in the important instance of the ideals of a com- 
mutative ring. 

If we assume that any subclass of elements of $\Sigma$ has a union and correspondingly
strengthen assumption (3.3), the existence of a residual (German,
Idealquotient²) A:B follows for each pair of elements A, B of $\Sigma$ with the defining
properties


$A \supset (A:B)B$; if $A \supset XB$, then $A:B \supset X$.


It is easy to show that the residual thus defined has the formal properties of
the residual in polynomial ideal theory (Macaulay, [3]). But in the special
instance of ordinary arithmetic (when $\Sigma$ is interpreted as the ring of rational
integers) the residual has a number of additional interesting properties which
do not hold in general; for example,


$(A:B, B:A) = I$;³ $(A, B):M = (A:M, B:M)$,
$M:[A, B] = (M:A, M:B)$, $M:AB = \{(M:A)(M:B)\}:M$,
$A:(B:A) = A(A, B):B$, $[A, B]:(A, B) = (A:B)(B:A)$.
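
The arithmetic identities above can be spot-checked numerically. The following is a minimal sketch, not part of the original paper. It assumes the interpretation in which the elements are positive integers, union is the greatest common divisor, cross-cut is the least common multiple, the unit element I is 1, and the residual A:B is A divided by the greatest common divisor of A and B. The names `residual` and `check_identities` are illustrative only.

```python
from itertools import product
from math import gcd


def lcm(a: int, b: int) -> int:
    """Cross-cut [A, B] of two positive integers (least common multiple)."""
    return a * b // gcd(a, b)


def residual(a: int, b: int) -> int:
    """Residual A:B in ordinary arithmetic, taken here as a / gcd(a, b)."""
    return a // gcd(a, b)


def check_identities(limit: int = 30) -> bool:
    """Check the six displayed identities for all 1 <= a, b, m <= limit."""
    for a, b, m in product(range(1, limit + 1), repeat=3):
        union, cross = gcd(a, b), lcm(a, b)
        # (A:B, B:A) = I
        assert gcd(residual(a, b), residual(b, a)) == 1
        # (A, B):M = (A:M, B:M)
        assert residual(union, m) == gcd(residual(a, m), residual(b, m))
        # M:[A, B] = (M:A, M:B)
        assert residual(m, cross) == gcd(residual(m, a), residual(m, b))
        # M:AB = {(M:A)(M:B)}:M
        assert residual(m, a * b) == residual(residual(m, a) * residual(m, b), m)
        # A:(B:A) = A(A, B):B
        assert residual(a, residual(b, a)) == residual(a * union, b)
        # [A, B]:(A, B) = (A:B)(B:A)
        assert residual(cross, union) == residual(a, b) * residual(b, a)
    return True
```

A finite search of this kind only illustrates the identities in the integer case; it says nothing about the abstract structure.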


The problem arises then of determining the conditions under which these 
and other properties of residuation in ordinary arithmetic will hold in the 
abstract structure. It is not difficult to show that it suffices to assume⁴

Postulate E. If A divides B, there exists a unique element Q such that
AQ = B.

Received June 22, 1937. 

1 Other terms are "dual group", "Verband", "lattice". For a definition, see §2 of this
paper, or O. Ore, reference [1] at the close of the paper.
2 The concept appears to be due to Dedekind [4]. See van der Waerden [2] or Macaulay [3].
3 I here is the unit element with respect to multiplication. See §§2 and 3.
4 Postulate E is satisfied in every principal ideal ring.


But this solution of the problem is trivial. First of all, Postulate E is far
stronger than necessary. Secondly, even if we weaken E by no longer requiring
Q to be unique, the assumption is still undesirable because it renders the concept
of the residual superfluous; A:B is merely a particular one of the quotients⁵
$\frac{A}{(A, B)}$. We assume accordingly the weaker

Postulate C. If A divides B, there exists an element P such that A = B:P.

We shall show here that Postulate C alone suffices to prove that $(A:B, B:A) = I$,
$(A, B):M = (A:M, B:M)$ and $M:[A, B] = (M:A, M:B)$. Postulate C is a
sufficient, but not necessary, condition that $\Sigma$ be an arithmetic structure, and
a necessary condition that $\Sigma$ be a Boolean algebra, if multiplication is identified
with cross-cut.

If we interpret our multiplication as the cross-cut operation of the structure 
(so that the structure is arithmetic by assumption 3.3), we obtain a residuation 
operation within the structure which does not seem to have been investigated 
even in common arithmetic. 

It is also possible to define residuation abstractly over the structure by a 
proper selection of the properties of residuation given in §4 without any reference 
to multiplication. This investigation has been carried out by R. P. Dilworth 
in an unpublished paper. 

A complete postulational analysis of the interrelationships between the struc- 
ture properties, and the operations of residuation and multiplication would 
appear desirable, but will not be given here. We shall content ourselves with an 
informal treatment, introducing our postulates on the basis of naturalness and 
convenience. Their consistency will be evident. 


2. We begin by recalling briefly the defining properties of a structure (Ore,
[1]). We postulate the existence of a well defined division relation $\supset$ which
is transitive and reflexive. The equality relation = is then defined in terms
of $\supset$ by A = B if and only if $A \supset B$ and $B \supset A$. For any two elements A
and B of $\Sigma$ we postulate the existence of elements D and M such that


$D \supset A$, $D \supset B$; if $X \supset A$, $X \supset B$, then $X \supset D$.
$A \supset M$, $B \supset M$; if $A \supset Y$, $B \supset Y$, then $M \supset Y$.


D and M are called the union and cross-cut of A and B. They are determined
up to equal elements. We write⁶ D = (A, B) and M = [A, B].
If an element E divides every element T of a fixed subclass $\Theta$ of $\Sigma$, we write


5 We define a quotient $Q = \frac{A}{B}$ as an element Q such that A = QB. See §4.

6 Ore uses [A, B] for union and (A, B) for cross-cut. We prefer to retain as far as
possible the notation and terminology of ideal theory.
















$E \supset \Theta$. We state explicitly our assumption of closure of $\Sigma$ with respect to
union (Ore [1], p. 409):

Postulate A. For every subclass $\Theta$ of elements of $\Sigma$, there exists an element U
called the union of $\Theta$ such that


$U \supset \Theta$; if $X \supset \Theta$, then $X \supset U$.


We write $U = u(\Theta)$.

In particular the element $u(\Sigma)$ divides every element of $\Sigma$. We shall call it
the identity element of $\Sigma$ and denote it by I.

If we assume the existence of a null element divisible by every other element,
then the closure of $\Sigma$ with respect to union obviously entails the closure of $\Sigma$
with respect to cross-cut. Conversely, closure with respect to cross-cut and
the existence of an identity imply closure with respect to union.

If for every three elements A, B, C of $\Sigma$


(2.1) $(A, [B, C]) = [(A, B), (A, C)]$,


the structure is said to be arithmetic, or distributive. (2.1) is equivalent to
$[A, (B, C)] = ([A, B], [A, C])$.

We recall that $A \supset B$ if and only if A = (A, B) and B = [A, B], and that
union and cross-cut are associative, commutative, and idempotent operations.


3. We next assume that $\Sigma$ is closed with respect to an associative and commutative
operation $X \cdot Y$ or XY:
(3.1) $A \cdot B$ is in $\Sigma$ if A, B are in $\Sigma$; $A \cdot (B \cdot C) = (A \cdot B) \cdot C$; $A \cdot B = B \cdot A$.
We call this operation multiplication. We make the further assumptions
(3.2) $I \cdot A = A$ for every element A of $\Sigma$.
Here I is the identity element of the structure.
(3.3) $A \cdot (B, C) = (A \cdot B, A \cdot C)$ for any three elements A, B, C of the structure.
The following rules are easy consequences of these assumptions:
$B \supset C$ implies $AB \supset AC$. $A \supset B$ and $C \supset D$ imply $AC \supset BD$.
A = BC implies $B \supset A$. $[A, B] \supset AB \supset [A, B](A, B)$.
(A, B) = I implies AB = [A, B] and (A, BC) = (A, C), any C.


On account of Postulate A, assumption (3.3) must be strengthened, as from
(3.3) we can merely deduce the distributivity of multiplication with respect to
union for a finite number of elements. If $\Theta$ and $\Phi$ are subclasses of $\Sigma$, we define
their product $\Theta\Phi$ as the subclass of all products of elements of $\Theta$ and elements
of $\Phi$. If $\Theta$ consists of a single element T, we write $T\Phi$ for $\Theta\Phi$. We assume⁷


7 It suffices to assume merely that $u(T\Phi) = Tu(\Phi)$ for the developments which follow;
but this apparently weaker assumption is easily shown to be equivalent to Postulate B.













Postulate B. The union of the product of any two subclasses of $\Sigma$ is the product
of their unions.


II. Residuation 
4. The element R = A:B is called the residual of B with respect to A if 
(4.1) $A \supset RB$; $A \supset XB$ implies $R \supset X$.


The residual always exists for any A, B of $\Sigma$. For since $A \supset AB$, the class
$\Theta$ of all elements X such that $A \supset XB$ is non-empty. Let $R = u(\Theta)$. Then
by Postulate B, since $A \supset \Theta B$, $A = u(A) \supset u(\Theta B) = u(\Theta)u(B) = RB$, or
$A \supset RB$. And if $A \supset XB$, X lies in $\Theta$, so that $R \supset X$.

The following properties of residuation which are needed later follow exactly 
as in the ideal theory of commutative rings (van der Waerden [2], Chapter XII; 
Macaulay [3], Chapter 3): 

(4.2) $A:A = I$, $A:I = A$, $I \supset A:B \supset A$.
(4.21) $A:B = A:(A, B) = [A, B]:B$.
(4.3) $A:B = I$ if and only if $A \supset B$.
(4.31) If $A:B = A$, then $A \supset XB$ implies $A \supset X$.
(4.4) $(A:B):C = (A:C):B = A:BC$.
(4.41) $M:A = B$ implies $AB:A = B$.
(4.5) $M:(M:N) \supset N$.
(4.51) $A \supset B$ implies $M:B \supset M:A$ and $A:M \supset B:M$.


Theorem 4.1 (Macaulay [3], p. 32). If M and N are any two elements of
$\Sigma$ and A = M:N, B = M:(M:N), then B = M:A, A = M:B.


A and B are then said to be mutually residual with respect to M. 


The quotient $Q = \frac{A}{B}$ of two elements A and B of $\Sigma$ is defined by


(4.6) $A = QB$; if $A = XB$, then $Q \supset X$.

Unlike the residual, the quotient need not exist even if $B \supset A$. But if the
class $\Theta$ of all X such that A = XB is non-empty, $\frac{A}{B}$ exists and is the union
$u(\Theta)$. Clearly $\frac{A}{A} = I$, $\frac{A}{I} = A$, $\frac{AB}{A}$ exists.

Theorem 4.2. If the quotient $\frac{A}{B}$ exists, it equals the residual A:B.

Proof.⁸ Let $\frac{A}{B} = V$, $A:B = W$. Then by (4.1), (4.6) $A = BV \to$


8 We use when convenient $\to$ and $\leftrightarrow$ for formal implication and equivalence to shorten
the proofs.




























$A \supset BV \to W \supset V \to BW \supset BV \to BW \supset A \to BW = A \to V \supset W$.
$W \supset V$ and $V \supset W \to W = V$.


It follows from formula (4.21) that $A:B = \frac{A}{(A, B)} = \frac{[A, B]}{B}$ whenever the
indicated quotients exist. Consequently, if we assume

Postulate D. If A divides B, there exists at least one element Q such that
A = QB,

then the residual A:B reduces to the quotient $\frac{A}{(A, B)}$. We shall not assume
Postulate D in this paper.

We conclude this division of the paper with a lemma which we shall need 
subsequently. 

Lemma 4.1. If (R, S) = I, then $(P, Q) \supset [P:R, Q:S]$.

Proof. Let P:R = U, Q:S = V. By (4.1), $P \supset UR$, $Q \supset VS$ so that
$(P, Q) \supset (UR, VS)$. It suffices then to show that $(UR, VS) \supset [U, V]$.
$(R, S) = I \to [U, V] = [U, V](R, S) = (R[U, V], S[U, V])$. Now $U \supset [U, V] \to
UR \supset R[U, V]$; $V \supset [U, V] \to VS \supset S[U, V]$. Hence $(UR, VS) \supset
(R[U, V], S[U, V]) \supset [U, V]$.


III. Distributive properties of residuation 


5. The following two “distributive laws” for residuation (Macaulay [3], p. 33) 
are due in essence to Dedekind [4]: 


I   $M:(A_1, A_2, \dots, A_n) = [M:A_1, M:A_2, \dots, M:A_n]$,
II  $[A_1, A_2, \dots, A_n]:M = [A_1:M, A_2:M, \dots, A_n:M]$.
In common arithmetic the residual satisfies the additional distributive laws
III $(A_1, A_2, \dots, A_n):M = (A_1:M, A_2:M, \dots, A_n:M)$,
IV  $M:[A_1, A_2, \dots, A_n] = (M:A_1, M:A_2, \dots, M:A_n)$.


Since III and IV are easily proved by induction from the case n = 2, we shall
discuss them here in the form


(5.1) $(A, B):M = (A:M, B:M)$,
(5.2) $M:[A, B] = (M:A, M:B)$.


That III and IV need not hold in the abstract structure is shown by interpreting
$\Sigma$ as the polynomial ideals of the ring $K[x_1, x_2, x_3]$. On taking M =
(xi, 22, 23), A = (27, 23) and B = (a3, xs) we find that (A, B):M = (1), 
(A:M, B:M) = (ai, 23, 23). On the other hand, if M = (a2, 23), A = 
(zi , 23), B = (23, 23), then M:[A, B] = (1), (M:A, M:B) = (x1, 22, 23). 

We now make the following assumption: 

Postulate C. If A divides B, there exists an element P such that A = B:P.

The following consequences of C are needed in the discussion of (5.1) and 
(5.2) and serve to reveal its scope. 
















Lemma 5.1. If P and Q are any two elements of $\Sigma$, then (P, Q) = P:(P:Q).

Proof. $(P, Q) \supset P \to (P, Q) = P:N$ by Postulate C. By Theorem 4.1
and rule (4.21), (P, Q) = P:N = P:(P:(P:N)) = P:(P:(P, Q)) = P:(P:Q).

Lemma 5.2. If $Q \supset P$, then Q = P:(P:Q).

Proof. $Q \supset P \to Q = (P, Q) \to Q = P:(P:Q)$ by Lemma 5.1.

Lemma 5.3. If $P \supset M$ and $Q \supset M$, then $M:P \supset M:Q$ only if $Q \supset P$.

Proof. By Lemma 5.2, $P \supset M$, $Q \supset M \to P = M:(M:P)$, $Q = M:(M:Q)$.
Then by rule (4.51), $M:P \supset M:Q \to M:(M:Q) \supset M:(M:P) \to Q \supset P$.

This lemma is a limited converse of the first part of rule (4.51) which states
that $Q \supset P$ implies $M:P \supset M:Q$. The direct converse is of course false.

Theorem 5.1. Lemma 5.1 and Postulate C are equivalent.

Proof. Lemma 5.1 implies Lemma 5.2. And Lemma 5.2 is Postulate C
with N = P:Q.

Lemma 5.31. If $P \supset M$ and $Q \supset M$, then M:P = M:Q if and only if Q = P.

Proof. $Q = P \to M:P = M:Q$ by (4.51). $M:P = M:Q \to Q = P$ if
$P \supset M$, $Q \supset M$ by Lemma 5.3.

Lemma 5.4. If M:N = M, then (M, N) = I.

Proof. By Lemma 5.1 and rule (4.2), (M, N) = M:(M:N) = M:M = I.

Lemma 5.4 shows that the distinction between "relatively prime" and
"coprime" (teilerfremd) elements, which must be made in the general theory
(van der Waerden [2], Chapter XII, p. 30), vanishes if Postulate C is assumed.

Theorem 5.2. Lemma 5.3 and Postulate C are equivalent to one another.

Proof. We have shown that Postulate C $\to$ Lemma 5.1 $\to$ Lemma 5.2 $\to$
Lemma 5.3 $\to$ Lemma 5.31; Lemma 5.2 $\to$ Postulate C. It suffices then to
show that Lemma 5.31 $\to$ Lemma 5.2. By Theorem 4.1 and rule (4.21),
P:(P, Q) = P:Q = P:(P:(P:Q)). Also $(P, Q) \supset P$, $P:(P:Q) \supset P$. Hence
by Lemma 5.31, (P, Q) = P:(P:Q). This is Lemma 5.2.


6. We shall now prove the important 
Theorem 6.1. If Postulate C holds, the structure $\Sigma$ is arithmetic.⁹
Proof. It suffices to show that


(i) $(C, [A, B]) \supset [(C, A), (C, B)]$;


for we have trivially $[(C, A), (C, B)] \supset (C, [A, B])$.

Assume Postulate C. Then by Lemma 5.1 and the second distributive law,
$(C, [A, B]) = [A, B]:\{[A, B]:C\} = [A, B]:[A:C, B:C] = [A, B]:M$, where
we have written M for [A:C, B:C]. Thus by the second distributive law, and
Lemma 5.1,


$(C, [A, B]) = [A:M, B:M]$,
$[(C, A), (C, B)] = [A:(A:C), B:(B:C)]$.


Now $A:C \supset M$, $B:C \supset M$. Hence by rule (4.51), $A:M \supset A:(A:C)$,
$B:M \supset B:(B:C)$, so that $[A:M, B:M] \supset [A:(A:C), B:(B:C)]$, giving (i).


9 Stated by Garrett Birkhoff ([6], p. 619) for the special instance of the ideals of a
commutative ring. We have made no assumption here that our structure is Dedekindian.































Postulate C need not hold in an arithmetic structure. For consider the
numbers 1, 2 and 4 with both multiplication and cross-cut taken as L.C.M.
and union as G.C.D. In this arithmetic structure, 1:1 = 1:2 = 1:4 = 2:2 =
2:4 = 4:4 = 1; 2:1 = 2; 4:1 = 4:2 = 4. However, 4:(4:2) = 1, (4, 2) = 2.
This contradicts Lemma 5.1.
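
The residuals quoted for this three-element structure can be recomputed directly from definition (4.1). The sketch below is an illustration and not part of the original text. It assumes the reading in which "A divides B" is ordinary divisibility, the product and cross-cut of two elements are their least common multiple, and the union of a class is its greatest common divisor. The names `ELEMENTS`, `divides` and `residual` are hypothetical.

```python
from functools import reduce
from math import gcd

ELEMENTS = (1, 2, 4)


def lcm(a: int, b: int) -> int:
    """Product and cross-cut in this structure: least common multiple."""
    return a * b // gcd(a, b)


def divides(a: int, b: int) -> bool:
    """The division relation: a divides b."""
    return b % a == 0


def residual(a: int, b: int) -> int:
    """A:B as the union (g.c.d.) of every X for which A divides the product XB."""
    candidates = [x for x in ELEMENTS if divides(a, lcm(x, b))]
    return reduce(gcd, candidates)


# Full table of residuals A:B, keyed by the pair (A, B).
table = {(a, b): residual(a, b) for a in ELEMENTS for b in ELEMENTS}

# The two sides of Lemma 5.1 for P = 4, Q = 2.
left_side = gcd(4, 2)                    # (P, Q)
right_side = residual(4, residual(4, 2)) # P:(P:Q)
```

Here `left_side` and `right_side` differ, which is the failure of Lemma 5.1 noted in the text.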

If $\Sigma$ is a Boolean algebra and we interpret both multiplication and cross-cut
as Boolean multiplication and union as Boolean addition, then $A:B = A + B'$
where $B'$ is the negative of B. Hence


$A:(A:B) = A:(A + B') = A + (A + B')' = A + A'B = A + B = (A, B)$,
so that Postulate C is satisfied by Theorem 5.1.
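
The Boolean case can be illustrated on the subsets of a small set. This is a sketch under stated assumptions, not the author's construction: Boolean multiplication is taken as set intersection, Boolean addition as set union, the negative as the complement in a three-point universe, and "A divides B" as "A contains B". The residual is computed from definition (4.1) as the union of all X whose product with B is contained in A. The names `UNIVERSE`, `ALL_SUBSETS` and `residual` are hypothetical.

```python
from itertools import combinations

UNIVERSE = frozenset({0, 1, 2})


def subsets(s: frozenset) -> list:
    """All subsets of s, as frozensets."""
    items = sorted(s)
    return [frozenset(c) for r in range(len(items) + 1)
            for c in combinations(items, r)]


ALL_SUBSETS = subsets(UNIVERSE)


def residual(a: frozenset, b: frozenset) -> frozenset:
    """A:B as the union of every X with (X intersect B) contained in A."""
    result = frozenset()
    for x in ALL_SUBSETS:
        if x & b <= a:
            result |= x
    return result


def check_boolean_case() -> bool:
    for a in ALL_SUBSETS:
        for b in ALL_SUBSETS:
            # A:B = A + B'
            assert residual(a, b) == a | (UNIVERSE - b)
            # A:(A:B) = (A, B), the statement of Lemma 5.1
            assert residual(a, residual(a, b)) == a | b
    return True
```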
Lemma 6.1. If Postulate C holds, then for any three elements A, B and M of $\Sigma$,
$(M:A, M:B) = M:[(M, A), (M, B)]$.


Proof. Assume Postulate C. Then by Lemma 5.1, $[(M, A), (M, B)] =
[M:(M:A), M:(M:B)] = M:(M:A, M:B)$, by the first distributive law.
Now $(M:A, M:B) \supset M$. Hence by Lemma 5.2, $(M:A, M:B) =
M:\{M:(M:A, M:B)\} = M:[(M, A), (M, B)]$.

Theorem 6.2. If Postulate C holds, then for any three elements A, B and M
of $\Sigma$
(5.2) $M:[A, B] = (M:A, M:B)$.


Proof. Assume Postulate C. Then by Theorem 6.1, $[(M, A), (M, B)] =
(M, [A, B])$. Hence by Lemma 6.1, $(M:A, M:B) = M:(M, [A, B]) =
M:[A, B]$, by rule (4.21).


7. Theorem 7.1. If Postulate C holds, then
$(A:B, B:A) = I$.


Proof. Assume Postulate C. Then by rule (4.21), Theorem 6.2 and rule
(4.2), $(A:B, B:A) = ([A, B]:B, [A, B]:A) = [A, B]:[A, B] = I$.

Theorem 7.2. If Postulate C holds, then for any three elements A, B and M
of $\Sigma$
(5.1) $(A, B):M = (A:M, B:M)$.


Proof. $(A, B) \supset A \to (A, B):M \supset A:M$ by (4.51). Hence $(A, B):M \supset
(A:M, B:M)$ and it suffices to show that


(i) $(A:M, B:M) \supset (A, B):M$.


(4.2) $\to A:M \supset A$, $B:M \supset B \to (A:M, B:M) \supset (A, B)$. Also (4.2) $\to
(A, B):M \supset (A, B)$. Hence by Lemma 5.3, (i) follows if


(ii) $(A, B):\{(A, B):M\} \supset (A, B):\{(A:M, B:M)\}$.


By Lemma 5.1, the left side of (ii) is ((A, B), M) = ((A, M), (B, M)). By
the first distributive law, the right side is $[(A, B):(A:M), (A, B):(B:M)]$.










Now by Lemma 5.1 and rule (4.4), $(A, B):(A:M) = \{A:(A:B)\}:(A:M) =
\{A:(A:M)\}:(A:B) = (A, M):(A:B)$. Similarly, we have $(A, B):(B:M) =
(B, M):(B:A)$. Thus the right side of (ii) equals $[(A, M):(A:B), (B, M):(B:A)]$.
By Theorem 7.1, (A:B, B:A) = I. Hence by Lemma 4.1, $((A, M),
(B, M)) \supset [(A, M):(A:B), (B, M):(B:A)]$. This gives (ii).


IV. Multiplicative properties of structures 


8. We shall deduce here two or three interesting consequences of the fourth 
distributive law. These are independent of Postulate C, which we no longer 
assume. 

Theorem 8.1. If the fourth distributive law holds, then multiplication is
distributive with respect to cross-cut; that is, for any three elements A, B and M
of $\Sigma$
(8.1) $M[A, B] = [MA, MB]$.


Proof. Let N be any element of $\Sigma$. Then by (5.2) and (4.4) $N:[MA, MB] =
(N:MA, N:MB) = ((N:M):A, (N:M):B) = (N:M):[A, B] = N:M[A, B]$.

On taking N = [MA, MB] and N = M[A, B] and applying rule (4.3) we see
that $M[A, B] \supset [MA, MB]$, $[MA, MB] \supset M[A, B]$. Hence (8.1) follows.


Let $P_1, Q_1; P_2, Q_2; \dots; P_n, Q_n$ be n pairs of elements mutually residual
with respect to a fixed element M, so that
$M:P_i = Q_i$, $M:Q_i = P_i$ ($i = 1, 2, \dots, n$).


Then the first and fourth distributive laws show that
$M:(Q_1, Q_2, \dots, Q_n) = [P_1, P_2, \dots, P_n]$,
$M:[P_1, P_2, \dots, P_n] = (Q_1, Q_2, \dots, Q_n)$.
Hence we have
Theorem 8.2. If the fourth distributive law holds and $P_i$, $Q_i$ are mutually
residual with respect to M for $i = 1, 2, \dots, n$, then $[P_1, P_2, \dots, P_n]$ and
$(Q_1, Q_2, \dots, Q_n)$ are also mutually residual with respect to M.
Pairs of mutually residual elements thus form a kind of structure.
Lemma 8.1. If multiplication is distributive with respect to cross-cut, then for
any two elements A, B of $\Sigma$


(8.2) $AB = [A, B](A, B)$.


Proof. We always have $[A, B] \supset AB \supset [A, B](A, B)$. Assume that (8.1)
holds. Then $[A, B](A, B) = [A(A, B), B(A, B)]$. Since $A(A, B) \supset AB$,
$B(A, B) \supset AB$, we have $[A, B](A, B) \supset AB$. This gives (8.2).
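
In ordinary arithmetic, formulas (8.1) and (8.2) are the familiar statements that multiplication distributes over the least common multiple and that the product of two integers equals the product of their least common multiple and greatest common divisor. The sketch below is a numerical illustration only, with positive integers assumed for the elements and the hypothetical name `check_multiplicative_rules`.

```python
from math import gcd


def lcm(a: int, b: int) -> int:
    """Cross-cut [A, B] of two positive integers."""
    return a * b // gcd(a, b)


def check_multiplicative_rules(limit: int = 40) -> bool:
    """Check (8.1) and (8.2) for all 1 <= a, b, m <= limit."""
    for a in range(1, limit + 1):
        for b in range(1, limit + 1):
            # (8.2): AB = [A, B](A, B)
            assert a * b == lcm(a, b) * gcd(a, b)
            for m in range(1, limit + 1):
                # (8.1): M[A, B] = [MA, MB]
                assert m * lcm(a, b) == lcm(m * a, m * b)
    return True
```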

Formula (8.2) may be generalized as follows. Let $S_1, S_2, \dots, S_n$ be any n
elements of $\Sigma$, and let


Si 
2 
- 


a. ee a. ? SS ee ee, 2? Se 










Then if formula (8.2) holds, we readily prove by induction that¹⁰
(8.3) $S_1 \cdot S_2 \cdots S_n = [S_1, S_2, \dots, S_n](T_1, T_2, \dots, T_k)$.


Formulas (8.2), (8.3) are thus consequences of the fourth distributive law. 


9. We conclude by giving some properties of structures which are semi-groups.
The structure $\Sigma$ is said to form a semi-group if
(9.1) For any three elements A, B, C of $\Sigma$, AB = AC implies B = C.
This condition is easily seen to be equivalent to
(9.2) For any three elements A, B, C of $\Sigma$, $AB \supset AC$ implies $B \supset C$.
An equivalent condition in terms of residuation is
(9.3) For any two elements M and N of $\Sigma$, MN:M = N.


In a semi-group, a quotient $Q = \frac{A}{B}$ need not exist if $B \supset A$. But if it does
exist, it is unique in the sense that A = QB is satisfied for only one value of Q.
(9.1) may be restated as
(9.4) For any two elements M and N of $\Sigma$, there is at most one element R such that
MR = N.
The equivalence of (9.1), (9.2), (9.3), (9.4) is independent of Postulate C.
Theorem 9.1. Let $\Sigma$ be a structure in which multiplication is distributive with
respect to cross-cut. Then if $\Sigma$ is also a semi-group, $\Sigma$ is an arithmetic structure.¹¹
Proof. We are to show that (9.1) implies that


(2.1) $(A, [B, C]) = [(A, B), (A, C)]$.
By hypothesis and (8.1), (8.2) of the previous section
$[AB, AC] = A[B, C] = [A, [B, C]](A, [B, C]) = [[A, B], [A, C]](A, [B, C])$.
$[[A, B], [A, C]][(A, B), (A, C)]$
$= [[A, B](A, B), [A, B](A, C), [A, C](A, B), [A, C](A, C)]$
$= [AB, AC, [A, B](A, C), [A, C](A, B)]$
$= [[AB, AC], M]$,
where
$M = [[A, B](A, C), [A, C](A, B)]$.
But $[AB, AC] \supset [[AB, AC], M]$. Hence
$[[A, B], [A, C]](A, [B, C]) \supset [[A, B], [A, C]][(A, B), (A, C)]$.


10 This formula is of great importance in common arithmetic. See, for example, Stieltjes
[5], Chapter I, §§7-10.

11 The converse of this theorem is of course false as is shown by the structure consisting
of the finite ring of integers modulo 4.










Therefore by (9.2) and hypothesis
$(A, [B, C]) \supset [(A, B), (A, C)]$.
But we have trivially $[(A, B), (A, C)] \supset (A, [B, C])$. Hence (2.1) follows.


On combining this result with Theorem 8.1, we obtain

Theorem 9.2. In a semi-group, the fourth distributive law for residuation is a
sufficient condition that the structure be arithmetic.

REFERENCES
1. O. Ore, Annals of Math., vol. 36 (1935), pp. 406-437.
2. B. L. van der Waerden, Moderne Algebra, vol. 2, Berlin, 1931.
3. F. S. Macaulay, The Algebraic Theory of Modular Systems, Cambridge, 1916.
4. R. Dedekind, Über die Theorie der ganzen algebraischen Zahlen, Werke, vol. III,
pp. 1-222, especially §170.
5. T. J. Stieltjes, Annales de Toulouse, (1), vol. 4 (1890), pp. 1-102.
6. Garrett Birkhoff, Bull. Amer. Math. Soc., vol. 40 (1934), pp. 613-619.


CALIFORNIA INSTITUTE OF TECHNOLOGY. 













ASYMPTOTIC RELATIONS FOR DERIVATIVES 
By R. P. Boas, Jr. 


1. Introduction. A well known theorem of Hardy and Littlewood states, in a
form in which it is often quoted, that if f(x) is of class $C^2$ on $(0, \infty)$, and if, as
$x \to \infty$, $f(x) = o(1)$ and $f''(x) = O(x^{-2})$, then $f'(x) = o(x^{-1})$. It is a special
case of a theorem in which the powers of x in the order relations are replaced
by more general functions; and this in turn can be used to establish an extended
theorem where from the order of f(x) and of $f^{(n)}(x)$ ($n \ge 2$) one deduces the
orders of the intermediate derivatives.¹ Now, if we think of the hypothesis
on $f''(x)$ in the original theorem as "$x^2 f''(x) = O(1)$", it is a hypothesis on the
order of the function resulting from applying a certain linear differential operator
to f(x). The principal result of this note is the corresponding theorem when
the operator $x^2 \frac{d^2}{dx^2}$ is replaced by a certain more general, n-th order, linear
operator, L; from the order of L[f(x)] and of f(x), the order of

$f^{(k)}(x)$ ($k = 1, 2, \dots, n - 1$)

can be deduced. This result, and a preliminary theorem, overlap the results
of Hardy and Littlewood, but neither include them nor are included by them.
The full statement of our main theorem is somewhat complex; to illustrate it
as simply as possible, a special case, sufficient for many applications, will be
stated here.

Let


(1.1) $L[f(x)] = \sum_{k=0}^{n} A_k x^k f^{(k)}(x)$,


where the $A_k$ are constants, $A_n \ne 0$. Let f(x) be of class $C^n$ on $(0, \infty)$, and suppose
that as $x \to \infty$, $L[f(x)] < O(1)$. If $f(x) = O(1)$, then $f^{(k)}(x) = O(x^{-k})$;
if $f(x) = o(1)$, then $f^{(k)}(x) = o(x^{-k})$ ($k = 1, 2, \dots, n - 1$).
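
A small numerical illustration of the special case may be helpful. It is a sketch under assumptions not found in the original: the test function $f(x) = \sin(\log x)$, the operator $L[f] = x^2 f''$ (so n = 2), and central finite differences with a step proportional to x. For this f, both f(x) and $x^2 f''(x)$ stay bounded, and the theorem then asserts that $x f'(x)$ is bounded as well; here $x f'(x) = \cos(\log x)$ exactly. The names `f`, `first_derivative`, `second_derivative` and `scaled_derivatives` are illustrative.

```python
import math


def f(x: float) -> float:
    """Test function: bounded, with x^2 f''(x) bounded as well."""
    return math.sin(math.log(x))


def first_derivative(g, x: float, h: float) -> float:
    """Central difference approximation to g'(x)."""
    return (g(x + h) - g(x - h)) / (2.0 * h)


def second_derivative(g, x: float, h: float) -> float:
    """Central difference approximation to g''(x)."""
    return (g(x + h) - 2.0 * g(x) + g(x - h)) / (h * h)


def scaled_derivatives(x: float) -> tuple:
    """Return approximations to (x f'(x), x^2 f''(x)) with step h = 1e-4 x."""
    h = 1e-4 * x
    return x * first_derivative(f, x, h), x * x * second_derivative(f, x, h)


for x in (10.0, 1e3, 1e6):
    x_f1, x2_f2 = scaled_derivatives(x)
    print(f"x = {x:g}: x f' = {x_f1:+.6f}, x^2 f'' = {x2_f2:+.6f}")
```

The printed values of $x f'$ oscillate between -1 and 1 while those of $x^2 f''$ stay within $\pm\sqrt{2}$, in agreement with the O-statement for k = 1.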

Examples of operators L[f(x)] which have the form (1.1) are $x^n f^{(n)}(x)$; the
operator

$L_{k,x}[f(x)] = \frac{(-x)^{k-1}}{k!\,(k-2)!} \frac{d^{2k-1}}{dx^{2k-1}} [x^k f(x)]$ ($k \ge 2$)

used by D. V. Widder to invert the Stieltjes transform;² and the operator
$L_{k,x}\{L_{k,x}[f(x)]\}$, which inverts the iterated Stieltjes transform.³ In a later paper
by D. V. Widder and the author, the results of this note will be applied to the
theory of the iterated Stieltjes transform.


Received June 23, 1937; presented to the American Mathematical Society, September,
1937.

1 G. H. Hardy and J. E. Littlewood, Contributions to the arithmetic theory of series,
Proceedings of the London Mathematical Society, (2), vol. 11 (1913), pp. 411-478; 417 ff.

Reference should also be made to E. Landau, Über einen Satz von Herrn Esclangon,
Mathematische Annalen, vol. 102 (1929-30), pp. 177-188. Landau considers more general
differential operators than we do, but his results are less general in other respects.


2. Theorems with second derivatives. We shall denote by "$\uparrow$" and "$\downarrow$",
respectively, the classes of positive functions which are non-decreasing or
non-increasing, $0 < x < \infty$. We shall also consider the class of functions $\varphi(x)$,
positive on $0 < x < \infty$, and such that $\varphi(cx)/\varphi(x)$ is uniformly bounded for
$0 < x < \infty$ and $\tfrac{1}{2} \le c \le 2$; we shall call it "K".⁴ $C^n$ denotes the class of
functions having continuous n-th derivatives on $(0, \infty)$. With these notations,
we can state

Theorem 1A. If $\varphi(x)$ and $\psi(x)$ satisfy any one of the conditions

(a) $\varphi(x) \in \downarrow$, $\psi(x) \in \downarrow$;

(b) $\varphi(x) \in \uparrow$, $\psi(x) \in \uparrow$;
(c) $x^r \varphi(x) \in \uparrow$, $x^r \psi(x) \in \uparrow$, with $r \ge 0$, and $\varphi(x) = O(x^2 \psi(x))$;
then, for any $f(x) \in C^2$, as $x \to \infty$ all of the following statements are true:


$A_1$. $f(x) = O(\varphi(x))$ and $f''(x) = O(\psi(x))$ imply $f'(x) = O([\varphi(x)\psi(x)]^{1/2})$;

$A_2$. $f(x) = o(\varphi(x))$ and $f''(x) = O(\psi(x))$ imply $f'(x) = o([\varphi(x)\psi(x)]^{1/2})$;

$A_3$. $f(x) = O(\varphi(x))$ and $f''(x) = o(\psi(x))$ imply $f'(x) = o([\varphi(x)\psi(x)]^{1/2})$, provided
that $\varphi(x) = o(x^2 \psi(x))$ is added to condition (c).

Theorem 1B. If $\varphi(x) \in K$, $\psi(x) \in K$, $\varphi(x) = O(x^2 \psi(x))$, and $f(x) \in C^2$, then,
as $x \to \infty$,

$B_1$. $f(x) = O(\varphi(x))$ and $f''(x) < O(\psi(x))$ imply $f'(x) = O([\varphi(x)\psi(x)]^{1/2})$;

$B_2$. $f(x) = o(\varphi(x))$ and $f''(x) < O(\psi(x))$ imply $f'(x) = o([\varphi(x)\psi(x)]^{1/2})$;

$B_3$. $f(x) = O(\varphi(x))$, $f''(x) < o(\psi(x))$, and $\varphi(x) = o(x^2 \psi(x))$ imply $f'(x) =
o([\varphi(x)\psi(x)]^{1/2})$.

Theorem 1A under hypotheses (b) and (c)⁵ and the special case of $B_2$, where
$\varphi(x)$ and $\psi(x)$ are powers of x,⁶ are due to Hardy and Littlewood.


2 D. V. Widder, The Stieltjes transform, to appear in the Transactions of the American
Mathematical Society.

3 D. V. Widder, The iterated Stieltjes transform, Proceedings of the National Academy of
Sciences, vol. 23 (1937), pp. 242-244.

4 Any constants a and A, $0 < a < 1 < A < \infty$, would do as well as $\tfrac{1}{2}$ and 2 in the
definition of the class K. K contains, in particular, all those functions which J. Karamata
calls "regularly increasing in the wide sense". See J. Karamata, Sur un mode de croissance
régulière des fonctions, Mathematica (Cluj), vol. 4 (1930), pp. 38-53, 194-195; the class of
functions in question is defined on p. 40, and the theorems necessary to show that it is
contained in K are on pp. 40, 45.

5 G. H. Hardy and J. E. Littlewood, op. cit., pp. 417, 425-426.

6 G. H. Hardy and J. E. Littlewood, Tauberian theorems concerning power series and
Dirichlet's series whose coefficients are positive, Proceedings of the London Mathematical
Society, (2), vol. 13 (1914), pp. 174-191. On p. 188 they give, not precisely the theorem in
question, but the corresponding theorem when $x \to 0+$.













Theorem 1B does not include Theorem 1A; for example, let $f(x) = \varphi(x) =
\psi(x) = e^x$; $\varphi(x)$ and $\psi(x)$ satisfy (b) of Theorem 1A, which is applicable to f(x);
but there is no function $\theta(x) \in K$ for which $f(x) = O(\theta(x))$. Suppose, in fact,
that $e^x \le A\theta(x)$ ($0 < x < \infty$; A a constant), and that $\theta(cx)/\theta(x) \le M$ ($0 <
x < \infty$; $c > 1$). Then $\theta(cx) \le M\theta(x)$, $\theta(c^2 x) \le M\theta(cx) \le M^2\theta(x)$, and by
induction, $\theta(c^n x) \le M^n\theta(x)$ ($n = 1, 2, \dots$). But then $e^{c^n x} \le A\theta(c^n x) \le AM^n\theta(x)$;
take x = 1; the result is contradictory for large n. Hence Theorem 1B cannot
be applied to f(x).

Theorem 1A does not include Theorem 1B, because the two-sided O-conditions
of Theorem 1A cannot be replaced by one-sided O-conditions; this is
shown by the example $\varphi(x) = e^x$, $\psi(x) = 1$, $f(x) = -e^x$: here $f(x) = O(\varphi(x))$,
$f''(x) < 0 = o(\psi(x))$, but $f'(x) \ne O([\varphi(x)\psi(x)]^{1/2})$. On the other hand, if the
one-sided O-conditions of Theorem 1B are replaced by two-sided O-conditions,
the resulting theorem is included in Theorem 1A (c).⁷ To establish this, we
have only to show that it is possible to construct, given any $\theta(x) \in K$, a positive
function u(x), such that for some $r \ge 0$, $x^r u(x) \in \uparrow$, and $\theta(x) = O(u(x))$, $u(x) =
O(\theta(x))$. Since $\theta(cx)/\theta(x) \le M$, uniformly for $0 < x < \infty$, $\tfrac{1}{2} \le c \le 2$, it
follows (replacing x by cx) that $\theta(c^2 x)/\theta(x) \le M^2$, and generally that
$\theta(c^n x)/\theta(x) \le M^n$, so that we have $\theta(cx)/\theta(x) \le M^n$ ($2^{-n} \le c \le 2^n$;
$n = 1, 2, \dots$). We assume, without loss of generality, that $\theta(0) = 0$. Choosing
$r \ge 0$ so that $M 2^{-r} < 1$, we define u(x) by

$u(x) = x^{-r} \,\mathrm{l.u.b.}_{0 \le t \le x}\, (t^r \theta(t))$;

evidently $x^r u(x) \in \uparrow$; and $u(x) \ge \theta(x)$, so that $\theta(x) = O(u(x))$. We can define,
for each x, a $t_x$ ($0 \le t_x \le 1$), such that $(t_x x)^r \theta(t_x x) > \tfrac{1}{2} x^r u(x)$,


$\frac{u(x)}{\theta(x)} < 2 t_x^r \frac{\theta(t_x x)}{\theta(x)}$.


Let $x \to \infty$. If we can show that for x sufficiently large, and for some $m \ge 1$,
$t_x > 2^{-m}$, we shall have $u(x)/\theta(x) < 2M^m$, and hence $u(x) = O(\theta(x))$; this will
complete the construction. But if the $t_x$ are not bounded from zero, we cannot
have $t_x = 0$ ($x > 0$), and there is a sequence $x_k$ with $t_{x_k}$ approaching zero. There
are then integers $n_k \ge 1$ with $2^{-n_k} \ge t_{x_k} > 2^{-n_k - 1}$ for sufficiently large k, and


$\frac{u(x_k)}{\theta(x_k)} < \frac{2 M^{n_k + 1}}{2^{r n_k}} < 1$.


This contradicts $u(x) \ge \theta(x)$.
There is no theorem corresponding to Theorem 1B with the rôles of the
equality and inequality signs interchanged; that is (taking for example $B_1$),
if $\varphi(x) \in K$, $\psi(x) \in K$, $\varphi(x) = O(x^2 \psi(x))$, then $f(x) < O(\varphi(x))$ and $f''(x) = O(\psi(x))$
do not imply $f'(x) = O([\varphi(x)\psi(x)]^{1/2})$;
this state of affairs is to be expected; the reader will have no difficulty in
constructing examples to illustrate it.


7 That this might be true was suggested to the author by the referee.













From the point of view of this note, Theorem 1B is more important than 
Theorem 1A, because it will be used in establishing our main theorem. For 
the sake of completeness, however, we shall give a proof of Theorem 1A under 
hypothesis (a) (this is the case not discussed by Hardy and Littlewood), and 
indicate how the same method could be used for the other cases. No originality 
is claimed for the proofs of Theorems 1A and 1B; the method of proof is an 
obvious modification of that by which the theorems are usually established 
when $\varphi(x)$ and $\psi(x)$ are powers of x.⁸

Let $\delta(x)$ be any function, nowhere zero, such that $x + \delta(x) > 0$ for $x > 0$.
Then Taylor's theorem with remainder of second order, applied to f(x), can be
written


(2.1) $f'(x) = \frac{1}{\delta(x)} [f(x + \delta(x)) - f(x)] - \tfrac{1}{2}\delta(x) f''(x + \theta\delta(x))$,


where $\theta = \theta(x, \delta(x))$ satisfies $0 < \theta < 1$.

Now, if $\epsilon(x)$ is any positive function, (2.1) is valid with $\delta(x) = \epsilon(x)[\varphi(x)/\psi(x)]^{1/2}$,
and we have

(2.2) $\frac{|f'(x)|}{[\varphi(x)\psi(x)]^{1/2}} \le \frac{1}{\epsilon(x)\varphi(x)} [\,|f(x + \delta(x))| + |f(x)|\,] + \frac{\epsilon(x)}{2\psi(x)} |f''(x + \theta\delta(x))|$.


If we assume hypothesis (a) of Theorem 1A,

$\varphi(x + \delta(x)) \le \varphi(x)$,
with a similar inequality for $\psi(x)$, and (2.2) gives

(2.3) $\frac{|f'(x)|}{[\varphi(x)\psi(x)]^{1/2}} \le \begin{cases} \epsilon(x)O(1) + O(1)/\epsilon(x), \\ \epsilon(x)O(1) + o(1)/\epsilon(x), \\ \epsilon(x)o(1) + O(1)/\epsilon(x), \end{cases}$


according as we consider $A_1$, $A_2$, or $A_3$, respectively. To establish $A_1$, we
take $\epsilon(x) = 1$; to establish $A_2$, we take for $\epsilon(x)$ a function which is o(1) but of
sufficiently slow decrease; to establish $A_3$, we take for $\epsilon(x)$ a function of
sufficiently slow increase, with $1/\epsilon(x) = o(1)$.
Theorem 1A under hypothesis (b) can be established in a similar way, but
more care is needed. It is natural in this case to take $\delta(x) = -\epsilon(x)[\varphi(x)/\psi(x)]^{1/2}$,
where $\epsilon(x)$ is still a positive function, to be chosen as before; but such an argument
can be used only as $x \to \infty$ on the set $E_1$ of points where $\varphi(x)/\psi(x) \le
(x/\epsilon(x))^2$, since on the complementary set, $E_2$, $\delta(x) < -x$, and (2.1) is no
longer valid. But as $x \to \infty$ on $E_2$, it is legitimate to suppose that $f'(0) = 0$,
and the conclusions of the theorem can then be deduced from


$|f'(x)| \le \int_0^x |f''(t)|\,dt$,


8 See, for example, E. Landau, Darstellung und Begründung einiger neueren Ergebnisse der
Funktionentheorie, 1929, p. 58, where a proof is given for the corresponding theorem when
$x \to 0+$; the modifications necessary when $x \to \infty$ are trivial.




























and the fact that y(x) « |. Under hypothesis (c), the proof is simpler, since (c) 
includes an auxiliary hypothesis which in effect makes 2; empty. 

We now establish Theorem 1B. We have g(x) < o(x)2°¥(x), where o(x) = 
o(1) for Bs, and (without loss of generality) we may take o(x) = 1 for B, 
and B.. Let e(x) be a positive function, ¢«(z) < [40(x)]*. Let o(z) = 
+e(x)[o(x)/W(x)}'; then | 5(r) | < 3a, and (2.1) is valid; we may write (2.1) in 
the form ' ; 

xy —1 /y¥(2) e(x) (e(x)\’ py 
2a) Fp) = 7) (HO) pe + aay — sol + P(E) gree + eave), 
where 0 < @ < 1, and the sign taken in (2.4) is that of —d(x). Now, f’(x) S 
n(x)¥(x), where n(x) > 0, n(x) = O(1) for B, and Bz, and n(x) = o(1) for Bs. 
The function A(z) = _u.b. —_n(t) has the same properties, and n(x + 66(x)) S 


x/2<st<3z/2 
A(x). Then 


Fs) _ fle + H(e)) — fla), €(a)aW(x + 05(2)) 


C5) ova = e@eay + we 
Since ¥(r) « K, ¢(x) « K, and | (x) | < 3, there is a constant B such that 
(2.6) v(x + 06(x))/Y(x) = B, o(x + 5(x))/e(z) S B; 


and hence 
f(x + 6(x))| _ | f(a + 6(a)) | e@ + 6) — B | f(x + 4(z)) | 
¢(2) 


~ g(a + 6(z)) p(t) (ew + 8(2))’ 
so that 
(2.7) | f(x + 6(z)) |/e(z) & (2), 
where r(x) = O(1) or o(1), according as we consider B, and B;, or Bz. Then 
(2.6) and (2.7) reduce (2.5) to 
Ff'(x)le(aw(x)]* < r(x)/e(x) + 4B e(x)A(z). 
For B,, we take e(z) = 4; for Be, we take e(x) = min [}, (r(x))']; for Bs, we 
take e(z) = min [(A(z))*, (50(x)) 4). 


3. A definition and a lemma. It is convenient to have a name for the differ- 
ential operators which we shall consider. We introduce the following 
Derinition. A linear differential operator 


(3.1) LIf(a)] = )» Pn—i(x)f” (x) 
with 
(3.2) pi(x) = p b:;2’, 


where the b;; are constants, and bon = 1, shall be called a generalized Euler operator.’ 


9 The name "generalized Euler operator" is used because in the special case, mentioned
in the introduction, when $p_i(x) = b_i x^{n-i}$, $L[f(x)] = g(x)$ is what is known as an Euler
differential equation.








642 R. P. BOAS, JR. 


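LEMMA. If $L[f(x)]$ is a generalized Euler operator of order $n$, $f(x)$ is of class $C^{n}$ $(0 \le x < \infty)$, and $a$ is an integer, $0 \le a \le n - 1$, then as $x \to \infty$,

(3.3)  $$\int_0^x (x-t)^{a} L[f(t)]\,dt = x^{n} f^{(n-a-1)}(x)(c + o(1)) + \sum_{i=0}^{n-a-2} f^{(i)}(x)\,O(x^{a+i+1}) + \sum_{i=0}^{a} A_i x^{i}\int_0^x f(t)\,O(t^{a-i})\,dt + O(x^{a}),$$

where the $A_i$ are constants, and $c$ is a constant $\ne 0$.¹⁰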
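Let $\bar{L}[f(x)]$ be the operator adjoint to $L[f(x)]$. Then if $g(x)$ is any function of class $C^{n}$,

(3.4)  $$\int_0^x g(t)L[f(t)]\,dt = \int_0^x f(t)\bar{L}[g(t)]\,dt + P[f(t), g(t)]\Big|_0^x,$$

where

(3.5)  $$\bar{L}[g] = \sum_{i=0}^{n} (-1)^{i}\,[p_{n-i}\,g]^{(i)},$$

(3.6)  $$P[f(t), g(t)] = \sum_{i=1}^{n}\sum_{j=0}^{i-1} (-1)^{j} f^{(i-j-1)}(t)\,[p_{n-i}(t)g(t)]^{(j)}.$$¹¹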


If $g(t) = (x - t)^{a}$, and the differentiations are carried out in (3.5) and (3.6), we find

$$\int_0^x (x-t)^{a}L[f(t)]\,dt = \sum_{i=0}^{a} A_i x^{i}\int_0^x f(t)P_{a-i}(t)\,dt + \sum_{i=0}^{a} A_i' x^{i} + \sum_{i=0}^{n-a-2} f^{(i)}(x)P_{a+i+1}(x) + f^{(n-a-1)}(x)\big(cx^{n} + P_{n-1}(x)\big),$$

where $A_i$ and $A_i'$ denote various constants, and $P_k(t)$ is a polynomial of degree $k$
at most (not necessarily the same at each appearance); from this (3.3) follows
at once. The details are left to the reader.


4. A theorem with generalized Euler operators. We shall consider a pair of
positive functions $\theta(x)$ and $\varphi(x)$ satisfying what we shall call

CONDITIONS A.¹²

(i) $\varphi(x) \in K$;

(ii) $x\theta(x) = O(\varphi(x))$;

(iii) $\int_0^x \theta(t)\,dt = O(\varphi(x))$;

(iv) $\int_0^x \varphi(t)\,dt = O(x\varphi(x))$.¹³

10 Actually, $c = (-1)^{a}a!$.

11 See, for example, E. L. Ince, Ordinary Differential Equations, 1927, pp. 123-124.

12 The author is indebted to the referee for the elimination of a redundancy in Conditions A.

13 It is supposed throughout this section that $x \to \infty$. In particular, if $\theta(x) > 0$, and
$\theta(x)$ is (in Karamata's terminology) regularly increasing in the wide sense, then $\theta(x)$ and
$\varphi(x) = x\theta(x)$ satisfy Conditions A. The results needed for verifying this can be found
in Karamata's paper cited in footnote 4.






















We now establish our main theorem. 

THEOREM 2. Let $L[f(x)]$ be a generalized Euler operator of order $n$, and let
$f(x)$ of class $C^{n}$ on $(0, \infty)$ be a solution of the differential equation $L[f(x)] = g(x)$.
Let $\theta(x)$ and $\varphi(x)$ satisfy Conditions A. If, as $x \to \infty$,

(4.1)  $f(x) = O(\theta(x)),$

(4.2)  $\int_0^x g(t)\,dt < O(\varphi(x)),$

then

(4.3)  $f^{(p)}(x) = O(x^{-p-1}\varphi(x))$

for $p = 1, 2, \cdots, n - 2$; and also for $p = n - 1$ if in addition

(4.4)  $g(x) < O(\theta(x)).$

If also

(4.5)  $f(x) = o(\theta(x))$

and

(4.6)  $\int_0^\infty \theta(t)\,dt = \infty,$

then

(4.7)  $f^{(p)}(x) = o(x^{-p-1}\varphi(x))$

for $p = 1, 2, \cdots, n - 2$, and also for $p = n - 1$ if (4.4) is satisfied.

If the conclusion of the second part of the theorem is needed only for $1 \le p \le n - 3$ (or $n - 2$ if (4.4) is satisfied), it is a direct consequence of the first part
(even without (4.6)), by successive applications of Theorem 1B, since by Condition A(ii), $\theta(x) = O(x^{-1}\varphi(x))$, while $(x + 1)^{k}\varphi(x) \in K$ for any real number $k$
if $\varphi(x) \in K$.

Theorem 2 has content only when $n \ge 2$ if (4.4) is satisfied, and only if
$n \ge 3$ when (4.2) alone is satisfied. If $n = 2$, $L[f(x)] = x^{2}f''(x)$, and $x\theta(x) = \varphi(x)$, Theorem 2 reduces to the special case of $B_1$ and $B_2$ of Theorem 1B where
$\varphi(x) = x^{2}\psi(x)$ (except that Conditions A(ii), (iii), (iv), and (4.6) are then
redundant). If L[{f(x)] = 2"f‘(x), ¢(x) = x6(z) = x‘, and the one-sided 
O-condition (4.4) is replaced by a two-sided O-condition, Theorem 2 is a special 
case of the extension of Theorem 1A given by Hardy and Littlewood (who use 
a more general function g(x)).”* 

If we take $\theta(x) = (x + 1)^{-1}\log (x + 1)$, $\varphi(x) = [\log (x + 1)]^{2}$, we obtain an
example illustrating the theorem when $\varphi(x) \ne x\theta(x)$; and the theorem with
these functions has applications.¹⁵


14 Reference in footnote 5.
15 It was used in the author's Harvard thesis (unpublished).
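The example pair can be checked against Conditions A(ii), (iii), (iv) numerically. The following is a minimal sketch, not part of the paper: it assumes the reading $\theta(x) = (x+1)^{-1}\log(x+1)$, $\varphi(x) = [\log(x+1)]^{2}$, uses closed-form antiderivatives, and all function names are hypothetical.

```python
import math

def theta(x):
    # Assumed reading of the example: theta(x) = log(x + 1) / (x + 1)
    return math.log(x + 1.0) / (x + 1.0)

def phi(x):
    # Assumed reading of the example: phi(x) = [log(x + 1)]^2
    return math.log(x + 1.0) ** 2

def int_theta(x):
    # Exact value of the integral of theta over [0, x]: (1/2) log^2(x + 1)
    return 0.5 * math.log(x + 1.0) ** 2

def int_phi(x):
    # Exact value of the integral of phi over [0, x], using the
    # antiderivative u log^2(u) - 2 u log(u) + 2 u with u = t + 1
    u = x + 1.0
    return u * math.log(u) ** 2 - 2.0 * u * math.log(u) + 2.0 * u - 2.0

def condition_ratios(x):
    # Ratios that Conditions A(ii), (iii), (iv) require to stay bounded
    return (
        x * theta(x) / phi(x),
        int_theta(x) / phi(x),
        int_phi(x) / (x * phi(x)),
    )

if __name__ == "__main__":
    for x in (10.0, 1e3, 1e6, 1e9):
        print(x, condition_ratios(x))
```

The second ratio is identically 1/2, which also shows that the integral of $\theta$ diverges, as (4.6) requires; the first ratio tends to zero, so $\varphi(x)$ is genuinely larger than $x\theta(x)$ here.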










To establish Theorem 2, we start from the familiar formula

$$G_m(x) = \int_0^x (x-t)^{m}g(t)\,dt = m!\int_0^x dt_m\int_0^{t_m} dt_{m-1}\cdots\int_0^{t_2} dt_1\int_0^{t_1} g(t)\,dt \qquad (m > 0).$$

From Conditions A(ii) and (iv) we obtain, for $p \ge 1$,

(4.8)  $$\int_0^x \varphi(t)t^{p}\,dt = O\Big(x^{p}\int_0^x \varphi(t)\,dt\Big) = O(x^{p+1}\varphi(x)); \qquad \int_0^x \theta(t)t^{p}\,dt = O\Big(\int_0^x \varphi(t)t^{p-1}\,dt\Big) = O(x^{p}\varphi(x));$$

and it follows, by use of (4.2), that

(4.9)  $G_m(x) < O(x^{m}\varphi(x)) \qquad (m = 0, 1, 2, \cdots).$

Now, by Condition A(iii),

$$\int_0^x \theta(t)\,dt = O(\varphi(x)), \qquad 1/\varphi(x) = O\Big(1\Big/\int_0^x \theta(t)\,dt\Big) = O(1),$$

(4.10)  $x^{k} = O(x^{k}\varphi(x)) \qquad (k \ge 0).$

Since $f(x) = O(\theta(x))$, from Condition A(iii) and (4.8), we obtain

(4.11)  $$\sum_{i=0}^{a} A_i x^{i}\int_0^x f(t)\,O(t^{a-i})\,dt = O(x^{a}\varphi(x)).$$

We use (4.10) and (4.11) in (3.3), and, remembering that $L[f(x)] = g(x)$, we obtain

(4.12)  $$G_a(x) = x^{n}f^{(n-a-1)}(x)(c + o(1)) + \sum_{i=0}^{n-a-2} f^{(i)}(x)\,O(x^{a+i+1}) + O(x^{a}\varphi(x)) \qquad (1 \le a \le n - 1).$$

Suppose now that (4.3) has been established for $1 \le p \le q$, $q \le n - 2$.
Then (4.12) for $a = n - q - 1$ gives

(4.13)  $G_{n-q-1}(x) = O(x^{n-q-1}\varphi(x)),$

and since $G_m''(x) = m(m - 1)G_{m-2}(x)$ ($m \ge 1$; we set $G_{-1}(x) = g(x)$), we have
by (4.9) with $m = n - q - 3$,

(4.14)  $G_{n-q-1}''(x) < O(x^{n-q-3}\varphi(x))$

when $q \le n - 3$, and also when $q = n - 2$ if (4.4) is satisfied. But since
$\varphi(x) \in K$, $(x + 1)^{n-q-1}\varphi(x) \in K$; and by part $B_1$ of Theorem 1B, (4.13) and
(4.14) imply

(4.15)  $G_{n-q-1}'(x) = (n - q - 1)G_{n-q-2}(x) = O(x^{n-q-2}\varphi(x)).$










By (4.12) for $a = n - q - 2$, and by (4.3), assumed established for $1 \le p \le q$,

(4.16)  $G_{n-q-2}(x) = x^{n}f^{(q+1)}(x)(c + o(1)) + O(x^{n-q-2}\varphi(x)).$

Comparing (4.15) and (4.16), we have, since $c \ne 0$, (4.3) for $p = q + 1$, provided
only that $1 \le q \le n - 3$, or $1 \le q \le n - 2$, according as (4.4) is not or is
satisfied. Hence the first part of Theorem 2 is true as soon as we verify (4.3)
for $p = 1$. To do this, we have, from (4.12) with $a = n - 1$,

$$G_{n-1}(x) = x^{n}f(x)(c + o(1)) + O(x^{n-1}\varphi(x)) = O(x^{n-1}\varphi(x))$$

by use of (4.1) and Condition A(ii). Then by (4.9) with $m = n - 3$, or by
(4.4) if $n = 2$,

$$G_{n-3}(x) < O(x^{n-3}\varphi(x)),$$

and by Theorem 1B,

$$G_{n-2}(x) = x^{n}f'(x)(c + o(1)) + f(x)\,O(x^{n-1}) + O(x^{n-2}\varphi(x)) = O(x^{n-2}\varphi(x)),$$

so that

$$f'(x) = O(x^{-2}\varphi(x)).$$

This completes the proof of the first part of Theorem 2. The proof of the
second part is similar.


Since $\int_0^\infty \theta(t)\,dt$ diverges, in place of (4.10) we have

(4.17)  $x^{k} = o(x^{k}\varphi(x)) \qquad (k \ge 0).$

In place of (4.11),

$$\sum_{i=0}^{a} A_i x^{i}\int_0^x f(t)\,O(t^{a-i})\,dt = o(x^{a}\varphi(x)).$$

Hence (3.3) gives us, in place of (4.12),

(4.18)  $$G_a(x) = x^{n}f^{(n-a-1)}(x)(c + o(1)) + \sum_{i=0}^{n-a-2} f^{(i)}(x)\,O(x^{a+i+1}) + o(x^{a}\varphi(x)) \qquad (1 \le a \le n - 1).$$

The remainder of the proof is exactly parallel to the proof of the first part of
the theorem, part $B_2$ of Theorem 1B being used instead of part $B_1$. The details
are left to the reader.


5. Conclusion. We have stated our theorems with $x \to \infty$. It is clear that
they hold, with obvious modifications, when $x \to 0+$. The proofs are given
most simply by modifying the reasoning directly, rather than by a change of
variable. The classes $\uparrow$ and $\downarrow$ are replaced by the classes of functions non-decreasing or non-increasing as $x \to 0+$; that is, $\uparrow$ becomes $\downarrow$ and reciprocally.
The class $K$ is replaced by the class of functions $g(x)$ with $g(1/x) \in K$. Since it
may not be quite evident what becomes of Theorem 2, we state the modified
theorem in detail.


THEOREM 3. Let 


Lif(z)] = > paz) f (2), 


\ . pz) = > b;2’, bo = 1. 


7=0 


Let $f(x)$ of class $C^{n}$ on $0 < x < \infty$ be a solution of the differential equation $L[f(x)] = g(x)$. Let $\theta(1/x)$ and $\varphi(1/x)$ satisfy Conditions A. If, as $x \to 0+$,


f(z) = O(6(2)), 
i g(t)dt < O(x"** g(z)), 


then 
f(z) = O(@ ’"'¢(z)) 
for $p = 1, 2, \cdots, n - 2$; and also for $p = n - 1$ if in addition


(5.1) g(x) < O(x "0(z)). 
If also 
f(x) = o(@(z)) 
and 
[ at ')dt = x, 
0 
then 


f(z) = o(@”™ g(a) 
for $p = 1, 2, \cdots, n - 2$, and also for $p = n - 1$ if (5.1) is satisfied.


HARVARD UNIVERSITY.

























TAUBERIAN THEOREMS RELATED TO BOREL AND ABEL 
SUMMABILITY 


By Morris L. Kales


The following work is divided into two sections. The first deals with a 
Tauberian theorem of summability methods related to Borel summability, and 
the second with a Tauberian theorem of summability methods related to Abel 
summability. 

In 1925 Robert Schmidt gave a proof of the following theorem [6]:¹

If a series is Abel summable to the value $s$, and if the partial sums $s_n$ satisfy the
condition

$$\liminf\,(s_m - s_n) \ge 0 \quad\text{whenever}\quad \frac{m-n}{n} \to 0 \quad (m > n),$$

then

$$\lim_{n\to\infty} s_n = s.$$

In the same year Schmidt gave a proof of an analogous theorem concerning
Borel summability. This theorem states [7]:

If a series is Borel summable to the value $s$, and if the partial sums $s_n$ satisfy
the condition

$$\liminf\,(s_m - s_n) \ge 0 \quad\text{whenever}\quad \frac{m-n}{\sqrt{n}} \to 0 \quad (m > n),$$

then

$$\lim_{n\to\infty} s_n = s.$$

In the following two years Vijayaraghavan [10, 11] gave a new and more
elementary proof for each of these theorems.

It will be observed that in each of the preceding cases the method of summability is a power series method, and that the condition imposed on the
sequence of partial sums is of the following type:

(i)  $$\liminf\,(s_m - s_n) \ge 0 \quad\text{whenever}\quad \frac{m-n}{\varphi(n)} \to 0 \quad (m > n),$$

where $\varphi(n)$ is an increasing function of $n$ which tends to $\infty$ with $n$.
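For orientation, the two power series methods named above can be tried on a series that does not converge. The sketch below is an illustration only: it uses the standard definitions of the Abel mean $(1-x)\sum s_n x^{n}$ and the Borel mean $e^{-x}\sum s_n x^{n}/n!$, applies them to the partial sums of $1 - 1 + 1 - \cdots$, and the function names are hypothetical.

```python
import math

def abel_mean(s, x, terms=20000):
    # (1 - x) * sum_{n >= 0} s(n) x^n, truncated after `terms` terms
    total, power = 0.0, 1.0
    for n in range(terms):
        total += s(n) * power
        power *= x
    return (1.0 - x) * total

def borel_mean(s, x, terms=400):
    # e^{-x} * sum_{n >= 0} s(n) x^n / n!, truncated; each term is
    # obtained from the previous one to avoid large factorials
    total, term = 0.0, math.exp(-x)
    for n in range(terms):
        total += s(n) * term
        term *= x / (n + 1)
    return total

def grandi_partial_sum(n):
    # Partial sums of 1 - 1 + 1 - ... are 1, 0, 1, 0, ...
    return 1.0 if n % 2 == 0 else 0.0

if __name__ == "__main__":
    print(abel_mean(grandi_partial_sum, 0.999))
    print(borel_mean(grandi_partial_sum, 30.0))
```

Both means approach 1/2 although the partial sums oscillate. This does not conflict with the theorems quoted: here $s_{n+1} - s_n = -1$ for every even $n$, so the Tauberian condition fails.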


Received June 30, 1937. The author is indebted to Professors J. D. Tamarkin and
O. Szász, who suggested this problem to him.

1 The numbers refer to the list of references at the end of this paper. For an extensive 
bibliography on the subject of Tauberian theorems see N. Wiener, Tauberian theorems, 
Annals of Mathematics, (2), vol. 33 (1932), pp. 1-100. 











These facts suggest an interesting possibility of extension. It is natural to
ask: corresponding to any increasing function $\varphi(n)$ which tends to infinity
with $n$, does there exist a power series method of summability such that every
series which is summable by that method, and whose partial sums satisfy condition (i), is convergent to the same value $s$? In particular we may take the case
when $\varphi(n) = n^{\alpha}$ $(0 < \alpha < 1)$. [Cf. J. M. Hyslop, 2.] This is a special case of
Theorem I which is proved in the first part of this paper. When $\alpha = \tfrac12$, we
obtain the case of Borel summability as a special case, but the case of Abel
summability, for which $\alpha = 1$, is not included in Theorem I. The latter case
is the point of departure for Theorem II, which is proved in the second part of
this paper.

I 

By combining the methods of Vijayaraghavan and Valiron [9] I have been
able to prove the following theorem which I now proceed to formulate.

Let

$$\sum_{n=1}^{\infty} g(n)e^{-G(n)}x^{n} = F(x) \qquad (x > 0)$$

be a power series with radius of convergence $R$. ($R$ may be finite or infinite.)
Let the function $G(x)$ satisfy the following conditions:
I. $G(x)$ has a second derivative $G''(x)$ which is positive and which tends monotonically to zero as $x$ tends to infinity.
II. There exists an increasing function $\psi(x)$, which tends to infinity as $x \to \infty$,
such that
¥"(a) fl : / V(2) 
= ] whenever {|2%1—2/| 3 - 
G(x) + *"\ Wa) sia V G(x) 


III. There exists a decreasing function $H(x) \sim G''(x)$ $(x \to \infty)$ such that
$2/H(x)$ has an inverse $K(x)$ which has a continuous second derivative and
satisfies for all large $x$ relations of the form

(i) $AK(x) < xK'(x) < BK(x)$,

(ii) $x^{2}\,|K''(x)| < CK(x)$,

where $A$, $B$, $C$ are positive constants.
IV. lim ~ = = 0 


~o | 
Finally, let the function $g(x)$ be defined as follows:
V. $g(x) = x^{\sigma}L(x) > 0$,
where $\sigma$ is any real number and $L(x)$ satisfies the condition

$$\lim_{x\to\infty}\frac{L(\lambda x)}{L(x)} = 1 \quad\text{for every fixed } \lambda > 0.$$




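THEOREM I. Let the functions $G(x)$ and $g(x)$ satisfy relations I-V inclusive. If

$$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=1}^{\infty} s_n g(n)e^{-G(n)}x^{n} = s \qquad (s \text{ finite}),$$

and if

$$\liminf\,(s_m - s_n) \ge 0 \quad\text{whenever}\quad (m - n)\sqrt{G''(n)} \to 0 \quad (m > n),$$

then

$$\lim_{n\to\infty} s_n = s.$$

For the cases when $g(x) \sim 1$ as $x \to \infty$, or $g(x) = x^{\sigma}L(x)$ with $\sigma > 0$, condition
IV is not required.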
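To see how the condition $(m-n)\sqrt{G''(n)} \to 0$ relates to the Borel condition $(m-n)/\sqrt{n} \to 0$, one can look at the weights $1/n!$ of the Borel method. The sketch below is an illustration under stated assumptions, not the paper's argument: it takes $g = 1$ and $G(x) = \log\Gamma(x+1)$ (so that $e^{-G(n)} = 1/n!$), and uses the second difference of $G$ as a stand-in for $G''(n)$; the function names are hypothetical.

```python
import math

def G(x):
    # Assumed choice for the Borel weights: exp(-G(n)) = 1 / n!
    return math.lgamma(x + 1.0)

def second_difference(n):
    # G(n + 1) - 2 G(n) + G(n - 1), a discrete stand-in for G''(n);
    # for this G it equals log(1 + 1/n)
    return G(n + 1) - 2.0 * G(n) + G(n - 1)

if __name__ == "__main__":
    for n in (10, 100, 1000, 10000):
        # n * G''(n) should be close to 1, i.e. sqrt(G''(n)) ~ 1 / sqrt(n)
        print(n, n * second_difference(n))
```

Under this assumption $\sqrt{G''(n)}$ behaves like $n^{-1/2}$, which matches the case $\alpha = \tfrac12$ mentioned in the introduction.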

The proof of this theorem depends upon a number of lemmas which are 
analogous to results obtained by Valiron and Vijayaraghavan. Before turning 
to these lemmas, it will be useful to enumerate a few of the immediate conse- 
quences of conditions I-V which can be easily verified. 

(A) Condition II implies that

$$\lim_{x\to\infty} x^{2}G''(x) = \infty.$$

(B) Without any additional assumption, the function $\psi(x)$ of condition II may
be replaced by any function $\varphi(x)$ which tends to infinity as $x \to \infty$ and satisfies
$\varphi(x) \le \psi(x)$.


(C) The series

$$\sum_{n=1}^{\infty} e^{-G(n)}x^{n} = F_1(x), \qquad \sum_{n=1}^{\infty} g(n)e^{-G(n)}x^{n} = F(x)$$

have the same radius of convergence.
($D_1$) If $g(x) = x^{\sigma}L(x)$ with $\sigma > 0$, or $g(x) \sim 1$ as $x \to \infty$, then there exists a
positive constant $K$ such that

$$\frac{g(n)}{g(m)} < K \quad\text{if}\quad 1 \le n \le m.$$

($D_2$) If $g(x) = x^{\sigma}L(x)$, where $\sigma$ is any real number, there exist positive constants $K$ and $\alpha$ such that

$$\frac{g(m)}{g(n)} < K\Big(\frac{m}{n}\Big)^{\alpha} \quad\text{if}\quad 1 \le n \le m.$$










In proving C and D use is made of the following representation of the function $L(x)$ which is due to Karamata [3, p. 45]:

(1)  $$L(x) = c(x)\exp\int_1^x \frac{\epsilon(t)}{t}\,dt,$$

where

(2)  $\lim_{x\to\infty} c(x) = c > 0$

and

(3)  $\lim_{x\to\infty} \epsilon(x) = 0.$

Also, from (1), (2), and (3) it follows that

(4)  $$\lim_{x\to\infty}\frac{L(\lambda x)}{L(x)} = 1$$

uniformly for all $\lambda$ in any finite interval not containing the origin.
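The limit $L(\lambda x)/L(x) \to 1$ can be illustrated with the simplest slowly varying function. The sketch below is an illustration only: the choice $L(x) = \log x$ and the contrasting power $x^{0.1}$ are examples chosen here, not taken from the paper, and the helper name is hypothetical.

```python
import math

def scaling_ratio(L, lam, x):
    # L(lam * x) / L(x); tends to 1 as x grows when L is slowly varying
    return L(lam * x) / L(x)

def small_power(t):
    # A function that is NOT slowly varying: the ratio stays at lam ** 0.1
    return t ** 0.1

if __name__ == "__main__":
    for x in (1e2, 1e10, 1e50, 1e100):
        print(x, scaling_ratio(math.log, 5.0, x), scaling_ratio(small_power, 5.0, x))
```

For the logarithm the ratio equals $1 + \log\lambda/\log x$, so the convergence is slow but uniform for $\lambda$ in a bounded interval away from the origin; for the power it stays at $\lambda^{0.1}$.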
Let $\xi = [y + 1]$, where $y = y(x)$ is the solution of the equation

$$G'(y) = \log x.$$

We are now ready to prove
LEMMA 1. Let $N_1 = N_1(\xi)$ and $N_2 = N_2(\xi)$ be two positive integers which
satisfy the relations

(i) $N_2 - \xi = \xi - N_1 \qquad (N_2 > \xi),$

(ii) $\lim_{\xi\to\infty} (N_2 - \xi)^{2}G''(N_2) = \infty.$

Then

$$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=N_1+1}^{N_2-1} g(n)e^{-G(n)}x^{n} = 1.$$

Lemma 1 is an immediate consequence of Lemmas 1a and 1b which follow.


LEMMA 1a.

$$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=1}^{N_1} g(n)e^{-G(n)}x^{n} = 0.$$
Let $M$ be any positive integer such that $1 \le M < \xi - 1$. Writing $T_n(x)$
for $e^{-G(n)}x^{n}$ and following the method of Valiron [9], we can easily show that

(1)  $$\frac{T_{M-p}}{T_{\xi-1-p}} \le e^{-\frac12 (M-\xi+1)^{2}G''(\xi)} \qquad (0 \le p \le M - 1).$$

To prove this, we first apply Taylor's expansion and get

(2)  $$G(M - p) = G(\xi - 1 - p) + (M - \xi + 1)G'(\xi - 1 - p) + \frac{(M - \xi + 1)^{2}}{2}G''(M_1),$$

where $M - p < M_1 < \xi - 1 - p$, and then make use of the facts that $G''(x)$
is a decreasing function of $x$, that $G'(x)$ is an increasing function of $x$, and that

$$\log x = G'(y) \ge G'(\xi - 1) \ge G'(\xi - 1 - p).$$

Now let us consider the case where $g(x) = x^{\sigma}L(x)$ with $\sigma > 0$, or $g(x) \sim 1$
as $x \to \infty$. Then by $D_1$

(3)  $$\frac{g(M - p)}{g(\xi - 1 - p)} < K.$$

Combining (1) and (3) and placing $M = N_1$ we get

(4)  $$\frac{g(N_1 - p)T_{N_1-p}}{g(\xi - 1 - p)T_{\xi-1-p}} < Ke^{-\frac12 (N_1-\xi+1)^{2}G''(\xi)} \qquad (0 \le p \le M - 1).$$

Hence

(5)  $$\sum_{n=1}^{N_1} g(n)T_n(x) = \sum_{p=0}^{N_1-1} g(N_1 - p)T_{N_1-p} < Ke^{-\frac12 (N_1-\xi+1)^{2}G''(\xi)}\sum_{p=0}^{N_1-1} g(\xi - 1 - p)T_{\xi-1-p} < Ke^{-\frac12 (N_1-\xi+1)^{2}G''(\xi)}F(x).$$

Since $\xi < N_2$, and therefore $G''(\xi) \ge G''(N_2)$, it follows from (i) and (ii)
that

(6)  $$\lim_{\xi\to\infty} (N_1 - \xi + 1)^{2}G''(\xi) = \infty.$$

Combining (5) and (6) we have

(7)  $$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=1}^{N_1} g(n)T_n(x) = 0.$$


Now let us consider the case where g(x) = 2x°L(zx) and oa is any real number. 
Then by D, positive constants K and @ exist such that 


g(M — p) 


. K(é — 1 — p)* < Ke 
(8) ¢-1-p ~** wee 


(ls Mséi-1;p=0,1,2,---,M — 1). 
Combining (1) and (8) we have 


(M — p)Tu_» pa — 4 M—E+1)2G" (8) 

(9) : 5 P< Kée : 

g(§ — 1 — p) Tr» ‘ 

Let 6 be a fixed positive number such that N = [(1 — 4)t] + 1. Then writing 


(10) > y(n) Ta(z) = > g(n)T, + es >> g(n)T, = 1, + Te, 


Fe x) ‘4 
we obtain from (9) 


(11) I, < ree < Ke hee @talogé 


Fa 










But by condition IV and A 




(12) lien Se a'"(t) — alogé = lim Fores _ sent = 0 
Thus, combining (11) and (12), we get 
(13) lim J; = 0. 
z->R 
Since $\lim_{x\to\infty} L(\lambda x)/L(x) = 1$ uniformly for $\lambda$ in any finite interval not containing the origin, it follows that, for sufficiently large $\xi$, if $(1 - \delta)\xi \le n \le m \le \xi$,
then


g(n) _ n° L(n) 


(14) g(m) ~—-m? L(m) 


1 |e] 
<2(,1 ,) 


K. 


Setting M = N, in (1) and combining with (14) we see that 


*rcg) 


(0 < P < Ni _ (1 = 5)é). 


Dd g(Ni — p)Tx,-» < K exp [-3(N1 — € + 1°") 


> gl —1 — p)Te1-» < K exp[—}(M1 — € + 1)°@"(. 


o g(N, - p)T x ~p > —}(N,—§+1 24 
(15) : < Ke . 
gg = I aa Pp) Tr1~-p 
Hence 
1 N\—N-1 
I, = 
2 F(x) p=0 
(16) 1 
F(x) p=0 
From (16) we see that 
(17) lim J. = 0. 
rR 
Combining (13) and (17) gives 
; 1 < 
XS e = (). 
(18) lim F(a) > q(n) T(x) 


It will be observed that in proving Lemma 1a, condition IV was not required
for the case where $g(x) = x^{\sigma}L(x)$ with $\sigma > 0$, or $g(x) \sim 1$. Since this is the
only place where condition IV is used, it follows that condition IV is not required for the theorem if $g(x) = x^{\sigma}L(x)$ with $\sigma > 0$, or $g(x) \sim 1$.


LEMMA 1b.

$$\lim_{x\to R}\frac{1}{F(x)}\sum_{m=N_2}^{\infty} g(m)e^{-G(m)}x^{m} = 0.$$

If we apply the theorem of the mean twice, it is easy to show that

(1)  $$\frac{T_m}{T_n} \le e^{-(m-n)(n-\xi)G''(n)} \qquad (\xi \le n \le m).$$

In particular it follows from (1) that $T_m/T_n \le 1$; i.e., the terms $T_n(x)$ are
monotone decreasing with respect to $n$ for $n \ge \xi$.

















By $D_2$ there exist positive constants $K$ and $\alpha$ such that

(2)  $$\frac{g(m)}{g(n)} < K\Big(\frac{m}{n}\Big)^{\alpha} \qquad (1 \le n \le m).$$

Hence

(3)  $$\frac{g(m)T_m}{g(n)T_n} < K\Big(\frac{m}{n}\Big)^{\alpha}e^{-(m-n)(n-\xi)G''(n)}.$$

Thus
_ gT, g(m)T , 
a Fe =e Hm) Tm = "Fa) Xs o(n)T 
< Kg(n)T,, = my (m—n) (n—§)G"'(n) 
F(z) mon\n n 
But 
> ("ye (m—nin-BG""() <9 ap oat wi (m — = aati 
(5) m=n n = n= 
5 
1 
aes AL =e" + (n peers} 
= “a 
Combining (4) and (5) we have 
KK.g(n)T,, | 
) Fay 290m < RGA {i+ +ex=arwtoz areeraye) 
Now by (2) we have, if & S n S m, 
- g(n)T.,, 1 sy 
@) g(m)T,, J K (2 i 


Let N = [(1 + 4)é], where 6 is a fixed positive number such that 0 < 6 < 1, 
and let M be any positive integer such that & < M < N. Then 


g(n)T,, . n\" 
2 g(n)T, = g(M)Tu > (MT, > 9TH Le (i) 


(8) = wi ¢ 
> WME n oy o(,2 ) 
5° 
Thus 
M r a r a 
(9) g(M)T 4 < > WT K(1 + 4) < K(1 + 4) 


F(x) & F(z) M-é M—: 
Let K°K,(1 + 6)* = K,. Thenif N < N2, we set n = N in (6) and M =N 


in (9), and combining the two we get 


Ke ) z g(m) Tn <j (x) Lm g(m) Tn 


" 1 1 , 
< Kil y —ét + (N ~H2G"(N) * ((N — £)2@"( a (N s N3). 


(10) 










If Ne < N, we may set M = Nz in (9) and n = N- in (6), and combining we 
obtain 


] ] ] 
F(z) b4 g(m) Tn < Kily, nay” + WN, He t)?@’’(N2) 


1 
¥ {(¥: — 8G") | 


Since by A, $\lim_{x\to\infty} x^{2}G''(x) = \infty$, it follows that


(11) 
| ove 


— lim iw — 52G”"(N) = B@ eG") ~ 


Thus from (12) and condition (ii) of Lemma 1, it follows that the right sides
of (10) and (11) tend to zero as $\xi \to \infty$. Hence we conclude


(13) lim rG D > g(m) T, = 0. 


LEMMA 2. Under the conditions of Lemma 1,

$$\lim_{x\to R}\frac{\sqrt{G''(N_2)}}{F(x)}\sum_{m=N_2}^{\infty} (m - N_2)\,g(m)e^{-G(m)}x^{m} = 0.$$


By (3) in Lemma 1 we have 


a _ g(n)T,, n) 9 m)T » 
ri 2 d > (m — n)g(m)T,, F(a) Em - ) o(n)T.. 
< Ko(nT. ~ =) go (mm (n— BG" (n) 
(1) F(x) x (m n)( 


KK.g(n)T 1 ‘ 1 \ 
F(z) {(n — §)G"(n)}? (n — §)?#22G""(n)2+2)° 


Let N be defined as in Lemma 1,,. Then if N S Ne, we have 


a ~ : (m — N2)g(m)T,, S sy le > (m — N)g(m)T 


<K { : + } 
"UCN — €)?G’(N) * ((N — &)?G"(N)} 4)’ 


(2) 


and if No = N 


VG (No) >> (m — N2)g(m)T,, 
(3) F(z) m=Ne 


<xKJ . I \ 
"\(N2 ae t)? GIN. 2) y ° ° 


1(N2 — &)?G"(Ns) | '** 











From (2) and (3) we conclude, as in Lemma 1b, that

(4)  $$\lim_{x\to R}\frac{\sqrt{G''(N_2)}}{F(x)}\sum_{m=N_2}^{\infty} (m - N_2)\,g(m)T_m = 0.$$

The following lemma concerning the slowly decreasing sequence {s,} can be 
proved in precisely the same manner as the analogous lemma proved by Vija- 
yaraghavan [11, p. 319, Lemma ¢]: 

LEMMA 3. If $\liminf\,(s_m - s_n) \ge 0$ whenever $(m - n)\sqrt{G''(n)} \to 0$ $(m > n)$,
then to every positive number $c$ there corresponds a positive number $k = k(c)$ such
that

$$s_m - s_n > -\{k(m - n)\sqrt{G''(n)} + c\}.$$


LEMMA 4. Under the conditions of Theorem I, if

$$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=1}^{\infty} s_n g(n)e^{-G(n)}x^{n} = s \qquad (s \text{ finite}),$$

and if $\liminf\,(s_m - s_n) \ge 0$ whenever $(m - n)\sqrt{G''(n)} \to 0$ $(m > n)$, then

$$s_n = O(1).$$

With the aid of Lemmas 1, 2, and 3 this lemma can be proved in the same
way as Lemma 1 of Vijayaraghavan's paper on Borel summability [11].

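LEMMA 5. Under the conditions of Theorem I, if $\{A_n\}$ be a bounded sequence
of numbers, and if

$$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=1}^{\infty} A_n g(n)e^{-G(n)}x^{n} = A,$$

then

$$\lim_{x\to R}\frac{1}{F_1(x)}\sum_{n=1}^{\infty} A_n e^{-G(n)}x^{n} = A.$$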

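Let $\delta$ be a positive number such that $0 < \delta < 1$. Let $N_1 = [(1 - \delta)\xi]$ and
$N_2 = [(1 + \delta)\xi]$. The conditions of Lemma 1 are satisfied by these definitions of $N_1$ and $N_2$. Hence

(1)  $$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=N_1+1}^{N_2-1} g(n)e^{-G(n)}x^{n} = 1,$$

and

(2)  $$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=1}^{N_1} g(n)e^{-G(n)}x^{n} = \lim_{x\to R}\frac{1}{F(x)}\sum_{n=N_2}^{\infty} g(n)e^{-G(n)}x^{n} = 0.$$

Since the sequence $\{A_n\}$ is bounded, it follows that

(3)  $$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=1}^{N_1} A_n g(n)e^{-G(n)}x^{n} = \lim_{x\to R}\frac{1}{F(x)}\sum_{n=N_2}^{\infty} A_n g(n)e^{-G(n)}x^{n} = 0$$

and

(4)  $$\lim_{x\to R}\frac{1}{F(x)}\sum_{n=N_1+1}^{N_2-1} A_n g(n)e^{-G(n)}x^{n} = A.$$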


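As was noted under D we have

(5)  $$\lim_{x\to\infty}\frac{L(\lambda x)}{L(x)} = 1$$

uniformly for all $\lambda$ in any finite interval not containing the origin. Now if
$N_1 + 1 \le n \le N_2 - 1$ then

(6)  $(1 - \delta)\xi \le n \le (1 + \delta)\xi,$

and therefore by (5)

(7)  $$1 - \epsilon(x) < \frac{L(n)}{L(\xi)} < 1 + \epsilon(x) \qquad (N_1 + 1 \le n \le N_2 - 1;\ \epsilon(x) \to 0,\ x \to R).$$

If we let $\delta_1 = \delta$ for $\sigma \ge 0$ and $\delta_1 = -\delta$ for $\sigma < 0$, then

(8)  $$(1 - \delta_1)^{\sigma} \le \Big(\frac{n}{\xi}\Big)^{\sigma} \le (1 + \delta_1)^{\sigma}.$$

Combining (7) and (8) we get

(9)  $$(1 - \epsilon(x))(1 - \delta_1)^{\sigma} < \frac{g(n)}{g(\xi)} < (1 + \epsilon(x))(1 + \delta_1)^{\sigma}.$$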

Since the A, are bounded and the summability methods under consideration 
are regular, there is no loss of generality in assuming that the A, are positive. 
Hence assuming that A, > 0 we get from (9) 


(é) N21 1 No-l 
(I — 6.)"(1 _= e(zx)) 9 2, A,e a), wo > A,g(n)e oz" 


F(x) s < F(z) Nyvl 
(10) 
e g(é) © gis ~G(n) n 
< (1 + 4,)°(1 + ¢(z)) > Aes. 
F(x) A 
Hence 
No—1 Ne—1 
a1) 1 — 8) fim 2®. © Ave 2" 5 A (148, lim 98) 4c 2" 
zk F(x ) Mm rR F(x ) m1 
If in (1) we place g(n) = 1, then F(x) = F,(x) and we have 
Nq—1 
(12) lim — 2 eos" = 1. 


z—R F(z) Ny+1 


If now we place A, = 1 in (11), then A = 1, and combining with (12) we get 


PF i(x) li o 9(E)F i(z) 
bm Fay 1 SO + 


(13) (1 — 6,)° lin 




















Since 6 is arbitrary, we may set 6, = 0 in (13) and thus conclude that 


g(é) 1 a 
(14) F(z) ~ F(a) (x R). 
Substituting this result in (11) we have 


No-1 
(1 — 6,)’ lim aiken 7 ae 
zor F(x) S11 
- li sy (n) 
< A < 1 5, ¢ im _ Ane —Gi(n xz" 
ie < ( - i= AG p> 


If in (3) we set ~ 1, then F(x) = F,(x) and we have 


—G(n) n oto) n 
(16) lim FG@ p> aa him Fr we a 
Combining (15) and ge we have 


ae Gin), n <A < (1 + 6,)° lim - Ave —G(n) n 


(17) (1 — 6,)° lim 


De ) ‘7 zk Fi ) ‘7 
And since we may place 6, = 0 in (17), we conclude that 
on), n 
(18) lim re 2 D Ase = A. 


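LEMMA 6. Let $\{A_n\}$ be a bounded sequence of numbers, $|A_n| < K$, and
suppose that

$$\lim_{x\to R}\frac{1}{F_1(x)}\sum_{n=1}^{\infty} A_n e^{-G(n)}x^{n} = A.$$

Let the function $A(x)$ be defined as follows:

(i)  $A(x) = A_{[x]} + (x - [x])(A_{[x]+1} - A_{[x]}) \quad (x \ge 0), \qquad A(x) = 0 \quad (x < 0).$

Finally, let $H(x) \sim G''(x)$ as $x \to \infty$. Then

$$\lim_{x\to\infty}\sqrt{\frac{H(x)}{2\pi}}\int_{-\infty}^{\infty} A(t + x)e^{-\frac12 H(x)t^{2}}\,dt = A.$$

Since $H(x) \sim G''(x)$, we have

(1)  $H(x) = G''(x) + \epsilon(x)G''(x) \qquad (\epsilon(x) \to 0,\ x \to \infty).$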
By B, we may assume that the function ¥(z) of II satisfies the conditions: 
1 
(2) (x) = of \ 
p(x) 


ad 4) = Lance) 










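as $x \to \infty$. Let

(4)  $$N_2 - \xi = \xi - N_1 = \left[\frac{\psi(\xi)}{\sqrt{G''(\xi)}}\right].$$

From (4) and II we have

(5)  $\sqrt{G''(N_2)} \sim \sqrt{G''(\xi)} \qquad (\xi \to \infty).$

Hence

(6)  $$\lim_{\xi\to\infty} (N_2 - \xi)\sqrt{G''(N_2)} = \lim_{\xi\to\infty} (N_2 - \xi)\sqrt{G''(\xi)} = \infty.$$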
We may then apply Lemma 5 and get 
(7) tim —) > Ae 2” = A. 


z—R F(x) 1 


By (16) in Lemma 5, we have 


N2-1 


° 1 G(n) nn 
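Now

(9)  $$\frac{T_n(x)}{T_\xi(x)} = e^{-G(n)+G(\xi)+(n-\xi)\log x},$$

and by Taylor's theorem

(10)  $$G(n) = G(\xi) + (n - \xi)G'(\xi) + \frac{(n - \xi)^{2}}{2}G''(\xi_n) \qquad (\xi \le \xi_n \le n \ \text{or}\ n \le \xi_n \le \xi).$$

Combining (9) and (10) we have

(11)  $$T_n = T_\xi\,e^{(n-\xi)\{\log x - G'(\xi)\} - \frac12 (n-\xi)^{2}G''(\xi_n)}.$$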


Now if N; S n S Neo, then 


aes ey ee eer 

(12) En g| Ss ; gE | s Ne g | G"’(é) i 
Hence by condition II, 

- " oe} 
13 G"(é,) = G . 
(13) (E,) (€) + of ve) 
Hence 

ver ver ee\ 

14 H(é) — G’(é,) = G _ . 
(14) (é) (En) = p(E)G’’(E) + of ve) 
Combining (14) and (2) we see that 


vier oe} 
5 H —G n) = 0§- = Pp, 
(15) (é) (é ) { H(é) 



















Hence 


! — ¢)? _ ww _ 2 G 2} me 
(16) | (n — &)"{H(E) — G’(En)} | S (Ne — 8) 4oe o(1) 


uniformly for all $n$ such that $N_1 \le n \le N_2$. Now, $\log x = G'(y)$, where
$\xi = [y + 1]$. Hence, since $G'(x)$ is an increasing function of $x$,

(17)  $0 < G'(\xi) - \log x \le G'(\xi) - G'(\xi - 1).$

Hence, if N; S n S No, 

| (n — &){log « — G’()} | S (Ne — H(E"(B — G@’(—E -— 1)} 
Ss (N2 — )G"E — 1) 


ve) ” ins tle\( le 


(18) 


Combining (18) with (3) we see that

(19)  $|(n - \xi)\{\log x - G'(\xi)\}| = o(1)$

uniformly for all $n$ such that $N_1 \le n \le N_2$. From (11), (16), and (19) we get

(20)  $T_n = \{1 + \epsilon_n(\xi)\}\,T_\xi\,e^{-\frac12 (n-\xi)^{2}H(\xi)}$

(where $\lim_{\xi\to\infty} \epsilon_n(\xi) = 0$ uniformly for all $n$ such that $N_1 \le n \le N_2$). From (20)
we conclude


1 *S : T(z) — —I(n—8) 2H (8) 
lim Dd AnT.(z) = lim Dd Anf{l + enlé)Je 
(21) z—R F(z) ) wy z—-R Fy(x) n=Nj+1 
T(x) *S —Hn—§) 2H (8) 
= fet rot A, ; 
lim F(z) ba , 


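In particular, if we place $A_n = 1$, then $A = 1$, and we get from (21)

(22)  $$\lim_{x\to R}\frac{T_\xi(x)}{F_1(x)}\sum_{n=N_1+1}^{N_2-1} e^{-\frac12 (n-\xi)^{2}H(\xi)} = 1.$$

Now it is easy to show that

(23)  $$\sum_{n=N_1+1}^{N_2-1} e^{-\frac12 (n-\xi)^{2}H(\xi)} \sim \int_{-\infty}^{\infty} e^{-\frac12 t^{2}H(\xi)}\,dt = \sqrt{\frac{2\pi}{H(\xi)}}.$$

Thus, we see that

(24)  $$\frac{T_\xi(x)}{F_1(x)} \sim \sqrt{\frac{H(\xi)}{2\pi}}.$$

And combining with (21) we have

(25)  $$\lim_{\xi\to\infty}\sqrt{\frac{H(\xi)}{2\pi}}\sum_{n=N_1+1}^{N_2-1} A_n e^{-\frac12 (n-\xi)^{2}H(\xi)} = A.$$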
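Relation (23), the comparison of a Gaussian sum with the corresponding integral, is easy to test numerically. The sketch below is an illustration only: it reads (23) as $\sum e^{-\frac12 (n-\xi)^{2}H} \sim \sqrt{2\pi/H}$ for a small positive constant $H$, with the summation window $N_1 < n < N_2$ centred at an integer $\xi$; the values of $H$, $\xi$ and the window are arbitrary choices made here, and the function name is hypothetical.

```python
import math

def gaussian_sum(xi, H, half_width):
    # Sum of exp(-(n - xi)^2 * H / 2) over N1 < n < N2,
    # where N1 = xi - half_width and N2 = xi + half_width
    return sum(
        math.exp(-0.5 * (n - xi) ** 2 * H)
        for n in range(xi - half_width + 1, xi + half_width)
    )

if __name__ == "__main__":
    xi, H = 1000, 0.01
    # half_width = 100 is ten standard deviations (1 / sqrt(H) = 10)
    print(gaussian_sum(xi, H, 100), math.sqrt(2.0 * math.pi / H))
```

The agreement is close once the window covers many multiples of $H^{-1/2}$, which is the role played in the text by the factor $\psi(\xi) \to \infty$ in the definition of $N_2 - \xi$.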










From (25) it is possible to conclude by the sort of argument used by Hardy and
Littlewood that [1, p. 39, Lemma 2.13]

(26)  $$\lim_{\xi\to\infty}\sqrt{\frac{H(\xi)}{2\pi}}\int_{-\infty}^{\infty} A(t + \xi)e^{-\frac12 H(\xi)t^{2}}\,dt = A.$$

And finally, by an argument which is the precise analogue of that given by
Vijayaraghavan, we can conclude from (26) that [11, p. 322, Lemma 2]

(27)  $$\lim_{x\to\infty}\sqrt{\frac{H(x)}{2\pi}}\int_{-\infty}^{\infty} A(t + x)e^{-\frac12 H(x)t^{2}}\,dt = A.$$


We now introduce the following lemma which was proved by Valiron [9,
Section 11, p. 278]:
LEMMA. Let $K(x)$ be an increasing function of $x$ which satisfies the conditions

(i) $\lim_{x\to\infty} \dfrac{K(x)}{\sqrt{x}} = \infty,$

(ii) $AK(x) < xK'(x) < BK(x),$

(iii) $x^{2}\,|K''(x)| < CK(x)$

for positive $A$, $B$, $C$ and all large $x$. Let $f(x)$ be a bounded function of $x$, and
suppose that

$$\lim_{x\to\infty}\frac{1}{\sqrt{\pi x}}\int_{-\infty}^{\infty} f(t + K(x))e^{-t^{2}/x}\,dt = s;$$

then

$$\lim_{x\to\infty}\sqrt{\frac{a}{\pi x}}\int_{-\infty}^{\infty} f(t + K(x))e^{-at^{2}/x}\,dt = s$$

for every $a \ge 1$.
From this lemma we obtain very easily 
LEMMA 7. If $A(x)$ is a bounded function of $x$, and if

$$\lim_{x\to\infty}\sqrt{\frac{H(x)}{2\pi}}\int_{-\infty}^{\infty} A(t + x)e^{-\frac12 H(x)t^{2}}\,dt = A,$$

then

$$\lim_{x\to\infty}\sqrt{\frac{aH(x)}{2\pi}}\int_{-\infty}^{\infty} A(t + x)e^{-\frac12 aH(x)t^{2}}\,dt = A$$

for every $a \ge 1$, where $H(x)$ is the function defined in condition III.

To prove this, we observe that the function $K(x)$ of condition III satisfies
the conditions of Valiron's lemma. Thus, if we replace the parameter $x$ by
$2/H(x)$ in this lemma, we get Lemma 7.

Vijayaraghavan has proved the following [11, p. 324, Lemma 4]: 







LEMMA. If $\delta$ be any positive constant and if $a \to \infty$, then

$$I_1 = \sqrt{\frac{a}{\pi x}}\int_{-\delta\sqrt{x}}^{\delta\sqrt{x}} e^{-at^{2}/x}\,dt \to 1$$

uniformly in $x$, as $a \to \infty$.
If in this lemma we make the substitution $x = 1/H(y)$, it takes the form of

LEMMA 8. If $\delta$ is any positive constant, then

$$I_1 = \sqrt{\frac{aH(y)}{\pi}}\int_{-\delta (H(y))^{-1/2}}^{\delta (H(y))^{-1/2}} e^{-aH(y)t^{2}}\,dt \to 1$$

uniformly in $y$, as $a \to \infty$.
Lemmas 1-8 furnish us with complete analogues of the lemmas which Vijayaraghavan uses in proving the Tauberian theorem of Borel summability. The
proof of Theorem I can now be completed in the same manner as that used by
Vijayaraghavan.
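The uniformity in Vijayaraghavan's lemma can be made concrete. The sketch below is an illustration only and rests on an assumption about the garbled display: it reads the lemma as $\sqrt{a/(\pi x)}\int_{-\delta\sqrt{x}}^{\delta\sqrt{x}} e^{-at^{2}/x}\,dt \to 1$. Under that reading the substitution $u = t\sqrt{a/x}$ shows that the expression equals $\operatorname{erf}(\delta\sqrt{a})$, which does not depend on $x$. The code compares a Simpson-rule evaluation with `math.erf`; the helper names are hypothetical.

```python
import math

def simpson(f, lo, hi, n=2000):
    # Composite Simpson rule with an even number n of subintervals
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * f(lo + k * h)
    return total * h / 3.0

def truncated_gaussian_mass(a, x, delta):
    # sqrt(a / (pi x)) * integral of exp(-a t^2 / x) over |t| <= delta sqrt(x)
    limit = delta * math.sqrt(x)
    integral = simpson(lambda t: math.exp(-a * t * t / x), -limit, limit)
    return math.sqrt(a / (math.pi * x)) * integral

if __name__ == "__main__":
    for x in (0.5, 3.0, 100.0):
        print(x, truncated_gaussian_mass(50.0, x, 0.3), math.erf(0.3 * math.sqrt(50.0)))
```

Since the value is $\operatorname{erf}(\delta\sqrt{a})$ for every $x$, convergence to 1 as $a \to \infty$ is automatically uniform in $x$, and the same holds after the substitution $x = 1/H(y)$.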
II 

As was stated at the beginning of this paper Theorem I contains Borel 
summability as a special case, but not Abel summability. We turn now to a 
theorem which is a generalization of the Tauberian theorem of Abel sum- 
mability. 

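THEOREM II. Let $\Phi(x)$ be an increasing function of "regular growth", which
tends to infinity with $x$;

$$\Phi(x) = x^{\alpha}L(x) \qquad \Big(\alpha \ge 0;\ \lim_{x\to\infty}\frac{L(\lambda x)}{L(x)} = 1 \text{ for every } \lambda > 0\Big).$$

Let $B(x)$ be bounded in every finite interval. Let the integrals

$$\int_0^t B(u)\,d\Phi(u), \qquad \int_0^\infty e^{-t/x}\,d\Big\{\int_0^t B(u)\,d\Phi(u)\Big\}$$

exist as Stieltjes integrals. Then if

$$\lim_{x\to\infty}\frac{1}{\Gamma(1 + \alpha)\Phi(x)}\int_0^\infty e^{-t/x}\,d\Big\{\int_0^t B(u)\,d\Phi(u)\Big\} = B \qquad (B \text{ finite}),$$

and if

$$\liminf\,(B(y) - B(x)) \ge 0 \quad\text{whenever}\quad \frac{\Phi(y)}{\Phi(x)} \to 1 \quad (y \ge x \to \infty),$$

then

$$\lim_{x\to\infty} B(x) = B.$$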


zx 


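LEMMA 1. If $\Phi(N)/\Phi(x) \to 0$, then

$$\frac{1}{\Phi(x)}\int_0^N e^{-t/x}\,d\Phi(t) \to 0.$$

For

$$\frac{1}{\Phi(x)}\int_0^N e^{-t/x}\,d\Phi(t) \le \frac{1}{\Phi(x)}\int_0^N d\Phi(t) = \frac{\Phi(N) - \Phi(0)}{\Phi(x)} \to 0$$

by hypothesis.
LEMMA 2. If $\Phi(M)/\Phi(x) \to \infty$, then

$$\frac{1}{\Phi(x)}\int_M^\infty e^{-t/x}\,d\Phi(t) \to 0.$$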

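From the representation

(1)  $$L(x) = c(x)\,e^{\int_1^x \frac{\epsilon(t)}{t}\,dt}$$

it follows that

(2)  $L(x) = o(x^{\epsilon})$

for every fixed $\epsilon > 0$, and a positive constant $K$ exists such that if $\epsilon$ is a fixed
positive number, then for all large $x$ and $M > x$ we have

(3)  $$\frac{\Phi(M)}{\Phi(x)} = \frac{c(M)}{c(x)}\Big(\frac{M}{x}\Big)^{\alpha}e^{\int_x^M \frac{\epsilon(t)}{t}\,dt} < K\Big(\frac{M}{x}\Big)^{\alpha}\Big(\frac{M}{x}\Big)^{\epsilon} = K\Big(\frac{M}{x}\Big)^{\alpha+\epsilon}.$$

From (3) we conclude that if $\Phi(M)/\Phi(x) \to \infty$, then

(4)  $\dfrac{M}{x} \to \infty.$

From (2) we have

(5)  $\Phi(x) = x^{\alpha}L(x) = o(x^{\alpha+\epsilon}) \qquad (x \to \infty).$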


Integrating by parts we have 


Lf? un ©) 20) ue 
as J me = a I ta), % ° 


(6) 
— 2M) wie 5 “ (1) tie g (‘) 
~ (x) u (x) Na} 


Substituting (3) in (6) we get 


] bi tle r M eg Mls t etl t 
wa Jy "00 < KZ) + [eva 
ate 
-~ x(* ‘) ouey fs gate e “du, 
rt 













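and since, by (4), $M/x \to \infty$, we conclude from (7) that

$$\frac{1}{\Phi(x)}\int_M^\infty e^{-t/x}\,d\Phi(t) \to 0 \qquad \Big(\frac{\Phi(M)}{\Phi(x)} \to \infty\Big).$$

LEMMA 3. If $\Phi(N)/\Phi(x) \to 0$ and $\Phi(M)/\Phi(x) \to \infty$, then

$$\frac{1}{\Gamma(1 + \alpha)\Phi(x)}\int_N^M e^{-t/x}\,d\Phi(t) \to 1.$$

It has been shown by Karamata [4, p. 296, Theorem I; p. 298, footnote 9]
that if the function $\Phi(x)$ is a function of regular growth which tends to infinity
with $x$, then

(1)  $$\lim_{x\to\infty}\frac{1}{\Gamma(1 + \alpha)\Phi(x)}\int_0^\infty e^{-t/x}\,d\Phi(t) = 1.$$

If we combine (1) with Lemmas 1 and 2, Lemma 3 follows directly.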
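Karamata's relation (1) can be checked directly in the simplest case of regular growth. The sketch below is an illustration only: it assumes $\Phi(t) = t^{\alpha}$ with $L \equiv 1$ and integer $\alpha \ge 1$ (chosen here so that the integrand is smooth), for which $\int_0^\infty e^{-t/x}\,d\Phi(t) = \Gamma(1+\alpha)x^{\alpha}$ exactly; the cutoff and the helper names are hypothetical.

```python
import math

def simpson(f, lo, hi, n=20000):
    # Composite Simpson rule with an even number n of subintervals
    h = (hi - lo) / n
    total = f(lo) + f(hi)
    for k in range(1, n):
        total += (4.0 if k % 2 else 2.0) * f(lo + k * h)
    return total * h / 3.0

def laplace_stieltjes_ratio(alpha, x, cutoff=60.0):
    # Ratio of the integral of exp(-t/x) dPhi(t) to Gamma(1 + alpha) Phi(x)
    # for Phi(t) = t^alpha, i.e. dPhi(t) = alpha t^(alpha - 1) dt.
    # The infinite range is truncated at cutoff * x, where exp(-t/x) is negligible.
    def integrand(t):
        return math.exp(-t / x) * alpha * t ** (alpha - 1.0)
    integral = simpson(integrand, 0.0, cutoff * x)
    return integral / (math.gamma(1.0 + alpha) * x ** alpha)

if __name__ == "__main__":
    for alpha in (1.0, 2.0, 3.0):
        print(alpha, laplace_stieltjes_ratio(alpha, 7.0))
```

For pure powers the ratio equals 1 for every $x$; the content of Karamata's theorem is that the limit is still 1 when a slowly varying factor $L(x)$ is present.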
Lemma 4. If 6(M)/®(x) —> ~, then 


: 2 (0) | te 
1 [*, f @@ \ ts 1 [ ®t) tie 
> —o * db 
wa Je 8 \acanh <a fe aan 
P(t) e t/* 
- sane 3 a(z)® AP. 

But for all large z, 

(t) is 
” (2) ~ K(; 
and since M/x — ~, it follows that there exists a positive number 6 satisfying 
0 < 6 < 1 such that, ift = M, 


(3) K(‘) em. 


Substituting (2) and (3) in (1) we get 


ih O) | us l ° —(-stle 


Integrating the right side of (4) by parts, we have 


&(t) 1-6 [°®() ~a-sueiz 


—(1—d) t/z e ols 
wan | ° dO = Son’ I | = pie 


(1-8) M/z P(x) [° O(t) -a-auz,(t 
-eonns 0 Ost | ae AG) 


(5) 










Since $M/x \to \infty$, and $\Phi(M)/\Phi(x) \to \infty$, it follows from (5) in precisely the same
manner as in Lemma 2 that the right-hand side of (5) tends to zero. Hence
we see that


we is 6(M) 
(6) 5() I, log (aan aia (S20 2). 


By an argument similar to that used by Vijayaraghavan [10, p. 114, Lemma 6],
if we make use of the fact that $\int_0^t B(u)\,d\Phi(u)$ exists as a Stieltjes integral, we
can prove
LEMMA 5. If $\liminf\,(B(y) - B(x)) \ge 0$ whenever $\Phi(y)/\Phi(x) \to 1$ $(y \ge x \to \infty)$,
then corresponding to any positive number $c$ there exists a positive constant $k = k(c)$
such that

$$B(y) - B(x) > -\Big\{k\log\frac{\Phi(y)}{\Phi(x)} + c\Big\}$$

for all large $x$ and $y \ge x$.

Making use of Lemmas 1-5, which correspond to lemmas in Vijayaraghavan's
paper on Abel summability [10], we can by an argument similar to Vijayaraghavan's prove

LEMMA 6. Under the conditions of Theorem II, if

$$\frac{1}{\Phi(x)}\int_0^\infty e^{-t/x}\,d\Big\{\int_0^t B(u)\,d\Phi(u)\Big\} = O(1) \qquad (x \to \infty),$$

and if

$$\liminf\,(B(y) - B(x)) \ge 0 \quad\text{whenever}\quad \frac{\Phi(y)}{\Phi(x)} \to 1 \quad (y \ge x \to \infty),$$

then

$$B(x) = O(1) \quad\text{as } x \to \infty.$$


From Lemma 6 it follows that the function $B(x)$ of Theorem II is bounded.
Hence there is no loss of generality in assuming that $B(x)$ is positive. Assuming then that $B(x)$ is positive, we write

(1)  $$A(t) = \int_0^t B(u)\,d\Phi(u).$$

Since $\Phi(u)$ is an increasing function of $u$ and $B(x)$ is positive, it follows that
$A(t)$ is an increasing function of $t$. Thus the hypothesis of Theorem II may
be put in the form


(2) [ e'*d{A@)| ~ T(1 + a)&(z) (rx @), 


where $\Phi(x) = x^{\alpha}L(x)$ $(\alpha \ge 0)$, and $A(t)$ is an increasing function of $t$.
From (2) it follows by a theorem of Karamata that [5, p. 30, Theorem I]


(3) A(x) ~ &(2z) 


(x — «), 






















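i.e.,

(4)  $$\lim_{x\to\infty}\frac{1}{\Phi(x)}\int_0^x B(u)\,d\Phi(u) = B.$$

The following lemma now enables us to complete the proof of the theorem.
LEMMA 7. If $G(x)$ is an increasing function of $x$ which tends to infinity with $x$
and which satisfies the condition

(i)  $$\lim_{x\to\infty}\frac{G(x - 0)}{G(x)} = \lim_{x\to\infty}\frac{G(x + 0)}{G(x)} = 1;$$

if

$$\lim_{x\to\infty}\frac{1}{G(x)}\int_0^x R(u)\,dG(u) = R \qquad (R \text{ finite});$$

and if

$$\liminf\,(R(y) - R(x)) \ge 0 \quad\text{whenever}\quad \frac{G(y)}{G(x)} \to 1 \quad (y \ge x \to \infty);$$

then

$$\lim_{x\to\infty} R(x) = R.$$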
This lemma is a generalization of a lemma proved by Szász [8, p. 337, (18')]
and can be proved in the same way. There is merely one point in the proof
which requires special attention. Following Szász we set up the identity

(1)  $R(x)\{G(y) - G(x)\} = \int_x^{y} R(x)\,dG(u) = \int_0^{y} R(u)\,dG(u) - \int_0^{x} R(u)\,dG(u) - \int_x^{y} \{R(u) - R(x)\}\,dG(u).$


To make the analogy complete, we should have to choose $y$ so that $G(y) =
(1 + \delta)G(x)$, where $\delta$ is a positive number. Since $G(x)$ is not necessarily a
continuous function of $x$, it is not possible in general so to choose $y$. However,
it is sufficient for the purpose of this proof if $G(y) \sim (1 + \delta)G(x)$ as $x \to \infty$.
Now since $G(x)$ tends monotonically to infinity, it is clear that $y = y(x)$ can
be found so that

(2)  $G(y - 0) \le (1 + \delta)G(x) \le G(y + 0).$

Combining (2) with (i) we see that $G(y) \sim (1 + \delta)G(x)$. The remainder of
the proof of this lemma can be carried out in precisely the same manner as in
Szász's proof.

It is easy to show that condition (i) of Lemma 7 is satisfied by the function
$\Phi(x)$, since $\Phi(x)$ is of regular growth. Hence, since

$$\lim_{x \to \infty} \frac{1}{\Phi(x)} \int_0^{x} B(u)\,d\Phi(u) = B$$

and $\lim\,(B(y) - B(x)) = 0$ whenever $\Phi(y)/\Phi(x) \to 1$ $(y \ge x \to \infty)$, it follows
from Lemma 7 that $\lim_{x \to \infty} B(x) = B$.


REFERENCES 


1. G. H. Hardy and J. E. Littlewood, Theorems concerning the summability of series by
Borel's exponential method, Palermo Rend., vol. 41 (1916), pp. 1-18.

2. J. M. Hyslop, On the summability of series by a method of Valiron, Proc. Edinburgh
Math. Soc., (2), vol. 4 (1936), pp. 218-223.

3. J. Karamata, Sur un mode de croissance régulière des fonctions, Mathematica Cluj,
vol. 4 (1930), pp. 38-53.

4. J. Karamata, Neuer Beweis und Verallgemeinerung einiger Tauberian-Sätze, Math.
Zeitschr., vol. 33 (1931), pp. 294-299.

5. J. Karamata, Neuer Beweis und Verallgemeinerung der Tauberschen Sätze, welche die
Laplacesche und Stieltjessche Transformation betreffen, Journ. für Math., vol. 164
(1931), pp. 27-39.

6. R. Schmidt, Über divergente Folgen und lineare Mittelbildungen, Math. Zeitschr., vol.
22 (1925), pp. 89-152.

7. R. Schmidt, Die Umkehrsätze des Borelschen Summierungsverfahrens, Schriften der
König. Gel. Gesell., vol. 1 (1925), pp. 205-256.

8. O. Szász, Verallgemeinerung und neuer Beweis einiger Sätze Tauberscher Art, Sitzungsb.
d. math.-phys. Klasse d. Akad. d. Wiss., München (1929), pp. 325-340.

9. G. Valiron, Remarques sur la sommation des séries divergentes par les méthodes de M.
Borel, Palermo Rend., vol. 42 (1917), pp. 267-284.

10. T. Vijayaraghavan, A Tauberian theorem, Journ. Lond. Math. Soc., vol. 1 (1926),
pp. 113-120.

11. T. Vijayaraghavan, A theorem concerning the summability of series by Borel's method,
Proc. Lond. Math. Soc., vol. 27 (1927), pp. 316-326.


Brown UNIVERSITY. 






















ASYMPTOTIC EXPRESSIONS FOR THE ZEROS OF GENERALIZED 
LAGUERRE POLYNOMIALS AND WEBER FUNCTIONS 


By VIVIAN EBERLE SPENCER


Introduction. It is the purpose of this paper to apply the close relationship
between Hermite and Laguerre polynomials to find asymptotic expressions for
the zeros $\{{}_L x_{in}\}$ of the generalized Laguerre polynomial $L_n(x, \alpha)$. The results
obtained for the largest zero ${}_L x_{nn}$ are, we believe, new; for the other zeros the
expressions are essentially equivalent to those obtained by Winston;¹ the method
of procedure, however, is in every case new and fruitful. Hermite functions
$h(x, n)$ are the special case of Weber's parabolic cylinder functions $w(x, n)$
obtained when the boundary conditions $w(\pm\infty, n) = O(x^n e^{-x^2/2})$ are imposed.
The argument is simplified and the order of some of the results improved by the
introduction of the latter functions for non-integral $n$. Hence we are led to a
discussion of the zeros $\{{}_W x_{in}\}$ of $w(x, n)$. By an elementary application of
Sturm's theory Milne's² properties of the zeros $\{{}_W x_{in}\}$ of the standard solution
are obtained, and similar properties for the zeros of the solution converging to
zero as $x$ approaches minus infinity are developed. The application of the
known asymptotic expressions for the zeros of Hermite polynomials is shown
to give directly bounds and asymptotic expressions for the zeros of Weber
functions for $n$ an arbitrary positive number. A sequence of Weber functions
$w(x, \alpha, n)$ is associated with every sequence of Laguerre functions $l(x, \alpha, n)$,
and definite separation and asymptotic relations are obtained between the zeros
of $w(x, \alpha, n)$ and $l(x, \alpha, n)$. As a consequence of these relations asymptotic
expressions for all zeros of $L_n(x, \alpha)$, for any $\alpha > 0$, are obtainable immediately
from any asymptotic expression for the zeros of the Hermite polynomial $H_n(x)$,
or for the zeros of the Laguerre polynomial proper $L_n(x, 1)$. Neumann's³
bounds for the zeros of $L_n(x, 1)$ and Zernike's⁴ asymptotic expression for the
largest zero of $H_n(x)$ are then applied to $L_n(x, \alpha)$.


1. An asymptotic expression for ${}_L x_{nn}$ for $\tfrac12 \le \alpha \le \tfrac32$. We consider the Hermite
and Laguerre polynomials satisfying respectively the differential equations

(1)  $H_n''(x) - 2x H_n'(x) + 2n H_n(x) = 0,$
(2)  $x L_n''(x, \alpha) + (\alpha - x) L_n'(x, \alpha) + n L_n(x, \alpha) = 0 \qquad (\alpha > 0).$


Received January 4, 1937; in revised form, July 8, 1937. 

¹ C. Winston, On mechanical quadratures formulae involving the classical orthogonal
polynomials, Annals of Math., vol. 35 (1934), pp. 658-677.

² A. Milne, On the roots of the confluent hypergeometric functions, Proc. Edinburgh Math.
Soc., vol. 33 (1915), pp. 48-64.

³ E. R. Neumann, Beiträge zur Kenntnis der Laguerreschen Polynome, Jahresbericht der
Deutschen Math.-Vereinigung, vol. 30 (1921), pp. 15-35.

⁴ F. Zernike, Eine asymptotische Entwicklung für die grösste Nullstelle der Hermiteschen
Polynome, Amsterdam Academy, Proc. of Sec. Sc., vol. 34 (1931), pp. 673-680.




The polynomials $H_n(x)$ and $L_n(x, \alpha)$ satisfy the relations

(3)  $H_{2n}(x) = L_n(x^2, \tfrac12); \qquad H_{2n+1}(x) = x L_n(x^2, \tfrac32).$
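Relations (3) can be checked exactly from the differential equations (1) and (2) alone. The sketch below is only an illustration: the function names are hypothetical, and both polynomials are normalised so that their lowest-order coefficient is 1, so the check is of (3) up to a constant factor. It builds the power-series coefficients of the polynomial solutions with rational arithmetic and compares them.

```python
from fractions import Fraction

def hermite_coeffs(n):
    """Coefficients a[k] of x**k for a polynomial solution of (1): H'' - 2x H' + 2n H = 0."""
    a = [Fraction(0)] * (n + 1)
    a[n % 2] = Fraction(1)  # lowest-order coefficient, normalised to 1 (an assumption)
    for k in range(n % 2, n - 1, 2):
        # Substituting the series into (1): (k+2)(k+1) a[k+2] = 2 (k - n) a[k]
        a[k + 2] = Fraction(2 * (k - n), (k + 1) * (k + 2)) * a[k]
    return a

def laguerre_coeffs(n, alpha):
    """Coefficients c[k] of x**k for a polynomial solution of (2): x L'' + (alpha - x) L' + n L = 0."""
    c = [Fraction(1)]  # constant term, normalised to 1 (an assumption)
    for k in range(n):
        # Substituting the series into (2): (k+1)(k+alpha) c[k+1] = (k - n) c[k]
        c.append(Fraction(k - n) / ((k + 1) * (k + alpha)) * c[k])
    return c

if __name__ == "__main__":
    n = 3
    # Even-degree case: H_6(x) against L_3(x^2, 1/2)
    print(hermite_coeffs(2 * n)[0::2] == laguerre_coeffs(n, Fraction(1, 2)))
    # Odd-degree case: H_7(x) / x against L_3(x^2, 3/2)
    print(hermite_coeffs(2 * n + 1)[1::2] == laguerre_coeffs(n, Fraction(3, 2)))
```

The even coefficients of the Hermite solution of degree 2n, read as a polynomial in x², obey the same recurrence as the Laguerre solution with α = 1/2, and the odd coefficients of degree 2n + 1 obey the one with α = 3/2.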
Zernike's asymptotic expression for the largest zero ${}_H x_{nn}$ of $H_n(x)$ is⁵

(4)  ${}_H x_{nn} = (2n + 1)^{1/2} - 1.8557571\,(2n + 1)^{-1/6} - 0.3443834\,(2n + 1)^{-5/6}
        - 0.168715\,(2n + 1)^{-3/2} - 0.151965\,(2n + 1)^{-13/6} + O\{(2n + 1)^{-17/6}\}.$
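Zernike's expansion can be compared with a directly computed largest zero. The following is a minimal numerical sketch, not part of the original argument: the function names are hypothetical, the zero is found by Newton's method started at (2n + 1)^{1/2} (which lies to the right of every zero of H_n), and the exponents -1/6, -5/6, -3/2, -13/6 are those of the expression above.

```python
import math

# Coefficients of Zernike's expansion as quoted in the text
A, B, C, D = 1.8557571, 0.3443834, 0.168715, 0.151965

def largest_hermite_zero(n, iterations=200):
    """Largest zero of H_n (n >= 1) by Newton's method, started at sqrt(2n + 1)."""
    x = math.sqrt(2 * n + 1)
    for _ in range(iterations):
        h_prev, h = 0.0, 1.0  # placeholder for H_{-1} (its factor is 0) and H_0
        for k in range(n):
            # Three-term recurrence H_{k+1} = 2x H_k - 2k H_{k-1}
            h_prev, h = h, 2 * x * h - 2 * k * h_prev
        step = h / (2 * n * h_prev)  # uses H_n' = 2n H_{n-1}
        x -= step
        if abs(step) <= 1e-13 * abs(x):
            break
    return x

def zernike_approximation(n):
    """Truncated expansion for the largest zero of H_n in powers of N = 2n + 1."""
    N = 2 * n + 1
    return (N ** 0.5 - A * N ** (-1 / 6) - B * N ** (-5 / 6)
            - C * N ** (-3 / 2) - D * N ** (-13 / 6))

if __name__ == "__main__":
    for n in (5, 20, 80):
        exact = largest_hermite_zero(n)
        print(n, exact, zernike_approximation(n), exact - zernike_approximation(n))
```

The printed differences give a rough picture of how quickly the truncated expansion approaches the computed zero as n grows.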
Applying (3), we obtain immediately an asymptotic expression for the largest
zero ${}_L x_{nn}$ of $L_n(x, \alpha)$ for $\alpha = \tfrac12, \tfrac32$:

(5)  $(\alpha = \tfrac12)\quad {}_L x_{nn} = 4n + 1 - 3.7115142\,(4n + 1)^{1/3} + 2.7550676\,(4n + 1)^{-1/3}
        + 0.940754\,(4n + 1)^{-1} + 0.440858\,(4n + 1)^{-5/3} + O\{(4n + 1)^{-7/3}\} = A(n, \tfrac12),$

     $(\alpha = \tfrac32)\quad {}_L x_{nn} = 4n + 3 - 3.7115142\,(4n + 3)^{1/3} + 2.7550676\,(4n + 3)^{-1/3}
        + 0.940754\,(4n + 3)^{-1} + 0.440858\,(4n + 3)^{-5/3} + O\{(4n + 3)^{-7/3}\} = A(n, \tfrac32).$
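The numerical coefficients in (5) are what one gets by squaring Zernike's expansion term by term, since by (3) the zeros of the Laguerre polynomial are the squares of the Hermite zeros. A short sketch of that arithmetic follows (the function name is hypothetical); N stands for 2n + 1 with n replaced by 2n or 2n + 1, that is, for 4n + 1 or 4n + 3.

```python
# Zernike's coefficients: x = N^(1/2) - a N^(-1/6) - b N^(-5/6) - c N^(-3/2) - d N^(-13/6) + ...
a, b, c, d = 1.8557571, 0.3443834, 0.168715, 0.151965

def squared_coefficients(a, b, c, d):
    """Coefficients of x^2 = N + c1 N^(1/3) + c2 N^(-1/3) + c3 N^(-1) + c4 N^(-5/3) + ..."""
    c1 = -2 * a                      # cross term of N^(1/2) and a N^(-1/6)
    c2 = a * a - 2 * b               # a^2 N^(-1/3) and the cross term with b
    c3 = 2 * a * b - 2 * c           # cross term a*b and the cross term with c
    c4 = b * b + 2 * a * c - 2 * d   # b^2, cross term a*c, and the cross term with d
    return c1, c2, c3, c4

if __name__ == "__main__":
    for name, value in zip(("N^(1/3)", "N^(-1/3)", "N^(-1)", "N^(-5/3)"),
                           squared_coefficients(a, b, c, d)):
        print(name, round(value, 7))
```

The first neglected product, N^{1/2} times the remainder of order N^{-17/6}, is of order N^{-7/3}, which matches the remainder term in (5).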





> 6 . . . 
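Markoff's⁶ theorem, applied to Laguerre polynomials, gives

(6)  $\dfrac{\partial\, {}_L x_{in}}{\partial \alpha} > 0 \qquad (i = 1, 2, \cdots, n).$

Then (5) and (6) yield

(7)  $(\alpha > \tfrac12)\quad {}_L x_{nn} > A(n, \tfrac12); \qquad (0 < \alpha < \tfrac32)\quad {}_L x_{nn} < A(n, \tfrac32).$

Hence

(8)  $(\tfrac12 < \alpha < \tfrac32)\quad {}_L x_{nn} = 4n + 2\alpha - 3.7115142\,(4n + 2\alpha)^{1/3} + O(1) \qquad (|O(1)| < 2).$

Note that (8) includes the case of Laguerre polynomials proper, corresponding
to $\alpha = 1$.
A more delicate analysis is necessary to extend these results to any $\alpha > 0$.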


⁵ The remainder term does not appear explicitly in Zernike's expression, but is implied
in his argument since he proves that an asymptotic expansion for ${}_H x_{nn}$ in powers of $y =
2n + 1$ exists.

⁶ J. Shohat, Théorie générale des polynomes orthogonaux de Tchebichef, Mémorial des
Sciences Math., Fasc. 66, p. 39.

⁷ By a method based on Zernike's argument for Hermite polynomials, but not utilizing
the intimate relation between $h$ and $l$ apparent in (10) and (21) below, W. Hahn, Die Nullstellen
der Laguerreschen und Hermiteschen Polynome, Dissertation, Berlin, 1933, was able
to show that $4n + 2\alpha - c_1 (4n + 2\alpha)^{1/3} < {}_L x_{nn} < 4n + 2\alpha - c_2 (4n + 2\alpha)^{1/3}$ $(\alpha > 0;\ c_1, c_2 > 0)$,
for $n$ sufficiently large. (8), which has been obtained from Markoff's theorem with scarcely
any argument, is, for $\tfrac12 < \alpha < \tfrac32$, a better result.














ZEROS OF GENERALIZED LAGUERRE POLYNOMIALS 669 


2. The zeros of Weber's parabolic cylinder functions. The Weber equation
may be written in the form⁸

(9)  $w'' + (2n + 1 - x^2)w = 0.$

When the boundary conditions $w(\pm\infty, n) = O(x^n e^{-x^2/2})$ are imposed, the only
solutions of (9) are the Hermite functions $\{h(x, \nu)\}$, $h(x, \nu) = H_\nu(x) e^{-x^2/2}$, satisfying
the differential equation

(10)  $h'' + (2\nu + 1 - x^2)h = 0 \qquad (\nu = 0, 1, 2, \cdots).$
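That the Hermite functions satisfy (10) reduces to a polynomial identity: writing h = P e^{-x²/2} and differentiating twice gives h'' + (2ν + 1 - x²)h = (P'' - 2xP' + 2νP) e^{-x²/2}. The sketch below checks that the polynomial factor vanishes for the Hermite polynomials. It is an illustration under stated assumptions: the function names are hypothetical, and the polynomials are generated by the standard recurrence H_{k+1} = 2x H_k - 2k H_{k-1}, which is not quoted in the text.

```python
def hermite_poly(nu):
    """Integer coefficients (lowest degree first) of H_nu via H_{k+1} = 2x H_k - 2k H_{k-1}."""
    prev, cur = [0], [1]
    for k in range(nu):
        shifted = [0] + [2 * c for c in cur]  # coefficients of 2x * H_k
        padded = [2 * k * c for c in prev] + [0] * (len(shifted) - len(prev))
        prev, cur = cur, [s - p for s, p in zip(shifted, padded)]
    return cur

def residual_of_10(nu):
    """Coefficients of P'' - 2x P' + 2 nu P for P = H_nu (the polynomial factor in (10))."""
    p = hermite_poly(nu)
    res = [2 * nu * c for c in p]
    for k, c in enumerate(p):
        if k >= 1:
            res[k] -= 2 * k * c            # -2x P' contributes -2k c_k x^k
        if k >= 2:
            res[k - 2] += k * (k - 1) * c  # P'' contributes k(k-1) c_k x^(k-2)
    return res

if __name__ == "__main__":
    print(hermite_poly(3))
    print(all(c == 0 for nu in range(10) for c in residual_of_10(nu)))
```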


To show this, assume that there exists for $n \ne \nu$ a solution of (9) satisfying
these boundary conditions. Multiply (9) by $h$, (10) by $w$, subtract and integrate
between any two limits $a$ and $b$:

(11)  $\bigl(w'h - wh'\bigr)\Big|_a^b = 2(\nu - n) \int_a^b wh\,dx.$

Then letting $(a, b) = (-\infty, \infty)$, we have $\int_{-\infty}^{\infty} wh\,dx = 0$, a result which contradicts
the completeness of the set $\{h(x, \nu)\}$.
Write (9) and (10) in the form

(12)  $w'' - \sigma(x, n)w = 0, \qquad \sigma(x, n) = \sigma(n) = x^2 - 2n - 1,$

(13)  $h'' - \rho(x, \nu)h = 0, \qquad \rho(x, \nu) = \rho(\nu) = x^2 - 2\nu - 1.$

Then $\rho(\nu - 1) > \sigma(n) > \rho(\nu)$ for $\nu - 1 < n < \nu$. Hence, as a consequence of
Sturm's fundamental theorem, the zeros $\{{}_W x_{in}\}$ of $w(x, n)$ separate the zeros
$\{{}_H x_{i,\nu-1}\}$ of $h(x, \nu - 1)$ and are separated by the zeros $\{{}_H x_{i\nu}\}$ of $h(x, \nu)$. It
follows that $w(x, n)$ has at least $\nu - 2$ and at most $\nu + 1$ zeros.
Consider (11) with $h(x, \nu)$ replaced by $h(x, \nu - 1)$, and $(a, b) = ({}_H x_{\nu-1,\nu-1}, k) =
(\tau, k)$, where $k$ is an arbitrary constant to be properly chosen. We have

(14)  $w'(k)h(k) - w(k)h'(k) + w(\tau)h'(\tau) = 2(\nu - 1 - n) \int_{\tau}^{k} wh\,dx.$

If $w'(x)h(x)$ and $w(x)h'(x) \to 0$ as $x \to \infty$, choose $k = \infty$. Since $h(x) > 0$
for $x > \tau$, $h'(\tau) > 0$, then $n > \nu - 1$ implies that for both members of (14)
to have the same sign it is necessary that ${}_H x_{\nu-1,\nu-1} < {}_W x_{mn}$, where ${}_W x_{mn}$ is the
largest zero of $w(x, n)$.

But if either $w'(x)h(x)$ or $w(x)h'(x)$ does not approach zero as $x \to \infty$, then
since $h(x)$ and $h'(x) \to 0$ as $x \to \infty$, $w(x)w'(x) \to \infty$ as $x \to \infty$ and hence for
some value $k$ of $x$ sufficiently large $h'(k) < 0$ and $w'(k)w(k) > 0$. Choosing
this value for $k$ in (14), and supposing sgn $w(x)$ constant for $x > \tau$, then for
$n > \nu - 1$ the sign of the left member is determined as sgn $w(x)$ and the sign
of the right member as $-$sgn $w(x)$. This leads to a contradiction. Hence
whatever the asymptotic behavior of $w(x)$ we have ${}_H x_{\nu-1,\nu-1} < {}_W x_{mn}$.


⁸ This is obtainable from the standard form $D_n'' + (n + \tfrac12 - \tfrac14 x^2)D_n = 0$ by the substitution
$w(x, n) = D_n(2^{1/2} x)$.













A similar argument applied to the interval $(a, b) = (k, {}_H x_{1,\nu-1})$ leads to the
inequality ${}_W x_{1n} < {}_H x_{1,\nu-1}$, whatever the asymptotic behavior of $w(x)$. Hence
$w(x, n)$ has either $\nu$ or $\nu + 1$ zeros.

Consider now the standard solution of the Weber equation, defined by Whittaker⁹
and discussed by Milne,¹⁰ for which $w(x) = O(e^{-x^2/2} x^p)$, $p$ a constant,
$x$ large and positive. In (11) let $(a, b) = ({}_W x_{mn}, \infty) = (\tau_1, \infty)$:

(15)  $-w'(\tau_1)h(\tau_1) = 2(\nu - n) \int_{\tau_1}^{\infty} wh\,dx.$

Since $w'(\tau_1)w(x) > 0$ for $x > {}_W x_{mn}$, then $\nu > n$ implies that for both members
of (15) to have the same sign it is necessary that ${}_W x_{mn} < {}_H x_{\nu\nu}$, and hence that
$w(x, n)$ has exactly $\nu$ zeros. Moreover, since the zeros of $w(x, n)$ are separated
by the zeros of $h(x, \nu)$, these results imply ${}_W x_{in} < {}_H x_{i\nu}$.

For the solution of the Weber equation for which $w(x)$ and $w'(x) \to 0$ as
$x \to -\infty$ a similar argument leads to the inequality ${}_H x_{1\nu} < {}_W x_{1n}$, and hence
to the conclusion that this solution has exactly $\nu$ zeros, and that ${}_H x_{i\nu} < {}_W x_{in}$.¹¹

Since the conclusions of the two preceding paragraphs cannot hold simultaneously,
it is clear that a solution of (9) for non-integral values of $n$ cannot converge
to zero at both $+\infty$ and $-\infty$.

Summing up our results, and recalling again Sturm’s oscillation theorem, 
we have: 

1. Milne's properties of the zeros $\{{}_W x_{in}\}$ of the standard solution of the Weber
equation are: If $\nu$ is an integer and $\nu - 1 < n \le \nu$, then $w(x, n)$ has $\nu$ zeros.
If $\nu$ is even, $\tfrac12\nu$ zeros are positive and $\tfrac12\nu$ negative. If $\nu$ is odd, $\tfrac12(\nu - 1)$ zeros are positive
and $\tfrac12(\nu + 1)$ negative or zero (zero only for $n = \nu$). Moreover, as $n$ increases
all zeros of $w(x, n)$ increase.

2. For the solution for which $w(x)$ and $w'(x) \to 0$ as $x \to -\infty$: If $\nu$ is an
integer and $\nu - 1 < n \le \nu$, then $w(x, n)$ has $\nu$ zeros. If $\nu$ is even, $\tfrac12\nu$ zeros are
positive and $\tfrac12\nu$ negative. If $\nu$ is odd, $\tfrac12(\nu + 1)$ zeros are positive and $\tfrac12(\nu - 1)$
negative or zero (zero only for $n = \nu$). Moreover, as $n$ increases all zeros of $w(x, n)$
decrease.

3. For any solution of the Weber equation: If $\nu$ is an integer and $\nu - 1 < n \le \nu$,
then $w(x, n)$ has either $\nu$ or $\nu + 1$ zeros. If $\nu$ is even, at least $\tfrac12\nu$ zeros are positive
and $\tfrac12\nu$ negative. If $\nu$ is odd, at least $\tfrac12(\nu - 1)$ zeros are positive and $\tfrac12(\nu - 1)$
negative.

These properties of the zeros of Weber functions enable us to extend at once
bounds for the zeros of Hermite polynomials to the zeros of any Weber function.
In particular, for the standard solution of (9), defining ${}_H x_{0,\nu-1}$ as $-\infty$,

(16)  ${}_H x_{i-1,\nu-1} < {}_W x_{in} < {}_H x_{i\nu} \qquad (\nu - 1 < n < \nu;\ i = 1, 2, \cdots, \nu).$


⁹ E. T. Whittaker, On the functions associated with the parabolic cylinder in harmonic
analysis, Proc. London Math. Soc., vol. 35 (1903), pp. 417-427.

¹⁰ A. Milne, loc. cit.

¹¹ Incidentally, this solution is obtained from the standard solution by changing $x$
into $-x$.



















Thus, Winston’s bounds for the zeros of Hermite polynomials give for »y —1 <n <v 





: 


a a5 2-¢ < fan < = (i = jv + 2,---,») 
a < | otin| < se 7) se ee eae 
iit 9: 0 < udpsin < = <a 

(17) 
= ei ad data ae ) 6=%6+3),--- r 
—_ +1 ane <|een| < 22 <$22 (i = 2,---, 4 — 1) fvodd 
ee ac * “2 : mo < ctiesens <0 





Moreover, the standard solution of the Weber equation, like the Hermite
functions, vanishes together with its derivatives for $x \to \infty$, and their differential
equations are identical in form. These being the only properties of
Hermite functions employed by Zernike in obtaining his asymptotic expression
for ${}_H x_{nn}$, we may at once write

(18)  ${}_W x_{mn} = (2n + 1)^{1/2} - 1.8557571\,(2n + 1)^{-1/6} - 0.3443834\,(2n + 1)^{-5/6}
        - 0.168715\,(2n + 1)^{-3/2} - 0.151965\,(2n + 1)^{-13/6} + O\{(2n + 1)^{-17/6}\}.$

3. Laguerre functions and associated Weber functions. In (2) let

$$l(x, \alpha, n) = x^{\alpha - 1/2} e^{-x^2/2} L_n(x^2, \alpha),$$

a transformation carrying reals into reals for any positive $\alpha$ and $x$. Then,

(19)  $l'' + \Bigl(4n + 2\alpha - x^2 - \dfrac{(\alpha - \tfrac12)(\alpha - \tfrac32)}{x^2}\Bigr) l = 0 \qquad (\alpha > 0),$

(19a)  $l'' + (4n + 2\alpha - x^2)\,l = (\alpha - \tfrac12)(\alpha - \tfrac32)\,\dfrac{l}{x^2} \qquad (\alpha > 0).$

We are thus led to consider the equation

(20)  $w'' + (4n + 2\alpha - x^2)w = 0 \qquad (\alpha > 0),$

upon which we wish to impose a boundary condition of the form $w(+\infty) =
O(x^k e^{-x^2/2})$, $k > 0$. But (20) is the Weber equation with $2n + 1$ replaced by
$4n + 2\alpha$; and the boundary conditions are those of the standard solution.
Hence, a solution of (20), satisfying these boundary conditions, exists for
every $n$ and $\alpha$; and for $n$ and $\alpha$ such that

(21)  $\nu - 1 < \dfrac{4n + 2\alpha - 1}{2} \le \nu, \qquad \nu$ a positive integer,










$w(x, \alpha, n)$ has exactly $\nu$ zeros. The set of functions $\{w(x, \alpha, n)\}$ so defined for
given $n$ and $\alpha$ shall be known as the associated Weber functions corresponding
to the Laguerre functions $\{l(x, \alpha, n)\}$.

Condition (21) may be rewritten as $\tfrac14(2\nu - 2\alpha - 1) < n \le \tfrac14(2\nu - 2\alpha + 1)$.
The characteristic numbers giving rise to the Laguerre functions are $n =
0, \tfrac12, 1, \tfrac32, \cdots$. The corresponding Laguerre functions have respectively
$0, 1, 2, 3, \cdots$ zeros, arranged symmetrically with respect to the origin. Let us
consider the associated Weber functions $w(x, \alpha, n)$, $n = 0, \tfrac12, 1, \tfrac32, \cdots$. If
$N(x)$ denote the next integer $\ge x$, these Weber functions have respectively
$N\bigl(\tfrac{2\alpha - 1}{2}\bigr),\ N\bigl(\tfrac{2\alpha + 1}{2}\bigr),\ N\bigl(\tfrac{2\alpha + 3}{2}\bigr), \cdots$ zeros. Hence, the number of zeros
of $l(x, \alpha, n)$ equals the number of zeros of $w(x, \alpha, n)$ for $0 < \alpha \le \tfrac12$, and in
general for any $\alpha > 0$ the number of zeros of $w(x, \alpha, n)$ exceeds the number of
zeros of $l(x, \alpha, n)$ by $N\bigl(\tfrac{2\alpha - 1}{2}\bigr)$.
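The zero counts in this paragraph follow from (21) by a ceiling computation, which is easy to tabulate. The sketch below is only illustrative: the function names are hypothetical, N(x) is taken to be the ceiling function as the text defines it, and exact rational arithmetic avoids rounding at the half-integers.

```python
import math
from fractions import Fraction

def weber_zero_count(n, alpha):
    """nu from (21): the integer with nu - 1 < (4n + 2*alpha - 1)/2 <= nu."""
    return math.ceil((4 * Fraction(n) + 2 * Fraction(alpha) - 1) / 2)

def laguerre_zero_count(n):
    """Zeros of l(x, alpha, n) for n = 0, 1/2, 1, 3/2, ...: namely 0, 1, 2, 3, ... (from the text)."""
    return int(2 * Fraction(n))

def excess(alpha):
    """N((2*alpha - 1)/2), the stated surplus of Weber zeros over Laguerre zeros."""
    return math.ceil((2 * Fraction(alpha) - 1) / 2)

if __name__ == "__main__":
    for alpha in (Fraction(1, 4), Fraction(1, 2), Fraction(1), Fraction(5, 2)):
        counts = [(weber_zero_count(Fraction(j, 2), alpha), laguerre_zero_count(Fraction(j, 2)))
                  for j in range(5)]
        print(alpha, counts, excess(alpha))
```

Because 2n is an integer for the characteristic numbers, the ceiling splits as 2n + N((2α - 1)/2), which is the statement in the text.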
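Write (19) and (20) in the form

(22)  $l'' - \rho(x, \alpha, n)\,l = 0, \qquad \rho(x, \alpha, n) = \rho = x^2 - 4n - 2\alpha + \dfrac{(\alpha - \tfrac12)(\alpha - \tfrac32)}{x^2},$

(23)  $w'' - \sigma(x, \alpha, n)\,w = 0, \qquad \sigma(x, \alpha, n) = \sigma = x^2 - 4n - 2\alpha.$

Then for every $x$ and $n$

(24)  $\rho \le \sigma \quad (\tfrac12 \le \alpha \le \tfrac32); \qquad \rho > \sigma \quad (0 < \alpha < \tfrac12,\ \text{or}\ \alpha > \tfrac32).$

Hence Sturm's fundamental theorem gives: $l$ oscillates more rapidly than
$w$ for $\tfrac12 < \alpha < \tfrac32$, and $w$ oscillates more rapidly than $l$ for $0 < \alpha < \tfrac12$, or
$\alpha > \tfrac32$ ($w \equiv l$ for $\alpha = \tfrac12, \tfrac32$).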
Denote the zeros of $l(x, \alpha, n)$ by $\{\zeta_i\}$, $i = 1, 2, \cdots, n$, and the zeros of
$w(x, \alpha, n)$ by $\{\delta_j\}$, $j = 1, 2, \cdots, m$, where the zeros in each set are arranged in
increasing order of magnitude.
Multiply (19a) by $w$, (20) by $l$, subtract and integrate,

(25)  $\bigl(l'w - lw'\bigr)\Big|_k^{\infty} = (\alpha - \tfrac12)(\alpha - \tfrac32) \int_k^{\infty} \dfrac{lw}{x^2}\,dx,$

where $k$ is an arbitrary constant to be properly chosen. Since $w, w', l, l' \to 0$
as $x \to \infty$, (25) becomes

(26)  $l'(k)w(k) - l(k)w'(k) = -(\alpha - \tfrac12)(\alpha - \tfrac32) \int_k^{\infty} \dfrac{lw}{x^2}\,dx.$

First, let $k = \zeta_n$; (26) reduces to

(27)  $l'(\zeta_n)\,w(\zeta_n) = -(\alpha - \tfrac12)(\alpha - \tfrac32) \int_{\zeta_n}^{\infty} \dfrac{lw}{x^2}\,dx.$













$\zeta_n$ is not a multiple zero of $l(x)$ and $l(x)$ remains positive for $x > \zeta_n$; hence
$l'(\zeta_n) > 0$. This for $0 < \alpha < \tfrac12$ or $\alpha > \tfrac32$ implies $\zeta_n < \delta_m$. Next, let $k = \delta_m$;
by similar reasoning

(28)  $l(\delta_m)\,w'(\delta_m) = (\alpha - \tfrac12)(\alpha - \tfrac32) \int_{\delta_m}^{\infty} \dfrac{lw}{x^2}\,dx.$

But $w(x)$ has no multiple zeros and $w'(\delta_m)w(x) > 0$ for $x > \delta_m$. For $\tfrac12 < \alpha < \tfrac32$
this implies $\delta_m < \zeta_n$. Hence, we have

THEOREM I. The zeros $\{\zeta_i\}$ of the Laguerre functions $l(x, \alpha, n)$ and the zeros
$\{\delta_j\}$ of the associated Weber functions $w(x, \alpha, n)$ satisfy the following separation
relations


(O<a<}) bA<ie<é, ee eee n even, 
(29) {@<a<%) GAd<biu<h, { 

(a > $) 61 <5 <6. _(%a-l 4 bn, jt= ets cee N, n odd. 

| itn ( 3 ) | 2 


4. Asymptotic expressions for the zeros of $L_n(x, \alpha)$. The results of §2 combined
with Theorem I give us a method whereby results established for the
zeros of Hermite polynomials immediately yield results for the zeros of generalized
Laguerre polynomials for any $\alpha > 0$. The zeros of Hermite functions
are identical with the zeros $\{{}_H x_{in}\}$ of the corresponding Hermite polynomials.
Between any two of these zeros there lies one and only one zero of the set
$\{{}_W x_{im}\}$ of zeros of the standard Weber function $w(x, m)$ for which $n = N(m)$.
But the associated Weber functions, $w(x, \alpha, n)$, a particular case of $w(x, m)$,
lead by Theorem I at once to relations for the zeros $\{\zeta_i\}$ of the Laguerre functions
$l(x, \alpha, n)$. Results for the zeros $\{{}_L x_{in}\}$ of $L_n(x, \alpha)$ are then obtained by
squaring the $\{\zeta_i\}$ and replacing $n$ by $\tfrac12 n$. Moreover, our method permits us to
extend results for Laguerre polynomials proper, $\alpha = 1$, to the generalized
Laguerre polynomials. In fact Winston,¹² applying Markoff's theorem, has
indicated the following method for reducing results for Laguerre polynomials
($\alpha = 1$) to results for Hermite polynomials.

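To indicate the dependence of ${}_L x_{in}$ on $\alpha$ write ${}_L x_{in} = {}_L x_{in}(\alpha)$. Then relations
(3) and (6) give

(30)  $-{}_H x_{2n+1-i,2n} = {}_H x_{i,2n} = \sqrt{{}_L x_{i-n,n}(\tfrac12)} < \sqrt{{}_L x_{i-n,n}(1)} \qquad (i = n + 1, n + 2, \cdots, 2n),$

      $-{}_H x_{2n+2-i,2n+1} = {}_H x_{i,2n+1} = \sqrt{{}_L x_{i-n-1,n}(\tfrac32)} > \sqrt{{}_L x_{i-n-1,n}(1)} \qquad (i = n + 2, n + 3, \cdots, 2n + 1).$

¹² C. Winston, loc. cit., p. 676.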







Hence, since ${}_H x_{i,2n} > {}_H x_{i,2n+1}$ $(1 \le i \le 2n)$,

(31)  $\sqrt{{}_L x_{i-n-1,n}(1)} < -{}_H x_{2n+1-i,2n} = {}_H x_{i,2n} < \sqrt{{}_L x_{i-n,n}(1)} \qquad (i = n + 1, n + 2, \cdots, 2n),$

where we define the number ${}_L x_{0n}(1) = 0$.

Thus, extending Neumann's¹³ results for Laguerre polynomials $\alpha = 1$ to
generalized Laguerre polynomials, the process of the preceding paragraph gives
i—n 2i —n+1) 


(32) 2(n - 1) < —aXen41~-i,2n = wZien < “(n+ 1) 
G@=n+I1,n+2,---, Qn). 
If we apply §2 and Theorem I, (32) gives” 


(i — 2)’ ai +1) ae 
in + 1) <7" < yy (0<a<},i=2,3,---,n), 
G@-—1)° _ 4(i + 2) Tr 

(33) dat) ~<a (<a <3,i = 1,2,---,n—}), 
2 
( — 2) (it x(7F ‘Y) 

= 2 = cee =» a 

4(n + 1) —s n+1 (a > 3,7 = 2,3, »n — 1) 


Replacing $2n + 1$ by $4n + 2\alpha$ in the asymptotic expression (18) for ${}_W x_{mn}$,
we obtain as an asymptotic expression for the largest zero $\delta_m$ of $w(x, \alpha, n)$

(34)  $\delta_m = (4n + 2\alpha)^{1/2} - 1.8557571\,(4n + 2\alpha)^{-1/6} - 0.3443834\,(4n + 2\alpha)^{-5/6}
        - 0.168715\,(4n + 2\alpha)^{-3/2} - 0.151965\,(4n + 2\alpha)^{-13/6}
        + O\{(4n + 2\alpha)^{-17/6}\} = a(\alpha, n).$


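Let us now investigate the proximity, asymptotically, of $\delta_m$ to $\zeta_n$. Replace
$n$ in (20) by $n' < n$ and $w$ by $\omega_1$:

(35)  $\omega_1'' + (4n' + 2\alpha - x^2)\,\omega_1 = 0.$

Multiply (19a) by $\omega_1$, (35) by $l$, subtract, and integrate,

(36)  $\bigl(l'\omega_1 - l\omega_1'\bigr)\Big|_k^{\infty} + 4(n - n') \int_k^{\infty} l\omega_1\,dx = (\alpha - \tfrac12)(\alpha - \tfrac32) \int_k^{\infty} \dfrac{l\omega_1}{x^2}\,dx,$

whence

$$l(k)\,\omega_1'(k) - l'(k)\,\omega_1(k) = -4(n - n') \int_k^{\infty} l\omega_1\,dx + (\alpha - \tfrac12)(\alpha - \tfrac32) \int_k^{\infty} \frac{l\omega_1}{x^2}\,dx.$$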
k 


¹³ E. R. Neumann, Beiträge zur Kenntnis der Laguerreschen Polynome, Jahresbericht der
Deutschen Math.-Vereinigung, vol. 30 (1921), pp. 15-35.

¹⁴ C. Winston, loc. cit., p. 675, has obtained results essentially equivalent to these by a
geometric argument similar to Neumann's.



















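Proceeding as in the previous section, let $k = \delta_{m'}$, the largest zero of $\omega_1$:

(37)  $l(\delta_{m'})\,\omega_1'(\delta_{m'}) = -4(n - n') \int_{\delta_{m'}}^{\infty} l\omega_1\,dx + (\alpha - \tfrac12)(\alpha - \tfrac32) \int_{\delta_{m'}}^{\infty} \dfrac{l\omega_1}{x^2}\,dx.$

Consider (37) for $0 < \alpha < \tfrac12$ or $\alpha > \tfrac32$. Assume $l(x) > 0$ for $x > \delta_{m'}$. Since
$\omega_1'(\delta_{m'})\,\omega_1(x) > 0$ for $x > \delta_{m'}$, (37) implies

$$4(n - n') \int_{\delta_{m'}}^{\infty} l\omega_1\,dx < (\alpha - \tfrac12)(\alpha - \tfrac32) \int_{\delta_{m'}}^{\infty} \frac{l\omega_1}{x^2}\,dx < \frac{(\alpha - \tfrac12)(\alpha - \tfrac32)}{\delta_{m'}^2} \int_{\delta_{m'}}^{\infty} l\omega_1\,dx,$$

$$4(n - n') < \frac{(\alpha - \tfrac12)(\alpha - \tfrac32)}{\delta_{m'}^2}.$$

This inequality will be contradicted if

(38)  $n' \le n - \dfrac{(\alpha - \tfrac12)(\alpha - \tfrac32)}{4\delta_{m'}^2}.$

Hence, for $n'$ satisfying (38), $\delta_{m'} < \zeta_n$. Combining this result with (34),
we have

(39)  $a(\alpha, n') \le \zeta_n \le a(\alpha, n) \qquad (0 < \alpha < \tfrac12\ \text{or}\ \alpha > \tfrac32).$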
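Similarly, replace $n$ in (19a) by $n' < n$. Then reasoning analogous to that
used in deriving (39) gives

(40)  $\zeta_{n'} < \delta_m \quad \text{for} \quad n' \le n - \dfrac{|(\alpha - \tfrac12)(\alpha - \tfrac32)|}{4\zeta_{n'}^2} \qquad (\tfrac12 < \alpha < \tfrac32).$

Hence, for $n'$ satisfying (40)

(41)  $a(\alpha, n') \le \zeta_{n'} \le a(\alpha, n) \qquad (\tfrac12 < \alpha < \tfrac32).$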


But the first term of (34) which will be affected by replacing $n$ by $n + O(n^{-1})$
is the term in $n^{-3/2}$, hence (33), (34), (39) and (41) lead to

THEOREM II. If $\{{}_L x_{in}\}$ denote the zeros of the generalized Laguerre polynomial
$L_n(x, \alpha)$, arranged in increasing order of magnitude, then for any $\alpha > 0$ bounds for
these zeros are given in (33), and also for any $\alpha > 0$

(42)  ${}_L x_{nn} = 4n + 2\alpha - 3.7115142\,(4n + 2\alpha)^{1/3} + 2.7550676\,(4n + 2\alpha)^{-1/3} + O(n^{-1}).$
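Formula (42) can be compared with a directly computed largest zero. The sketch below is a numerical illustration under stated assumptions, not the paper's method: the function names are hypothetical; the paper's L_n(x, α), defined by equation (2), is identified with the usual generalized Laguerre polynomial with parameter a = α - 1 by comparing (2) with the standard Laguerre equation; the recurrence and derivative identity are the standard ones for that family; and the Newton starting point 4n + 2α + 2 is assumed to lie to the right of every zero.

```python
def largest_laguerre_zero(n, alpha, iterations=500):
    """Largest zero of the paper's L_n(x, alpha), n >= 1, by Newton's method."""
    a = alpha - 1.0                   # standard parameter: x y'' + (a + 1 - x) y' + n y = 0
    x = 4.0 * n + 2.0 * alpha + 2.0   # assumed starting point to the right of all zeros
    for _ in range(iterations):
        p_prev, p = 0.0, 1.0          # placeholder for L_{-1} (its factor is 0) and L_0
        for k in range(n):
            # (k+1) L_{k+1} = (2k + 1 + a - x) L_k - (k + a) L_{k-1}
            p_prev, p = p, ((2 * k + 1 + a - x) * p - (k + a) * p_prev) / (k + 1)
        dp = (n * p - (n + a) * p_prev) / x   # from x L_n' = n L_n - (n + a) L_{n-1}
        step = p / dp
        x -= step
        if abs(step) <= 1e-13 * abs(x):
            break
    return x

def formula_42(n, alpha):
    """Leading terms of (42), without the O(1/n) remainder."""
    N = 4 * n + 2 * alpha
    return N - 3.7115142 * N ** (1 / 3) + 2.7550676 * N ** (-1 / 3)

if __name__ == "__main__":
    for alpha in (0.5, 1.0, 2.5):
        for n in (8, 32, 64):
            z = largest_laguerre_zero(n, alpha)
            print(alpha, n, z, formula_42(n, alpha), z - formula_42(n, alpha))
```

The last printed column is the quantity that (42) asserts to be O(1/n) for fixed α.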


UNIVERSITY OF PENNSYLVANIA. 





REMARKS ON THE PROBLEM OF PLATEAU 


By E. F. BECKENBACH


1. Introduction. We shall consider the problem of Plateau in the following
form.

PROBLEM OF PLATEAU. Given a Jordan curve $\Gamma$ in xyz-space, determine
functions $x(u, v)$, $y(u, v)$, $z(u, v)$ which are continuous for $u^2 + v^2 \le 1$, are harmonic
and satisfy $E = G$, $F = 0$, where

$$E = x_u^2 + y_u^2 + z_u^2, \qquad F = x_u x_v + y_u y_v + z_u z_v, \qquad G = x_v^2 + y_v^2 + z_v^2,$$

for $u^2 + v^2 < 1$, and map $u^2 + v^2 = 1$ in a topological way on $\Gamma$.

Any set of functions satisfying the above conditions are coördinate functions
of a minimal surface bounded by $\Gamma$ and given in isothermic representation.
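As a concrete instance of the conditions E = G, F = 0 for harmonic coordinate functions, one may take Enneper's surface. This example is not in the original text and is offered only as an illustration; the function names are hypothetical and the partial derivatives are written out by hand. The check uses exact rational arithmetic.

```python
from fractions import Fraction

def enneper_partials(u, v):
    """First partials of x = u - u^3/3 + u v^2, y = v - v^3/3 + v u^2, z = u^2 - v^2."""
    ru = (1 - u * u + v * v, 2 * u * v, 2 * u)    # (x_u, y_u, z_u)
    rv = (2 * u * v, 1 - v * v + u * u, -2 * v)   # (x_v, y_v, z_v)
    return ru, rv

def first_fundamental_form(u, v):
    """E, F, G as defined in the statement of the problem of Plateau."""
    ru, rv = enneper_partials(u, v)
    E = sum(a * a for a in ru)
    F = sum(a * b for a, b in zip(ru, rv))
    G = sum(b * b for b in rv)
    return E, F, G

def laplacians(u, v):
    """(x_uu + x_vv, y_uu + y_vv, z_uu + z_vv); all vanish when x, y, z are harmonic."""
    return (-2 * u + 2 * u, 2 * v - 2 * v, 2 - 2)

if __name__ == "__main__":
    for u, v in [(Fraction(1, 3), Fraction(1, 2)), (Fraction(-2, 5), Fraction(3, 7))]:
        print(first_fundamental_form(u, v), laplacians(u, v))
```

For this surface E = G = (1 + u² + v²)², so the map is isothermic with a factor that never vanishes.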

The following theorems have been proved.

THEOREM 1. If $\Gamma$ bounds some surface, of the type of the circular disc, with a
finite area, then the problem of Plateau is solvable for $\Gamma$.

THEOREM 2. The problem of Plateau is solvable for an arbitrary Jordan curve $\Gamma$.

Theorem 1 has been proved separately and at about the same time by J.
Douglas and T. Radó.¹ Subsequent proofs have been given by E. J. McShane²
and R. Courant.³ Theorem 2 has been proved by J. Douglas (loc. cit.), and
later, by means of a different method but the same lemmas, by T. Radó.⁴ In
what follows, we consider alternative proofs of this latter theorem.

In proving Theorem 2, Douglas, assuming Theorem 1, first uses a limiting
process to establish the existence of mapping functions. He then completes
the proof by using the following two lemmas to show that the functions thus
obtained map $u^2 + v^2 = 1$ topologically on $\Gamma$.

LEMMA 1. Let $x(u, v)$, $y(u, v)$, $z(u, v)$ be harmonic and satisfy $E = G$, $F = 0$
for $u^2 + v^2 < 1$. Suppose $x(u, v)$, $y(u, v)$, $z(u, v)$ remain continuous on an arc $\sigma$
of $u^2 + v^2 = 1$, and $x(u, v) = \text{const.} = x_0$, $y(u, v) = \text{const.} = y_0$, $z(u, v) =
\text{const.} = z_0$ on $\sigma$. Then $x(u, v) \equiv x_0$, $y(u, v) \equiv y_0$, $z(u, v) \equiv z_0$.

DOUGLAS' LEMMA. Let the integrable functions $\xi(\varphi)$, $\eta(\varphi)$, $\zeta(\varphi)$, substituted

Received July 12, 1937. 

¹ Their results are summed up in the following papers: J. Douglas, Solution of the
problem of Plateau, Transactions of the American Mathematical Society, vol. 33 (1931),
pp. 263-321; T. Radó, The problem of the least area and the problem of Plateau, Mathematische
Zeitschrift, vol. 32 (1930), pp. 763-796.

² E. J. McShane, Parametrization of saddle surfaces, with application to the problem of
Plateau, Transactions of the American Mathematical Society, vol. 35 (1933), pp. 716-733.

³ R. Courant, On the problem of Plateau, Proceedings of the National Academy of Sciences,
U. S. A., vol. 22 (1936), pp. 367-372.

⁴ An iterative process in the problem of Plateau, Transactions of the American Mathematical
Society, vol. 35 (1933), pp. 869-887.












REMARKS ON PROBLEM OF PLATEAU 677 


in the Poisson integral formula, determine the (harmonic) coördinate functions of a
minimal surface in isothermic representation. Let further $\xi(\varphi)$, $\eta(\varphi)$, $\zeta(\varphi)$ approach
definite limit values $\xi_-(\pi)$, $\eta_-(\pi)$, $\zeta_-(\pi)$ and $\xi_+(\pi)$, $\eta_+(\pi)$, $\zeta_+(\pi)$ according
as $\varphi \to \pi$ in clockwise and counterclockwise senses, respectively. Then

$$\xi_-(\pi) = \xi_+(\pi), \qquad \eta_-(\pi) = \eta_+(\pi), \qquad \zeta_-(\pi) = \zeta_+(\pi).$$


We have the following generalization⁵ of a theorem of Lindelöf.⁶

LEMMA 2. Let $x(u, v)$, $y(u, v)$, $z(u, v)$ be harmonic and bounded and satisfy
$E = G$, $F = 0$ for $0 < \text{arc tan}\,(v/u) < \alpha$, $0 < u^2 + v^2 < r_0^2$. Suppose $x(u, v)$,
$y(u, v)$, $z(u, v)$ remain continuous on the ray $0 < u < r_0$, $v = 0$, and $x(u, 0) \to x_0$,
$y(u, 0) \to y_0$, $z(u, 0) \to z_0$ as $u \to +0$. Then in every sector

$$0 < \text{arc tan}\,\frac{v}{u} < \alpha - \sigma, \qquad u^2 + v^2 < r_0^2, \qquad \text{where } \sigma > 0,$$

we have $x(u, v) \to x_0$, $y(u, v) \to y_0$, $z(u, v) \to z_0$ as $(u, v) \to (0, 0)$ in any manner.

Since Douglas' lemma is a direct consequence⁷ of Lemma 2, it follows, as Radó
has remarked,⁸ that Theorem 2 is a consequence of Theorem 1 and Lemmas 1
and 2.

We shall call attention to two pairs of proofs, most of which have previously
been given, of Lemmas 1 and 2, and then, reviewing the limiting process, shall
obtain a proof of Theorem 2 from Theorem 1 and Lemmas 1 and 2.

1. Proof of Lemmas 1 and 2 by means of subharmonic functions. A lemma
which allows the immediate application of the Principle of the Maximum to
minimal surfaces is the following.⁹

A necessary and sufficient condition that the continuous functions $x(u, v)$,
$y(u, v)$, $z(u, v)$ be harmonic functions satisfying $E = G$, $F = 0$, is that
$[(x - a)^2 + (y - b)^2 + (z - c)^2]^{1/2}$ be of class¹⁰ PL for arbitrary choice of the real
constants $a$, $b$, $c$.

By means of the above lemma, Beckenbach and Radó give brief proofs of
Lemmas 1 and 2, strictly analogous to proofs by means of the Principle of the
Maximum of corresponding theorems concerning analytic functions of a complex
variable. Actually, they prove Lemma 1 under the additional restriction that
$x(u, v)$, $y(u, v)$, $z(u, v)$ be bounded. But this restriction may be removed as
follows. If $x(u, v)$, $y(u, v)$, $z(u, v)$ satisfy the assumptions of Lemma 1, then
in an arbitrary Jordan region $R$ bounded by $\sigma + B$, where $\sigma + B$ is a Jordan

⁵ E. F. Beckenbach and T. Radó, Subharmonic functions and minimal surfaces, Transactions
of the American Mathematical Society, vol. 35 (1933), pp. 648-661.

⁶ See Pólya und Szegö, Aufgaben und Lehrsätze aus der Analysis, Berlin, 1925, vol. I,
p. 138, problem 277.

⁷ E. F. Beckenbach and T. Radó, loc. cit., p. 658.

⁸ T. Radó, On the problem of Plateau, Berlin, 1933, p. 73.

⁹ E. F. Beckenbach and T. Radó, loc. cit., p. 654.

¹⁰ A function $p(u, v)$, defined in a domain $D$, is said to be of class PL in $D$ provided
$p(u, v)$ is continuous and $\ge 0$ in $D$ and $\log p(u, v)$ is subharmonic in the part of $D$ where
$p(u, v) > 0$.




curve and every point of $B$ is in $u^2 + v^2 < 1$, the functions $x(u, v)$, $y(u, v)$,
$z(u, v)$ are bounded. Map $\alpha^2 + \beta^2 < 1$ conformally on the interior of $R$ by
means of the analytic function $u + iv = f(\alpha + i\beta)$; an arc $\sigma'$ of $\alpha^2 + \beta^2 = 1$
will correspond to $\sigma$. There are induced functions $x = x(u, v) = X(\alpha, \beta)$, etc.,
which satisfy the conditions, with $\alpha$, $\beta$, $\sigma'$ replacing $u$, $v$, $\sigma$, under which Beckenbach
and Radó proved Lemma 1, so that $X(\alpha, \beta) \equiv x_0$, $Y(\alpha, \beta) \equiv y_0$, $Z(\alpha, \beta) \equiv
z_0$, whence $x(u, v) \equiv x_0$, $y(u, v) \equiv y_0$, $z(u, v) \equiv z_0$.


2. Alternative proofs of Lemmas 1 and 2. Radó has proved¹¹ Lemma 1 about
as follows. Since by assumption the harmonic functions $x(u, v)$, $y(u, v)$, $z(u, v)$
reduce to constants on $\sigma$, it follows by the Principle of Symmetry that these
functions remain analytic on $\sigma$, and consequently the relations $E = G$, $F = 0$
hold on $\sigma$. Since, for an isothermic map, we have $dx^2 + dy^2 + dz^2 =
\lambda(du^2 + dv^2)$, where $\lambda = E = G$, and since $dx^2 + dy^2 + dz^2 = 0$ for $(u, v)$ on $\sigma$,
it follows that $E = G = 0$ on $\sigma$, and consequently $x_u = x_v = y_u = y_v = z_u =
z_v = 0$ on $\sigma$. That is, the functions $x_u - i x_v$, $y_u - i y_v$, $z_u - i z_v$, which are
analytic functions of $w = u + iv$, vanish on an arc of their domain of regularity
and therefore vanish identically. Hence, $x_u \equiv x_v \equiv y_u \equiv y_v \equiv z_u \equiv z_v \equiv 0$,
so that $x(u, v)$, $y(u, v)$, $z(u, v)$ are identically constant.

We now offer a companion proof, based on the notion of normal families,
of Lemma 2.

Three functions $x(u, v)$, $y(u, v)$, $z(u, v)$, harmonic in a domain $D$, are called a
triple of conjugate harmonic functions provided they satisfy $E = G$, $F = 0$ in $D$.
In conformity with analytic function theory, we shall say that a family of such
triples constitutes a normal family of triples of conjugate harmonic functions in $D$
provided that every infinite sequence of triples of the family contains a subsequence
of triples which converges uniformly to a triple of conjugate harmonic
functions, or for which $x^2 + y^2 + z^2$ converges uniformly to infinity, in every
closed region in $D$. It is a well known fact¹² that a family of functions $\{h(u, v)\}$,
harmonic and uniformly bounded in $D$, constitutes a normal family of harmonic
functions in $D$, and that if

$$[h_n(u, v)], \qquad n = 0, 1, 2, \cdots,$$

is a convergent sequence of the family, then the sequence

$$\Bigl[\frac{\partial^{j+k}}{\partial u^j\,\partial v^k}\, h_n(u, v)\Bigr], \qquad n = 0, 1, 2, \cdots,$$

$j$ and $k$ being fixed, converges uniformly in every closed region in $D$ to the
corresponding derivative of the limit function. It follows immediately that a
family of triples of conjugate harmonic functions, uniformly bounded in a
domain $D$, constitutes a normal family of triples of conjugate harmonic
functions.

¹¹ T. Radó, Some remarks on the problem of Plateau, Proceedings of the National Academy
of Sciences, U. S. A., vol. 16 (1930), pp. 242-248.

¹² See O. D. Kellogg, Foundations of Potential Theory, Berlin, 1929, Chapter X.

























Denote¹³ $(u, v) = (0, 0)$ by $O$, $(r_0, 0)$ by $A$, and $(r_0 \cos\alpha, r_0 \sin\alpha)$ by $B$.
Draw a line through $O$ making an angle $\alpha - \sigma > 0$ with $OA$ and cutting the
arc $AB$ at $C$. Let $0 < \delta < r_0$. Construct arcs with center $O$, radii $\delta/2^n$,
cutting $OA$ at $A_n$, and $OC$ at $C_n$, $n = 0, 1, 2, \cdots$. Let $D_n$ be the domain
bounded by $A_n A_{n+1} C_{n+1} C_n A_n$. Then, for $(u, v)$ in $D_0$, $(u/2^n, v/2^n)$ is in $D_n$.
Define

$$x_n(u, v) = x\Bigl(\frac{u}{2^n}, \frac{v}{2^n}\Bigr), \qquad y_n(u, v) = y\Bigl(\frac{u}{2^n}, \frac{v}{2^n}\Bigr), \qquad z_n(u, v) = z\Bigl(\frac{u}{2^n}, \frac{v}{2^n}\Bigr).$$

In $D_0$, $x_n(u, v)$, $y_n(u, v)$, $z_n(u, v)$ take on the same values that $x(u, v)$, $y(u, v)$,
$z(u, v)$ take on in $D_n$. Since $x(u, v)$, $y(u, v)$, $z(u, v)$ are bounded, the sequence

(1)  $[x_n(u, v),\ y_n(u, v),\ z_n(u, v)], \qquad n = 0, 1, 2, \cdots,$

forms a normal family of triples of conjugate harmonic functions in $D_0$. Then
there is a subsequence

$$[x_{n_k}(u, v),\ y_{n_k}(u, v),\ z_{n_k}(u, v)], \qquad k = 0, 1, 2, \cdots,$$

which converges uniformly in $D_0$ to a set of conjugate harmonic functions
$\bar{x}(u, v)$, $\bar{y}(u, v)$, $\bar{z}(u, v)$.

Since $x(u, v)$ is continuous on $OA$, $x_{n_k}(u, v)$ is continuous there, and, for
$(u, 0)$ on the boundary of $D_0$,

$$\bar{x}(u, 0) = \lim_{k \to \infty} x_{n_k}(u, 0) = \lim_{k \to \infty} x\Bigl(\frac{u}{2^{n_k}}, 0\Bigr) = x_0;$$

similarly, $\bar{y}(u, 0) = y_0$, $\bar{z}(u, 0) = z_0$. Therefore, by Lemma 1, $\bar{x}(u, v) \equiv x_0$,
$\bar{y}(u, v) \equiv y_0$, $\bar{z}(u, v) \equiv z_0$.

Now the entire sequence (1) must converge to $x_0$, $y_0$, $z_0$, since otherwise
there would be a subsequence which converges to a triple of conjugate harmonic
functions other than $x_0$, $y_0$, $z_0$, and the above analysis shows that any convergent
subsequence converges to $x_0$, $y_0$, $z_0$. Therefore, for $(u, v)$ in $D_0$,

$$\lim_{n \to \infty} x_n(u, v) = x_0, \qquad \lim_{n \to \infty} y_n(u, v) = y_0, \qquad \lim_{n \to \infty} z_n(u, v) = z_0;$$

but the values of $x_n(u, v)$, $y_n(u, v)$, $z_n(u, v)$ in $D_0$ are the values of $x(u, v)$, $y(u, v)$,
$z(u, v)$ in $D_n$, so that $x(u, v) \to x_0$, $y(u, v) \to y_0$, $z(u, v) \to z_0$ as $(u, v) \to (0, 0)$
in the sector $0 < \text{arc tan}\,(v/u) < \alpha - \sigma$.


3. Proof of Theorem 2. Approximate to $\Gamma$ in the sense of Fréchet by a sequence $\Gamma_n$, $n = 0, 1, 2, \dots$, of simple closed polygons. By Theorem 1, the problem of Plateau is solvable for $\Gamma_n$; further, by means of an adjoined linear fractional transformation, the solution can be so normalized that three distinct points $A$, $B$, $C$ on $u^2 + v^2 = 1$ are carried into three arbitrary distinct points


13 The following proof parallels a proof of Montel for analytic functions. See P. Montel, Leçons sur les familles normales de fonctions analytiques, Paris, 1927, pp. 188-192.











680 E. F. BECKENBACH 


$A_n$, $B_n$, $C_n$ on $\Gamma_n$. Choose three distinct points $A^*$, $B^*$, $C^*$ on $\Gamma$ and let $A_n \to A^*$, $B_n \to B^*$, $C_n \to C^*$. Let now

$$x = x_n(u, v), \qquad y = y_n(u, v), \qquad z = z_n(u, v)$$

solve the normalized problem for $\Gamma_n$, and let the corresponding boundary functions be

$$x = \xi_n(\varphi), \qquad y = \eta_n(\varphi), \qquad z = \zeta_n(\varphi), \qquad \varphi = \arctan\frac{v}{u}.$$

An immediate generalization$^{14}$ of the fact that a uniformly bounded sequence of monotonic functions must contain a convergent subsequence assures us of the existence of a subsequence

$$x = \xi_{n_k}(\varphi), \qquad y = \eta_{n_k}(\varphi), \qquad z = \zeta_{n_k}(\varphi), \qquad k = 0, 1, 2, \dots, \tag{2}$$

converging everywhere on $u^2 + v^2 = 1$ to limit functions

$$x = \xi(\varphi), \qquad y = \eta(\varphi), \qquad z = \zeta(\varphi), \tag{3}$$

which map $u^2 + v^2 = 1$ monotonically on $\Gamma$, carrying $A$, $B$, $C$ to $A^*$, $B^*$, $C^*$ respectively.


Consider the harmonic functions

$$x = x(u, v), \qquad y = y(u, v), \qquad z = z(u, v), \qquad u^2 + v^2 < 1, \tag{4}$$

obtained by substituting the functions (3) in the Poisson integral formula. Since the functions (2) are uniformly bounded, we may pass to the limit in the corresponding sequences of Poisson integrals, so that the functions

$$x_{n_k}(u, v), \qquad y_{n_k}(u, v), \qquad z_{n_k}(u, v), \qquad k = 0, 1, 2, \dots,$$

and their partial derivatives converge in $u^2 + v^2 < 1$ to the functions (4) and their corresponding partial derivatives. Since $E_{n_k} = G_{n_k}$, $F_{n_k} = 0$, it follows therefore that $E = G$, $F = 0$.

That the conjugate harmonic functions (4) give a solution of the problem of Plateau for the curve $\Gamma$ will be established when we show that the boundary functions (3), which map $u^2 + v^2 = 1$ monotonically on $\Gamma$, actually map $u^2 + v^2 = 1$ topologically on $\Gamma$. Now the functions (3) cannot remain constant on an arc of $u^2 + v^2 = 1$, by Lemma 1 and the three-point condition. And, by the monotonic character of the map, the functions (3) have definite one-sided limits for each value of $\varphi$; we shall show that these limits are the same from both sides at an arbitrary $\varphi_0$. Because of the nature of the possible discontinuity of $\xi(\varphi)$ at $\varphi = \varphi_0$, it follows from a well-known property of the Poisson integral that $x(u, v)$ approaches a definite limit if $(u, v) \to (\cos\varphi_0, \sin\varphi_0)$ along any straight line in $u^2 + v^2 < 1$, this limit being a linear function of the angle which the straight line makes with a fixed direction and varying from $\xi_-(\varphi_0)$ to $\xi_+(\varphi_0)$. Similar statements hold for $y(u, v)$ and $z(u, v)$. But if


14 T. Radó, first footnote, p. 771.















we join two such straight lines by a circular arc lying in $u^2 + v^2 < 1$, we obtain a sector for which Lemma 2 applies; consequently, $(x, y, z) \to$ a definite $(x_0, y_0, z_0)$ which does not vary with the angle. That is, the linear functions mentioned above are constants, whence

$$\xi_-(\varphi_0) = \xi_+(\varphi_0), \qquad \eta_-(\varphi_0) = \eta_+(\varphi_0), \qquad \zeta_-(\varphi_0) = \zeta_+(\varphi_0).$$

Therefore, the functions (3) map $u^2 + v^2 = 1$ in a one-to-one way on $\Gamma$.

4. Lemmas 1 and 2 are essentially theorems im kleinen, and their proofs are independent of the dimensionality of the containing Euclidean space. Therefore, these lemmas may be used to discuss the behavior on the boundary of functions giving isothermic representations of minimal surfaces bounded by several Jordan curves in Euclidean n-space.


RICE INSTITUTE.








ANALYTIC FUNCTIONS OF ABSOLUTELY CONVERGENT 
GENERALIZED TRIGONOMETRIC SUMS 


By R. H. CAMERON


1. Introduction. It has been shown by Wiener$^1$ that a nowhere vanishing periodic function with an absolutely convergent Fourier series has a reciprocal whose Fourier series also converges absolutely. Lévy$^2$ has pointed out that this result can be extended from reciprocals to general analytic functions. Thus if $f(x)$ is periodic and never zero and has an absolutely convergent Fourier series, it follows that $F[f(x)]$ also has an absolutely convergent Fourier series provided that $F(z)$ is analytic and single valued whenever $z = f(x)$. One of the results of this paper (Theorem I) shows that these results are true in $n$ or even $\aleph_0$ dimensions. This is accomplished by carrying through Wiener's proof with the necessary modifications to take care of dimensionality.

One might reasonably ask whether this result can be extended from periodic to almost periodic functions. A partial answer to this question has been given by Bochner,$^3$ who has shown that reciprocals of trigonometric polynomials which are bounded away from zero on the real axis have absolutely convergent Fourier series. It is shown in the present paper that the theorem is true not only for trigonometric polynomials, but also for absolutely convergent infinite trigonometric sums. No further hypothesis is required; so the exponents are altogether unrestricted and may be any countable set of real numbers. Moreover this result is true not only for reciprocals, but for all analytic functions; and it holds in $n$ or even $\aleph_0$ dimensions. Thus the final result of the paper is

THEOREM II. Let $f(x_1, x_2, \dots)$ be an almost periodic function with an absolutely convergent Fourier series, and let $R$ be the closure of its set of values. Then if $F(z)$ is a function analytic over an open set $S$ containing $R$, it follows that $F[f(x_1, x_2, \dots)]$ is an almost periodic function with an absolutely convergent Fourier series.


Received July 12, 1937.

1 N. Wiener, Tauberian theorems, Ann. of Math., (2), vol. 33 (1932), pp. 1-100; p. 14.

2 P. Lévy, Sur la convergence absolue des séries de Fourier, C. R. Acad. Sci., Paris, vol. 196 (1933), pp. 463-464.

3 S. Bochner, Beitrag zur absoluten Konvergenz fastperiodischer Fourierreihen, Jahresbericht der Deutschen Math. Ver., vol. 39 (1930), pp. 52-54.

4 After this paper had been submitted for publication, the author learned that his main theorem (without the extension to analytic functions or to more than one dimension) has been proved independently by H. R. Pitt. Apparently Pitt's work was done somewhat earlier than the author's, though it was not submitted for publication until about the time the present paper was accepted for publication. It will appear in an early issue of the Journal of Mathematics and Physics, Massachusetts Institute of Technology.


682 






















2. Absolute convergence a local property for periodic functions. Before proving Theorem I we shall need to extend to infinitely many variables Wiener's lemma$^5$ that a periodic function has an absolutely convergent Fourier series if in the neighborhood of every point it is equal to a function having an absolutely convergent Fourier series. Such a generalization naturally depends on the type of neighborhoods we use, and the appropriate neighborhoods in this case are defined as follows. We consider the space whose points $P(x_1, x_2, \dots)$ are unrestricted sequences $x_1, x_2, \dots$ of real numbers. Then corresponding to each point $P(x_1, x_2, \dots)$, each $\epsilon > 0$ and each positive integer $n$ we define the $(\epsilon, n)$-neighborhood of $P$ to be the set of all points $x_1', x_2', \dots$ satisfying

$$|x_j' - x_j| < \epsilon \pmod{2\pi} \qquad (j = 1, \dots, n).$$

Jessen$^6$ has shown that for such neighborhoods in which all but the first $n$ variables are unrestricted the Heine-Borel theorem holds for the whole space. This fact enables us to extend Wiener's proof to infinitely many variables and obtain

LEMMA 1. Let $f(P) = f(x_1, x_2, \dots)$ be periodic of period $2\pi$ in each variable. Suppose further it is known that corresponding to each point $P'(x_1', x_2', \dots)$ there exist $\epsilon_{P'} > 0$ and $n_{P'}$ and a function $f_{P'}(P) = f_{P'}(x_1, x_2, \dots)$ which equals $f(P)$ throughout the $(\epsilon_{P'}, n_{P'})$-neighborhood of $P'$ and has an absolutely convergent Fourier series

$$f_{P'}(P) = A_0^{(P')} + \sum_{n=1}^{\infty} A_n^{(P')} \exp\Big\{ i \sum_{j=1}^{k_n^{(P')}} p_{n,j}^{(P')} x_j \Big\}.$$

Then it follows that $f(P)$ has an absolutely convergent Fourier series

$$f(P) = A_0 + \sum_{n=1}^{\infty} A_n \exp\Big\{ i \sum_{j=1}^{k_n} p_{n,j} x_j \Big\}.$$

For by the Heine-Borel theorem there are a finite number of points $P_1, P_2, \dots, P_\nu$ such that every point $P$ is contained in $N_{P_1} + N_{P_2} + \dots + N_{P_\nu}$, where $N_{P_j}$ is the $(\tfrac12\epsilon_{P_j}, n_{P_j})$-neighborhood of $P_j$; and we shall show how to fit together absolutely convergent Fourier series in these neighborhoods to make $f(P)$. For positive values of $\xi < \pi$, let $T_\xi(x)$ be periodic of period $2\pi$ in $x$, and let it be defined by the equation $T_\xi(x) = \max\,[1 - |x|/\xi,\ 0]$ in one period $-\pi \le x \le \pi$.

Obviously this function consists of equally spaced isosceles peaks of height 1 with horizontal lines of height zero in between. Again, let

$$T_{\xi,m}(x_1, x_2, \dots) = T_\xi(x_1)\,T_\xi(x_2)\cdots T_\xi(x_m),$$

and note that $T_{\xi,m}(x_1, x_2, \dots)$ vanishes outside of the $(\xi, m)$-neighborhood of the origin and has a peak (or infinite dimensional edge) of unit height at


5 N. Wiener, loc. cit., p. 10.
6 B. Jessen, The theory of integration in a space of an infinite number of dimensions, Acta Math., vol. 63 (1934), pp. 249-323; p. 256.








684 R. H. CAMERON 


$x_1 = 0$, $x_2 = 0$, $\dots$, $x_m = 0$. Finally, if $\epsilon = \min(\epsilon_{P_1}, \dots, \epsilon_{P_\nu})$ and $n = \max(n_{P_1}, \dots, n_{P_\nu})$, and if $\lambda$ is an integer so great that $2^{-\lambda} < \epsilon/(2\pi)$, let

$$U_{\mu_1,\dots,\mu_n}(P) = U_{\mu_1,\dots,\mu_n}(x_1, x_2, \dots) = T_{2^{-\lambda}\pi,\,n}(x_1 - 2^{-\lambda}\pi\mu_1,\ \dots,\ x_n - 2^{-\lambda}\pi\mu_n)$$

and note that

$$\sum_{\mu_1,\dots,\mu_n=0}^{2^{\lambda+1}-1} U_{\mu_1,\dots,\mu_n}(P) = 1,$$

and hence that for all $P$,

$$f(P) = \sum_{\mu_1,\dots,\mu_n=0}^{2^{\lambda+1}-1} U_{\mu_1,\dots,\mu_n}(P)\,f(P). \tag{1}$$

Now if in each term of this sum we replace $f(P)$ by a function which equals $f(P)$ except when the coefficient of $f(P)$ is zero, the equation will still be true. But such functions can be found with absolutely convergent Fourier series. For

$$Q:\ (2^{-\lambda}\pi\mu_1,\ 2^{-\lambda}\pi\mu_2,\ \dots,\ 2^{-\lambda}\pi\mu_n,\ 0,\ 0,\ \dots)$$

is contained in one of the neighborhoods $N_{P_1}, \dots, N_{P_\nu}$, say $N_{P_j}$; and it follows that the $(2^{-\lambda}\pi, n)$-neighborhood of $Q$ is contained in the $(\epsilon_{P_j}, n_{P_j})$-neighborhood of $P_j$. Thus for all $P$

$$U_{\mu_1,\dots,\mu_n}(P)\,f(P) = U_{\mu_1,\dots,\mu_n}(P)\,f_{P_j}(P);$$

and since $T_\xi(x)$ has an absolutely convergent Fourier series, so do $T_{\xi,m}(x_1, x_2, \dots)$ and $U_{\mu_1,\dots,\mu_n}(P)$ and $U_{\mu_1,\dots,\mu_n}(P)f(P)$. Consequently, it follows from (1) that $f(P)$ has an absolutely convergent Fourier series and the lemma holds.
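The key fact used in the proof of Lemma 1 is that the translated triangular peaks add up to 1 identically. The following is a minimal numerical sketch of that fact, not part of the original paper; the function names `T`, `partition_sum` and `U_sum` are illustrative choices, and the test points are arbitrary.

```python
import math

def T(xi, x):
    """Triangular peak max(1 - |x|/xi, 0), extended with period 2*pi."""
    # Reduce x to the period [-pi, pi) before applying the defining formula.
    x = (x + math.pi) % (2 * math.pi) - math.pi
    return max(1.0 - abs(x) / xi, 0.0)

def partition_sum(lam, x):
    """Sum of the 2^(lam+1) translates T_xi(x - xi*mu) with xi = 2^(-lam) * pi."""
    xi = math.pi / 2 ** lam
    return sum(T(xi, x - xi * mu) for mu in range(2 ** (lam + 1)))

def U_sum(lam, point):
    """Sum of the product functions over all index tuples (mu_1, ..., mu_n).

    The sum of products factorises into a product of one-variable sums,
    so no explicit enumeration of the index tuples is needed.
    """
    total = 1.0
    for x in point:
        total *= partition_sum(lam, x)
    return total

if __name__ == "__main__":
    print(partition_sum(3, 0.7))
    print(U_sum(3, [0.1, 1.9, -3.0]))
```

Because each point lies between two adjacent peak centres, exactly two translates are nonzero there and their values are complementary, which is why the sum is 1 up to rounding.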


3. Fourier series of small absolute value sum. Again following the course of the Wiener argument, we prove the

LEMMA 2. Let $f(x_1, x_2, \dots)$ have period $2\pi$ in each variable, and let it have an absolutely convergent Fourier series, the sum of the absolute values of the coefficients other than the constant term being $K$. Then if $F(z)$ is a function analytic inside and on the boundary of the circle $|z - f(0, 0, \dots)| \le 2K$, it follows that $F[f(x_1, x_2, \dots)]$ has an absolutely convergent Fourier series.

For $F(z)$ can be expanded in a power series about $z = z_0 = f(0, 0, \dots)$, and the Fourier series of $f(x_1, x_2, \dots)$ can be formally substituted for $z$ in this power series. The sum of the absolute values of the terms arising from $(z - z_0)^n$ will be less than or equal to $(2K)^n$, and hence the whole series with all parentheses removed will be absolutely convergent.
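The bound $(2K)^n$ in the proof of Lemma 2 can be checked on a small example. The sketch below is an illustration only, under the assumption of a one-variable trigonometric polynomial with integer exponents; the coefficient values and the helper names `multiply`, `sigma`, `off_constant_sum` and `centered` are hypothetical.

```python
from collections import defaultdict

def multiply(a, b):
    """Product of two trigonometric polynomials stored as {exponent: coefficient}."""
    out = defaultdict(complex)
    for p, ca in a.items():
        for q, cb in b.items():
            out[p + q] += ca * cb
    return dict(out)

def sigma(a):
    """Sum of the absolute values of all coefficients."""
    return sum(abs(c) for c in a.values())

def off_constant_sum(f):
    """K: sum of absolute values of the coefficients other than the constant term."""
    return sum(abs(c) for p, c in f.items() if p != 0)

def centered(f):
    """Coefficients of f - f(0), where f(0) is the sum of all coefficients."""
    z0 = sum(f.values())
    g = dict(f)
    g[0] = g.get(0, 0) - z0
    return g

# Hypothetical example: f(x) = 0.5 + 0.3 e^{ix} + 0.2i e^{-2ix} - 0.1 e^{5ix}.
f = {0: 0.5, 1: 0.3, -2: 0.2j, 5: -0.1}
K = off_constant_sum(f)
g = centered(f)

def power_sigmas(g, n_max):
    """Return [sigma(g^1), ..., sigma(g^n_max)]."""
    result, power = [], {0: 1.0}
    for _ in range(n_max):
        power = multiply(power, g)
        result.append(sigma(power))
    return result

if __name__ == "__main__":
    for n, s in enumerate(power_sigmas(g, 5), start=1):
        print(n, s, (2 * K) ** n)
```

The check relies on two facts: the coefficient sum of $f - f(0)$ is at most $2K$, and the coefficient sum of a product is at most the product of the coefficient sums.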


4. General periodic functions with absolutely convergent Fourier series. We are now in a position to prove

THEOREM I. Let $f(x_1, x_2, \dots)$ be a function of period $2\pi$ in each variable, and let the sum $\sum_{n=0}^{\infty} A_n$ of the coefficients of its Fourier series

$$f(x_1, x_2, \dots) = A_0 + \sum_{n=1}^{\infty} A_n \exp\Big[ i \sum_{j=1}^{k_n} p_{n,j} x_j \Big] \tag{2}$$



















be absolutely convergent. Then if $R$ is the closure of the range of $f(x_1, x_2, \dots)$ and the function $F(z)$ is analytic in an open set $S$ containing $R$, it follows that $F[f(x_1, x_2, \dots)]$ also has an absolutely convergent series:

$$F[f(x_1, x_2, \dots)] = B_0 + \sum_{n=1}^{\infty} B_n \exp\Big[ i \sum_{j=1}^{l_n} q_{n,j} x_j \Big]. \tag{3}$$

On account of Lemma 1, we need only show that corresponding to each point $P'(x_1', x_2', \dots)$ there exists an $\epsilon_{P'} > 0$ and a positive integer $n_{P'}$ and a function $g_{P'}(x_1, x_2, \dots)$ which has an absolutely convergent Fourier series and which equals $F[f(x_1, x_2, \dots)]$ throughout the $(\epsilon_{P'}, n_{P'})$-neighborhood of $P'$. For under these circumstances Lemma 1 establishes the existence and absolute convergence of the series (3). And since the function $\tilde f(x_1, x_2, \dots) = f(x_1 - x_1', x_2 - x_2', \dots)$ satisfies the same hypothesis as $f(x_1, x_2, \dots)$, it follows that we need only consider the origin and show that there is a function $g_0(x_1, x_2, \dots)$ which equals $F[f(x_1, x_2, \dots)]$ throughout some neighborhood of the origin and has an absolutely convergent Fourier series.
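Now for any function $g$ let $\sigma(g)$ denote the sum of the absolute values of the coefficients of the Fourier series of the function $g$, and let $f(0, 0, \dots) = z_0$, so that

$$f(x_1, x_2, \dots) = z_0 + \sum_{n=1}^{\infty} A_n \Big\{ \exp\Big[ i \sum_{j=1}^{k_n} p_{n,j} x_j \Big] - 1 \Big\}.$$

Let $\delta > 0$ be so small that $z$ is in $S$ if $|z - z_0| \le \delta$, and let $N$ be so great that

$$\sum_{n=N+1}^{\infty} |A_n| < \tfrac18\delta.$$

Thus if

$$g(P) = g(x_1, x_2, \dots) = \sum_{n=1}^{N} A_n \Big\{ \exp\Big[ i \sum_{j=1}^{k_n} p_{n,j} x_j \Big] - 1 \Big\}$$

and

$$h(P) = \sum_{n=N+1}^{\infty} A_n \Big\{ \exp\Big[ i \sum_{j=1}^{k_n} p_{n,j} x_j \Big] - 1 \Big\},$$

it follows that $f(P) = z_0 + g(P) + h(P)$ so that $\sigma(h) < \tfrac14\delta$. Now for $0 < \xi < \tfrac12\pi$ define $W_\xi(P) = \prod_{n=1}^{N} V_\xi(x_n)$, where

$$V_\xi(x) = \begin{cases} 0 & \text{if } |x| \ge 2\xi \pmod{2\pi}, \\ 2 - |x|/\xi & \text{if } \xi \le |x| \le 2\xi \pmod{2\pi}, \\ 1 & \text{if } |x| \le \xi \pmod{2\pi}; \end{cases}$$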


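and note by actual computation that $V_\xi(x)$ (and hence also $W_\xi(P)$) has an absolutely convergent Fourier series whose absolute value sum is a bounded function of $\xi$. Moreover, $\lim_{\xi\to 0^+} \sigma[V_\xi(x)(e^{ipx} - 1)] = 0$; for if $\sum'$ denotes the sum from $n = -\infty$ to $n = +\infty$, omitting $0$ and $p$, we have by actual computation for all integers $p$,

$$\sigma[V_\xi(x)(e^{ipx} - 1)] = 2\,\Big| \frac{3\xi}{2\pi} - \frac{\cos p\xi - \cos 2p\xi}{p^2\pi\xi} \Big| + {\sum}' \Big| \frac{2[\sin\tfrac12(2n-p)\xi\,\sin\tfrac12 p\xi - \sin(2n-p)\xi\,\sin p\xi]}{(n-p)^2\pi\xi} + \frac{2(2np - p^2)\sin\tfrac32 n\xi\,\sin\tfrac12 n\xi}{(n-p)^2n^2\pi\xi} \Big|$$

$$\le 2\,\Big| \frac{3\xi}{2\pi} - \frac{2\sin\tfrac32 p\xi\,\sin\tfrac12 p\xi}{p^2\pi\xi} \Big| + {\sum}' \Big\{ \frac{2\,|\sin\tfrac12(2n-p)\xi|^{1/2}\,|\sin\tfrac12 p\xi| + 2\,|\sin(2n-p)\xi|^{1/2}\,|\sin p\xi|}{(n-p)^2\pi\xi} + \frac{2\,|2np - p^2|\cdot|\sin\tfrac32 n\xi|\cdot|\sin\tfrac12 n\xi|^{1/2}}{(n-p)^2n^2\pi\xi} \Big\}$$

$$\le 2\cdot\frac{3\xi}{2\pi} + 2\cdot\frac{2\cdot\tfrac32 p\xi\cdot\tfrac12 p\xi}{p^2\pi\xi} + {\sum}' \Big\{ \frac{2\,|\tfrac12(2n-p)\xi|^{1/2}\,|\tfrac12 p\xi| + 2\,|(2n-p)\xi|^{1/2}\,|p\xi|}{(n-p)^2\pi\xi} + \frac{2\,|2np - p^2|\cdot|\tfrac32 n\xi|\cdot|\tfrac12 n\xi|^{1/2}}{(n-p)^2n^2\pi\xi} \Big\}$$

$$= \xi^{1/2} \Big\{ \frac{6\,\xi^{1/2}}{\pi} + {\sum}' \Big[ \frac{|p|\,|\tfrac12(2n-p)|^{1/2} + 2|p|\,|2n-p|^{1/2}}{(n-p)^2\pi} + \frac{2^{1/2}\,|2np - p^2|\cdot\tfrac32}{(n-p)^2\,|n|^{1/2}\,\pi} \Big] \Big\}.$$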

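Now consider $W_\xi(P)g(P)$, and note that

$$\lim_{\xi\to 0^+} \sigma[W_\xi(P)g(P)] = 0,$$

since

$$\lim_{\xi\to 0^+} \sigma\big[ W_\xi(P)\,e^{i(p_{m+1}x_{m+1} + \dots + p_Nx_N)}\,(e^{ip_mx_m} - 1) \big] = 0$$

holds for any integers $p_m, \dots, p_N$ and implies

$$\lim_{\xi\to 0^+} \sigma\big[ W_\xi(P)\,(e^{i(p_1x_1 + p_2x_2 + \dots + p_Nx_N)} - 1) \big] = 0.$$

Finally choose $\epsilon > 0$ so small that

$$\sigma[W_\epsilon(P)g(P)] < \tfrac14\delta,$$

and consider the function

$$\bar f(P) = z_0 + W_\epsilon(P)g(P) + h(P).$$

Since

$$g(0, 0, \dots) = h(0, 0, \dots) = 0,$$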



















the constant term of $\bar f(P)$ is $\bar f(0, 0, \dots) = z_0$; and since

$$\sigma[\bar f(P) - z_0] = \sigma[W_\epsilon(P)g(P) + h(P)] < \tfrac12\delta,$$

Lemma 2 applies to $\bar f(P)$ and shows that $g_0(P) = F[\bar f(P)]$ has an absolutely convergent Fourier series. But by definition $W_\epsilon(P) = 1$ throughout the $(\epsilon, N)$-neighborhood of the origin, and hence $g_0(P) = F[f(P)]$ in the neighborhood, and our proof is complete.


5. Almost periodic functions. We can now pass to the case of generalized 
Fourier series and prove Theorem II, which has been stated in the Introduction. 


Let

$$f(x_1, x_2, \dots) = A_0 + \sum_{n=1}^{\infty} A_n \exp\Big[ i \sum_{j=1}^{p_n} \lambda_{n,j} x_j \Big].$$

The proof can be based on Theorem I in the following way. Let $\delta > 0$ be so small that if $|z_1 - z| \le \delta$ and $z_1 \in R$, then $z \in S$; and let $R^*$ be the closed set consisting of all points whose distance from $R$ is not greater than $\delta$. Let $N$ be so great that $\sum_{n=N+1}^{\infty} |A_n| < \tfrac12\delta$, and let $\mu_1, \mu_2, \dots, \mu_s$ be an integral basis for all $\lambda_{n,j}$ for which $n \le N$ and $j \le p_n$; so that

$$\lambda_{n,j} = \sum_{\nu=1}^{s} k_{n,j,\nu}\,\mu_\nu \qquad (n = 1, \dots, N;\ j = 1, \dots, p_n).$$

Here the $k_{n,j,\nu}$ are integers, and no integers $k_1, \dots, k_s$ except $0, \dots, 0$ make

$$\sum_{\nu=1}^{s} k_\nu \mu_\nu = 0.$$

If $p$ is the greatest of $p_1, \dots, p_N$, consider the functions$^7$

$$h(y_{1,1}, \dots, y_{1,s};\ \dots;\ y_{p,1}, \dots, y_{p,s}) = A_0 + \sum_{n=1}^{N} A_n \exp\Big\{ i \sum_{j=1}^{p_n} \sum_{\nu=1}^{s} k_{n,j,\nu}\,y_{j,\nu} \Big\}$$

and

$$g(y_{1,1}, \dots, y_{p,s};\ y_{N+1}, y_{N+2}, \dots) = h(y_{1,1}, \dots, y_{p,s}) + \sum_{n=N+1}^{\infty} A_n e^{iy_n}.$$

It is clear that $g$ has period $2\pi$ in all its arguments, and that

$$g\Big( \mu_1x_1, \dots, \mu_sx_1;\ \dots;\ \mu_1x_p, \dots, \mu_sx_p;\ \sum_{j=1}^{p_{N+1}} \lambda_{N+1,j}x_j,\ \sum_{j=1}^{p_{N+2}} \lambda_{N+2,j}x_j,\ \dots \Big) = f(x_1, x_2, \dots).$$
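The substitution through an integral basis can be illustrated on a toy case. The sketch below is a hypothetical example, not taken from the paper: one variable $x$, two exponents $1 + 2\sqrt2$ and $3 - \sqrt2$, and the assumed basis $\mu = (1, \sqrt2)$. It checks that the auxiliary function has period $2\pi$ in each argument and reduces to the almost periodic function along the line $y_\nu = \mu_\nu x$.

```python
import cmath
import math

# Hypothetical data: basis (mu_1, mu_2) and integer coordinates k of each exponent.
MU = (1.0, math.sqrt(2.0))
K_INT = [(1, 2), (3, -1)]          # exponents 1 + 2*sqrt(2) and 3 - sqrt(2)
A0, A = 0.5, [0.3, -0.2j]          # illustrative coefficients

def f(x):
    """Almost periodic function with exponents sum_nu k_nu * mu_nu."""
    return A0 + sum(
        a * cmath.exp(1j * (k1 * MU[0] + k2 * MU[1]) * x)
        for a, (k1, k2) in zip(A, K_INT)
    )

def h(y1, y2):
    """Auxiliary function of two variables, periodic because the k are integers."""
    return A0 + sum(
        a * cmath.exp(1j * (k1 * y1 + k2 * y2))
        for a, (k1, k2) in zip(A, K_INT)
    )

if __name__ == "__main__":
    x = 0.37
    print(f(x), h(MU[0] * x, MU[1] * x))
```

Periodicity in each argument holds only because the coordinates with respect to the basis are integers, which is the reason for introducing the basis at all.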


7 Putting in exponents arbitrarily as we have done after the N-th term changes the range of the function; while putting them in according to the basis as Bochner did in his paper (loc. cit.) and as we have done in the earlier terms would require the use of limit periodic functions if it were carried out for all the terms. The author wishes to thank Professor Norbert Wiener for suggesting this combination of the two methods.













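Moreover, the closure of the range of $h$ is the same as the closure of the range of

$$A_0 + \sum_{n=1}^{N} A_n \exp\Big[ i \sum_{j=1}^{p_n} \lambda_{n,j} x_j \Big].$$

Thus by Theorem I it follows that $F(g)$ has an absolutely convergent Fourier series

$$B_0 + \sum_{n=1}^{\infty} B_n \exp\Big[ i \sum_{j=1}^{p} \sum_{\nu=1}^{s} q_{n,j,\nu}\,y_{j,\nu} + i \sum_{j=N+1}^{l_n} r_{n,j}\,y_j \Big].$$

Hence

$$F[f(x_1, x_2, \dots)] = B_0 + \sum_{n=1}^{\infty} B_n \exp\Big[ i \sum_{j=1}^{p} \sum_{\nu=1}^{s} q_{n,j,\nu}\,\mu_\nu x_j + i \sum_{j=N+1}^{l_n} r_{n,j} \sum_{\nu=1}^{p_j} \lambda_{j,\nu} x_\nu \Big],$$

and Theorem II is proved.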


MASSACHUSETTS INSTITUTE OF TECHNOLOGY. 
































TRANSFORMATIONS ON SEQUENCE SPACES 
By L. W. COHEN AND NELSON DUNFORD


Hardy and Littlewood,$^1$ Littlewood$^2$ and others have given certain necessary and other sufficient conditions on the matrix $a_{ij}$ in order that the bilinear form $\sum\sum a_{ij}x_iy_j$ be bounded for $\sum |x_i|^p \le 1$, $\sum |y_i|^q \le 1$. So far as we know no conditions on the matrix $a_{ij}$ alone have been given which are necessary as well as sufficient for the boundedness of the corresponding bilinear form. In this paper we consider among other questions the more precise problem of determining the norm of the linear transformation $y = Tx$ on $l_p$ to $l_q$ in terms of the elements $a_{ij}$ of the matrix representing this transformation. We have been successful in the special cases where $T$ is on $l_1$ to $l_p$ or $c_0$ and, less trivially, on $l_p$ or $c_0$ to $l_1$, if $a_{ij} \ge 0$. Conditions for the absolute convergence of the determinant of $(\delta_{ij} + a_{ij})$ representing $I + T$ as well as properties of the matrix of minors are also obtained. These last conditions together with necessary and sufficient conditions for compactness have been given for a Banach space with a denumerable basis $\varphi_n$. In such a space each element $x$ is uniquely representable as

$$x = \sum_{n=1}^{\infty} x_n\varphi_n, \qquad x_n = T_n^{\Phi}x,$$

where $T_n^{\Phi}$ is a linear functional on the space $\Phi$ with $|T_n^{\Phi}| \le M_\Phi$. In such a space the convergence of $x^n$ to $x$ implies the uniform convergence of $x_i^n$ to $x_i$.

In view of the well-known theorems on uniform boundedness of sequences of linear operations and the known conditions for weak convergence in many Banach spaces, it is comparatively trivial to give the form and norm of the general linear operation with the range in $c$, $m = l_\infty$, $C$, $M$ (bounded functions), etc. Consequently these cases have been omitted from the discussion of such questions.

THEOREM 1. If $\Phi$ and $\Psi$ are Banach spaces with denumerable bases and $Tx = y$ is a linear transformation of $\Phi$ into $\Psi$, the transformation is represented by

$$y_i = \sum_{j=1}^{\infty} a_{ij}x_j, \tag{1}$$


Received June 2, 1937; presented to the American Mathematical Society, December, 1936, under the titles: L. W. Cohen, Transformations on spaces with denumerable basis; Nelson Dunford, Linear transformations of sequences.

1 G. H. Hardy and J. E. Littlewood, Bilinear forms bounded in space [p, q], Quarterly Journal of Math., (Oxford), vol. 5 (1934), pp. 241-254.

2 J. E. Littlewood, On bounded bilinear forms in an infinite number of variables, Quarterly Journal of Math., (Oxford), vol. 1 (1930), pp. 164-174.

689 











690 L. W. COHEN AND NELSON DUNFORD 


where

$$a_{ij} = T_i^{\Psi}T\varphi_j.$$

The $i$-th equation is a linear functional $L_ix$ on $\Phi$ such that

$$|L_i| \le |T|\,M_\Psi,$$

and the series (1) converge uniformly with respect to $i$. Conversely, if $a_{ij}$ is a matrix such that for every $x \in \Phi$ there is a $y \in \Psi$ satisfying

$$T_i^{\Psi}y = \sum_{j=1}^{\infty} a_{ij}\,T_j^{\Phi}x, \tag{2}$$

the transformation is continuous.
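Proof. For each $j$

$$T\varphi_j = \varphi_j^* = \sum_{i=1}^{\infty} a_{ij}\psi_i,$$

where $\psi_i$ is the basis of $\Psi$. For $x \in \Phi$, let $x^n = \sum_{j=1}^{n} x_j\varphi_j$. Then

$$y^n = Tx^n = \sum_{j=1}^{n} x_j\varphi_j^* = \sum_{i=1}^{\infty} \psi_i \sum_{j=1}^{n} a_{ij}x_j,$$

$$y = Tx = \sum_{i=1}^{\infty} y_i\psi_i,$$

and

$$\Big| y_i - \sum_{j=1}^{n} a_{ij}x_j \Big| \le M_\Psi\,\|y - y^n\| \le M_\Psi\,|T|\cdot\|x - x^n\|$$

imply the uniform convergence of the series (1) with respect to $i$. Since $|y_i| \le M_\Psi|T|\cdot\|x\|$, $|L_i| \le M_\Psi|T|$.

Now, conversely, for fixed $i$ and $n$ the function $\sum_{j=1}^{n} a_{ij}\,T_j^{\Phi}x$ is continuous in $x$ and hence for fixed $i$ its limit, i.e., the function

$$y_i = \sum_{j=1}^{\infty} a_{ij}\,T_j^{\Phi}x,$$

is also continuous in $x$. Thus

$$y^n = \sum_{i=1}^{n} \Big( \sum_{j=1}^{\infty} a_{ij}\,T_j^{\Phi}x \Big) \psi_i$$

as well as its limit $y$ is a continuous function of $x$.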

In specializing the general linear transformation Tz = y, one may require 
that it be completely continuous, i.e., that it carry a bounded set into a com- 
pact set. We state the following condition for compactness. 
















THEOREM 2. A set $X \subset \Phi$ is compact in $\Phi$ if and only if $X$ is bounded and

$$\lim_{n} \sum_{j=1}^{n} x_j\varphi_j = x$$

uniformly for $x \in X$.

Proof. From a sequence $(y^m)$ of points in $X$ a subsequence $(x^m)$ may be chosen such that, for all $i$,

$$\lim_{m} T_i^{\Phi}x^m = a_i.$$

Now

$$\lim_{n} \sum_{i=1}^{n} T_i^{\Phi}x^m\,\varphi_i = x^m$$

uniformly with respect to $m$ and

$$\lim_{m} \sum_{i=1}^{n} T_i^{\Phi}x^m\,\varphi_i$$

exists for all $n$. Thus $\lim_m x^m$ exists and $X$ is compact.


Conversely, suppose that $X$ is compact in $\Phi$. The sequence

$$f_n(x) = \sum_{i=1}^{n} x_i\varphi_i$$

of linear transformations is convergent, hence equi-uniformly continuous. For any $\epsilon > 0$ there is a $\delta_\epsilon > 0$ such that, for all $n$,

$$\|f_n(x)\| < \frac{\epsilon}{3}, \qquad \|x\| < \delta_\epsilon.$$

There are $x^1, \dots, x^{n(\epsilon)}$ in $X$ such that for $x \in X$ and some $i = 1, \dots, n(\epsilon)$

$$\|x - x^i\| < \min\Big(\delta_\epsilon, \frac{\epsilon}{3}\Big).$$

There is an $N_\epsilon$ such that $n > N_\epsilon$ implies

$$\|f_n(x^i) - x^i\| < \frac{\epsilon}{3} \qquad (i = 1, \dots, n(\epsilon)).$$

Now for $n > N_\epsilon$, $x \in X$ and some $i$, we have

$$\|f_n(x) - x\| \le \|f_n(x) - f_n(x^i)\| + \|f_n(x^i) - x^i\| + \|x^i - x\| < \epsilon.$$


COROLLARY 1. In $l_p$ $(1 \le p < \infty)$ a set $X$ is compact if and only if $X$ is bounded and

$$\lim_{n} \sum_{j=n}^{\infty} |x_j|^p = 0$$

uniformly with respect to $(x_j)$ in $X$.








COROLLARY 2. In $c$ (the space of convergent sequences $(x_i)$) a set $X$ is compact if and only if it is bounded and $\lim_i x_i$ exists uniformly with respect to $(x_i)$ in $X$.

COROLLARY 3. The general completely continuous linear regular transformation of $c$ into $c$ is

$$y_i = a_i \lim_{j} x_j + \sum_{j=1}^{\infty} a_{ij}x_j,$$

where

$$\lim_{i} a_i = 1, \qquad \lim_{i} \sum_{j=1}^{\infty} |a_{ij}| = 0$$

and the norm of the transformation is

$$\sup_{i} \Big\{ |a_i| + \sum_{j=1}^{\infty} |a_{ij}| \Big\}.$$

Thus, while every Toeplitz matrix is continuous, none is completely continuous on $c$ to $c$. We have the formulas

$$y_i = a_i \lim_{n} \sum_{j=1}^{\infty} b_{nj}x_j + \sum_{j=1}^{\infty} a_{ij}x_j,$$

$$y_i = a_i \mathop{\mathrm{Lim}}_{j} x_j + \sum_{j=1}^{\infty} a_{ij}x_j,$$

where $(b_{nj})$ is a Toeplitz matrix and Lim is a generalized limit defined for all bounded sequences. These formulas yield completely continuous regular summation methods whose domain of definition is the set of all bounded sequences.
From Theorem 2 we also have

THEOREM 3. If $T$ is linear and completely continuous on $\Phi$ to $\Psi$ and $\Psi$ has a bounded basis, then


uniformly in $j$.

It will appear below (Theorems 12, 13) that this condition is sufficient for the complete continuity of $T$ in the cases where $\Phi = l_1$, $\Psi = l_p$ $(1 \le p < \infty)$, $\Psi = c_0$.

To formulate sufficient conditions for complete continuity we may use the following theorem which is well known in case the transformations involved are linear.

THEOREM 4. If $T_nx = y$ is completely continuous on a Banach space $S$ to a Banach space $S'$ for each $n$, and if for any $\epsilon > 0$ there is an $n_\epsilon$ such that

$$\|Tx - T_nx\| \le \epsilon\,\|x\| \qquad (n > n_\epsilon),$$

then $Tx$ is completely continuous.

Proof. Let $X$ be a bounded subset of $S$ and $y^m$ a sequence in $TX$. There








is a sequence of sequences $x_m^n$ in $X$ such that each $Tx_m^n$ is a $y^m$, $(x_m^n) \supset (x_m^{n+1})$ and $\lim_m T_nx_m^n$ exists. Writing $x^n = x_n^n$ we have for $\epsilon > 0$

$$\|Tx^n - Tx^{n+k}\| \le \|Tx^n - T_mx^n\| + \|T_mx^n - T_mx^{n+k}\| + \|T_mx^{n+k} - Tx^{n+k}\| \le 2\epsilon M + \|T_mx^n - T_mx^{n+k}\|,$$

if $\|x\| \le M$ when $x \in X$ and $m > n_\epsilon$. Then for $n > n_\epsilon'$, $k > 0$ we have

$$\|Tx^n - Tx^{n+k}\| < (2M + 1)\epsilon.$$

Since $S'$ is complete, $TX$ is compact.
In case $T$ is a linear transformation on $\Phi$ to $\Psi$, we have

THEOREM 5. A necessary and sufficient condition for the complete continuity of $T$ is that $\lim_n |T_n| = 0$, where $T_nx = y$ is defined by the matrix

$$a_{ij}^{(n)} = 0, \quad i < n; \qquad a_{ij}^{(n)} = a_{ij}, \quad i \ge n,$$

associated with the matrix $(a_{ij})$ of $T$.

Proof. The sufficiency follows from the complete continuity of $T - T_n$, $\lim_n |T - (T - T_n)| = 0$ and Theorem 4. Conversely, let $y = Tx$ be completely continuous and

$$y = \sum_{i=1}^{\infty} y_i\psi_i, \qquad y^n = \sum_{i=n}^{\infty} y_i\psi_i.$$

If $S$ is the unit sphere in $\Phi$, then for every $\epsilon > 0$ there is an $n_\epsilon$ such that for all $n > n_\epsilon$ and $x \in S$,

$$\|y^n\| < \frac{\epsilon}{2}$$

because of Theorem 2. For any fixed $n > n_\epsilon$ there is an $x(n) \in S$ such that

$$|T_n| \le \|T_nx(n)\| + \frac{\epsilon}{2},$$

so that

$$|T_n| \le \|y^n(n)\| + \frac{\epsilon}{2} < \epsilon,$$

where $y(n) = Tx(n)$.
With a view to obtaining results on absolutely convergent determinants we state certain sufficient conditions in terms of the rows and the columns of the matrix $(a_{ij}) = (T_i^{\Psi}T\varphi_j)$ separately and impose the following conditions on $\Phi$.

(a) If $x \in \Phi$, then $x' = \sum_{n=1}^{\infty} |T_n^{\Phi}x|\,\varphi_n \in \Phi$ and $\|x\| = \|x'\|$.

(b) If $0 \le T_n^{\Phi}x \le T_n^{\Phi}x'$, then $\|x\| \le \|x'\|$.

These conditions are satisfied by the spaces $l_p$ and $c_0$.








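THEOREM 6. If $T$ is linear on $\Phi$ to $\Psi$ and

$$L = \sum_{i=1}^{\infty} |L_i|\,\psi_i \in \Psi,$$

then $T$ is completely continuous.

Proof. Let $T_n$ be defined by the matrix

$$a_{ij}^{(n)} = T_i^{\Psi}T\varphi_j, \quad i \le n; \qquad a_{ij}^{(n)} = 0, \quad i > n.$$

Since $T_nx$ lies in a finite-dimensional subspace of $\Psi$, $T_n$ is completely continuous. For any $\epsilon > 0$,

$$\|Tx - T_nx\| = \Big\| \sum_{i=n+1}^{\infty} y_i\psi_i \Big\| = \Big\| \sum_{i=n+1}^{\infty} |y_i|\,\psi_i \Big\| \le \|x\|\cdot\Big\| \sum_{i=n+1}^{\infty} |L_i|\,\psi_i \Big\| < \epsilon\,\|x\|,$$

if $n > n_\epsilon$. Thus $T$ is completely continuous by Theorem 4.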


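COROLLARY. If $\sum_{i=1}^{\infty} \Big( \sum_{j=1}^{\infty} |a_{ij}|^{p'} \Big)^{q/p'} < \infty$, then $y_i = \sum_{j=1}^{\infty} a_{ij}x_j$ is completely continuous on $l_p$ to $l_q$ $(p + p' = pp')$.

THEOREM 7. If $T$ is linear on $\Phi$ to $\Psi$ and

$$\sum_{j=1}^{\infty} \|\varphi_j^*\|\,x_j \qquad (\varphi_j^* = T\varphi_j)$$

is a linear functional on $\Phi$ such that for $\epsilon > 0$

$$\sum_{j=n+1}^{\infty} \|\varphi_j^*\|\cdot|x_j| < \epsilon\,\|x\| \qquad (n > n_\epsilon),$$

then $T$ is completely continuous.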
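Proof. Let $T_n$ be defined by the matrix

$$a_{ij}^{(n)} = T_i^{\Psi}T\varphi_j, \quad j \le n; \qquad a_{ij}^{(n)} = 0, \quad j > n.$$

$T_n$, defined on an $n$-dimensional space, is completely continuous. Now for $\epsilon > 0$, $n > n_\epsilon$ and $k > 0$

$$\Big\| \sum_{j=n+1}^{n+k} x_j\varphi_j^* \Big\| \le \sum_{j=n+1}^{n+k} |x_j|\cdot\|\varphi_j^*\| < \epsilon\,\|x\|.$$

Further, $T_nx = Tx^n = T\sum_{j=1}^{n} x_j\varphi_j = \sum_{j=1}^{n} x_j\varphi_j^*$, so that

$$\|Tx - T_nx\| = \Big\| \sum_{j=n+1}^{\infty} x_j\varphi_j^* \Big\| < \epsilon\,\|x\|.$$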















COROLLARY. If $\sum_{j=1}^{\infty} \Big( \sum_{i=1}^{\infty} |a_{ij}|^{q} \Big)^{p'/q} < \infty$, then $y_i = \sum_{j=1}^{\infty} a_{ij}x_j$ is completely continuous on $l_p$ to $l_q$.

The next stage of restriction on $T$ is made to assure the absolute convergence of the determinant of the matrix of $I + T$. The condition is essentially the combination of the conditions of the last two theorems.

THEOREM 8. The determinant of the matrix $(\delta_{ij} + a_{ij})$ defined by $I + T$ on $\Phi$ to $\Phi$ is absolutely convergent if

1. $L_ix = \sum_{j=1}^{\infty} |a_{ij}|\,x_j$ is a linear functional on $\Phi$;

2. $L = \sum_{i=1}^{\infty} |L_i|\,\varphi_i \in \Phi$, $\|L\| < 1$;

3. $\varphi x = \sum_{j=1}^{\infty} \|\varphi_j^*\|\,x_j$ is a linear functional on $\Phi$, $\varphi_j^* = T\varphi_j$;

4. $\sum_{i=1}^{\infty} |a_{ii}| = a < \infty$.
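Proof. According to the theorem of von Koch it is enough to show that

$$\sum_{i=1}^{\infty} |a_{ii}| + \sum_{n=1}^{\infty}\ \sum_{i, i_1, \dots, i_n = 1}^{\infty} |a_{ii_1}a_{i_1i_2}\cdots a_{i_ni}|$$

converges. We show first that for all $i$, $j$, $n$

$$\sum_{i_1, \dots, i_n = 1}^{\infty} |a_{ii_1}a_{i_1i_2}\cdots a_{i_nj}| \le |L_i|\cdot\|L\|^{n-1}\cdot\|\varphi_j^*\|. \tag{1}$$

For $n = 1$,

$$\sum_{i_1=1}^{\infty} |a_{ii_1}a_{i_1j}| = L_i\varphi_j^{*\prime} \le |L_i|\cdot\|\varphi_j^*\|.$$

From the inductive assumption we have

$$\sum_{i_1, \dots, i_{n+1} = 1}^{\infty} |a_{ii_1}a_{i_1i_2}\cdots a_{i_{n+1}j}| = \sum_{i_1=1}^{\infty} |a_{ii_1}| \sum_{i_2, \dots, i_{n+1} = 1}^{\infty} |a_{i_1i_2}\cdots a_{i_{n+1}j}| \le \sum_{i_1=1}^{\infty} |a_{ii_1}|\cdot|L_{i_1}|\cdot\|L\|^{n-1}\cdot\|\varphi_j^*\| = L_iL\cdot\|L\|^{n-1}\cdot\|\varphi_j^*\| \le |L_i|\cdot\|L\|^{n}\cdot\|\varphi_j^*\|.$$

Putting $i = j$ in (1) we have

$$\sum_{i=1}^{\infty}\ \sum_{i_1, \dots, i_n = 1}^{\infty} |a_{ii_1}a_{i_1i_2}\cdots a_{i_ni}| \le \|L\|^{n-1} \sum_{i=1}^{\infty} \|\varphi_i^*\|\cdot|L_i| = \|L\|^{n-1}\,\varphi L \le \|L\|^{n}\cdot|\varphi|.$$

Thus

$$\sum_{i=1}^{\infty} |a_{ii}| + \sum_{n=1}^{\infty}\ \sum_{i, i_1, \dots, i_n = 1}^{\infty} |a_{ii_1}a_{i_1i_2}\cdots a_{i_ni}| \le a + |\varphi| \sum_{n=1}^{\infty} \|L\|^{n} = a + |\varphi|\,\frac{\|L\|}{1 - \|L\|},$$

and the theorem is proved.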
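Inequality (1) of this proof can be checked numerically on a finite matrix. The sketch below assumes the space is a finite-dimensional $l_1$, so that the norm of the functional $L_i$ is the largest entry $|a_{ij}|$ in row $i$, the norm of $L$ is the sum of these row maxima, and the norm of $T\varphi_j$ is the absolute column sum. The matrix entries and all function names are hypothetical, and the check is an illustration rather than part of the original argument.

```python
def matmul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)]
            for i in range(n)]

def chain_bound_holds(a, n_max, tol=1e-12):
    """Check sum |a_{i i1} ... a_{in j}| <= |L_i| * ||L||^(n-1) * ||T phi_j|| for n <= n_max."""
    size = len(a)
    b = [[abs(v) for v in row] for row in a]
    row_norm = [max(row) for row in b]                  # |L_i| on l_1
    l_norm = sum(row_norm)                              # ||L|| in l_1
    col_norm = [sum(b[i][j] for i in range(size)) for j in range(size)]
    power = b
    for n in range(1, n_max + 1):
        # After this step power = |A|^(n+1); its (i, j) entry is the chain sum of order n.
        power = matmul(power, b)
        for i in range(size):
            for j in range(size):
                bound = row_norm[i] * l_norm ** (n - 1) * col_norm[j]
                if power[i][j] > bound + tol:
                    return False
    return True

# Hypothetical matrix with ||L|| = 0.2 + 0.15 + 0.2 = 0.55 < 1.
A = [[0.1, -0.2, 0.05],
     [0.0, 0.15, -0.1],
     [0.2, 0.05, 0.1]]

if __name__ == "__main__":
    print(chain_bound_holds(A, 5))
```

Since the chain sums are entries of powers of the matrix of absolute values, the geometric decay in $\|L\|$ is what makes the von Koch series converge when $\|L\| < 1$.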
The matrix of minors $A_{ij}$ has certain properties similar to those of the matrix $a_{ij}$.

THEOREM 9. $\sum_{j=1}^{\infty} |A_{ij}|\,\varphi_j \in \Phi$ for each $i$. $\sum_{i=1}^{\infty} |A_{ij}|\,x_i$ is a linear functional on $\Phi$ for each $j$.

Proof. Let $A_{rc}$ be the minor of $a_{rc}$ $(r \ne c)$. The matrix of $A_{rc}$, after a finite number of interchanges of rows and of columns has the form

$$\begin{pmatrix} a_{cr} & (a_{cj}) \\ (a_{ir}) & (\delta_{ij} + a_{ij}) \end{pmatrix} \qquad (i \ne c,\ j \ne r).$$

If $P = \prod (1 + |p|)$ is the product extended over all circular products

$$p = a_{ii_1}a_{i_1i_2}\cdots a_{i_ni} \qquad (i, i_1, \dots, i_n \text{ distinct}),$$

then

$$|A_{rc}| \le P\,\Big\{ |a_{cr}| + \sum_{n=1}^{\infty}\ \sum_{i_1, \dots, i_n = 1}^{\infty} |a_{ci_1}a_{i_1i_2}\cdots a_{i_nr}| \Big\}.$$

From (1) of the previous proof, since $|a_{cr}| \le |L_c|\cdot\|\varphi_r\|$, we have

$$|A_{rc}| \le |L_c|\,P\,\Big\{ \|\varphi_r\| + \frac{\|\varphi_r^*\|}{1 - \|L\|} \Big\}$$

for $r \ne c$. But $|A_{ii}| \le P$ for all $i$. Hence under the postulates (a) and (b) the theorem holds.

We turn to a consideration of the spaces $l_p$ and $c_0$ in order to determine the exact conditions for continuity and complete continuity of the transformation and to evaluate the norm in certain cases.

THEOREM 10. The equations

$$y_i = \sum_{j=1}^{\infty} a_{ij}x_j \tag{1}$$

































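define a linear transformation on $l_1$ to $l_p$ if and only if

$$\sup_{j} \Big( \sum_{i=1}^{\infty} |a_{ij}|^p \Big)^{1/p} < \infty, \tag{a}$$

and this constant is the norm of the transformation.

Proof. Let $\varphi_j$ be the vector with one in the $j$-th place and zero elsewhere. Then

$$\sup_{j} \Big( \sum_{i=1}^{\infty} |a_{ij}|^p \Big)^{1/p} = \sup_{\|x\|=1} \sum_{j=1}^{\infty} |x_j| \Big( \sum_{i=1}^{\infty} |a_{ij}|^p \Big)^{1/p} \ge \sup_{\|x\|=1} \Big( \sum_{i=1}^{\infty} \Big| \sum_{j=1}^{\infty} a_{ij}x_j \Big|^p \Big)^{1/p} = |T| \ge \sup_{j} \|T\varphi_j\| = \sup_{j} \Big( \sum_{i=1}^{\infty} |a_{ij}|^p \Big)^{1/p}.$$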
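The statement that the norm on $l_1$ to $l_p$ is the largest column $p$-norm can be tried on a small finite matrix. The sketch below is a hedged illustration with a hypothetical 2 by 3 matrix and $p = 2$; the helper names are not from the paper.

```python
def p_norm(v, p):
    """The l_p norm of a finite vector."""
    return sum(abs(t) ** p for t in v) ** (1.0 / p)

def apply(a, x):
    """Matrix-vector product y_i = sum_j a_ij x_j."""
    return [sum(row[j] * x[j] for j in range(len(x))) for row in a]

def column_bound(a, p):
    """sup over columns j of (sum_i |a_ij|^p)^(1/p)."""
    cols = len(a[0])
    return max(p_norm([row[j] for row in a], p) for j in range(cols))

# Hypothetical matrix; its second column has the largest 2-norm, sqrt(13).
A = [[1.0, -2.0, 0.5],
     [0.0, 3.0, -1.0]]

if __name__ == "__main__":
    bound = column_bound(A, 2)
    # Vectors of l_1 norm 1: the image norm never exceeds the bound ...
    for x in ([0.2, -0.5, 0.3], [1.0, 0.0, 0.0], [0.0, 0.0, -1.0]):
        print(p_norm(apply(A, x), 2), "<=", bound)
    # ... and the bound is attained at the unit vector of the extremal column.
    print(p_norm(apply(A, [0.0, 1.0, 0.0]), 2), "==", bound)
```

The upper bound is Minkowski's inequality applied to the columns weighted by $|x_j|$, and the lower bound comes from testing on the unit vectors, exactly as in the chain of relations in the proof.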


THEOREM 11. The equations (1) define a linear transformation on $l_1$ to $c_0$ if and only if

$$\text{(a')}\quad \sup_{j} \sup_{i} |a_{ij}| < \infty, \qquad \text{(a'')}\quad \lim_{i} a_{ij} = 0,$$

and the constant (a') is the norm of the transformation.

Proof. Let $\varphi_j$ be the unit vector with one in the $j$-th place. Then $\lim_i a_{ij} = 0$ since $T\varphi_j = (a_{ij}) \in c_0$ for each $j$ and

$$\sup_{j} \sup_{i} |a_{ij}| = \sup_{j} \|T\varphi_j\| \le |T| = \sup_{\|x\|=1} \sup_{i} \Big| \sum_{j=1}^{\infty} a_{ij}x_j \Big| \le \sup_{i} \sup_{j} |a_{ij}|.$$


THEOREM 12. The equations (1) define a completely continuous transformation on $l_1$ to $l_p$ $(1 \le p < \infty)$ if and only if the matrix $(a_{ij})$ satisfies the conditions (a) and

$$\lim_{n} \sup_{j} \Big( \sum_{i=n}^{\infty} |a_{ij}|^p \Big)^{1/p} = 0. \tag{b}$$

Proof. The necessity of (a) follows from Theorem 10 and that of (b) from Theorem 3. The conditions are sufficient since

$$\Big( \sum_{i=n}^{\infty} |y_i|^p \Big)^{1/p} \le \sup_{j} \Big( \sum_{i=n}^{\infty} |a_{ij}|^p \Big)^{1/p} \sum_{j=1}^{\infty} |x_j|$$

and Theorem 2 show that bounded sets go into compact sets.
THEOREM 13. The equations (1) define a completely continuous transformation on $l_1$ to $c_0$ if and only if the matrix $(a_{ij})$ satisfies the conditions (a') and

$$\text{(b')}\quad \lim_{i} a_{ij} = 0 \text{ uniformly in } j.$$










Proof. The necessity of these conditions follows from Theorems 11 and 3. The conditions are sufficient since

$$\sup_{i \ge n} |y_i| \le \sup_{i \ge n} \sup_{j} |a_{ij}| \sum_{j=1}^{\infty} |x_j|$$

and Theorem 2 show that the image of a bounded set is compact.

The roles of $l_1$ and $l_p$ or $c_0$ in these theorems may be interchanged if the $a_{ij}$
are positive and the norms of the transformations evaluated. The evaluations
are corollaries of the following theorem in which the postulates (a) and (b)
on $\Phi$ used in Theorems 8 and 9 appear in a weaker form as one restriction on $\Phi$
and one on the conjugate space $\bar{\Phi}$.

THEOREM 14. Let $a_{ij} \ge 0$ and $\Phi$ be a space with a basis $\varphi_i$ such that

(i) if $\sum x_i \varphi_i$ converges so does $\sum |x_i| \varphi_i$;

(ii) the norm of a point $(\alpha_i) \in \bar{\Phi}$ is not decreased when the values $|\alpha_i|$ increase.

Then equations (1) define a linear transformation on $\Phi$ to $l_1$ if and only if $\Big( \sum_{i=1}^{\infty} a_{ij} \Big) \in \bar{\Phi}$
(i.e., if and only if $\sum_{j=1}^{\infty} \sum_{i=1}^{\infty} a_{ij} x_j$ converges whenever $\sum x_i \varphi_i$ converges). Furthermore
the constant

$$\Big\| \Big( \sum_{i=1}^{\infty} a_{ij} \Big) \Big\|_{\bar{\Phi}}$$

is the norm of the transformation.
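Proof. Let f(y) be the linear functional on $l_1$ defined by

$$f(y) = \sum_{i=1}^{\infty} y_i.$$

If equations (1) define a linear transformation y = Tx on $\Phi$ to $l_1$, then fT is a
linear functional on $\Phi$ with $|fT| \le |T|$. But

$$fTx = \sum_{i=1}^{\infty} \sum_{j=1}^{\infty} a_{ij} x_j = \sum_{j=1}^{\infty} \Big( \sum_{i=1}^{\infty} a_{ij} \Big) x_j.$$

If $(x_j)$ is the unit vector with one in the j-th place, we see that

$$(\alpha_j) = \Big( \sum_{i=1}^{\infty} a_{ij} \Big) \in \bar{\Phi}$$

and

If $x = \sum x_i \varphi_i$ and f is in $\bar{\Phi}$, then $fx = \sum x_i \alpha_i$, where $\alpha_i = f\varphi_i$.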
















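In what follows f will stand for an arbitrary linear functional on $l_1$. From a
well-known theorem (Banach, Théorie des Opérations Linéaires, page 55,
Theorem 3) we have

$$\sup_{|f|=1} |fT| = |T|.$$

Now f is represented by a vector $(f_i)$ in the space of bounded sequences and
$|f| = \sup_i |f_i|$. Also

$$fTx = f\Big( \sum_{j=1}^{\infty} a_{ij} x_j \Big) = \sum_{i=1}^{\infty} f_i \sum_{j=1}^{\infty} a_{ij} x_j = \sum_{j=1}^{\infty} \Big( \sum_{i=1}^{\infty} f_i a_{ij} \Big) x_j,$$

so that

$$|fT| = \Big\| \Big( \sum_{i=1}^{\infty} f_i a_{ij} \Big) \Big\|_{\bar{\Phi}}.$$

Thus

$$|T| = \sup_{|f|=1} \Big\| \Big( \sum_{i=1}^{\infty} f_i a_{ij} \Big) \Big\|_{\bar{\Phi}} = \sup_{|f_i| \le 1} \Big\| \Big( \sum_{i=1}^{\infty} f_i a_{ij} \Big) \Big\|_{\bar{\Phi}} = \Big\| \Big( \sum_{i=1}^{\infty} a_{ij} \Big) \Big\|_{\bar{\Phi}}.$$

Hence we have shown that if y = Tx is linear on $\Phi$ to $l_1$, then $\Big( \sum_{i=1}^{\infty} a_{ij} \Big) \in \bar{\Phi}$ and
$|T| = \Big\| \Big( \sum_{i=1}^{\infty} a_{ij} \Big) \Big\|_{\bar{\Phi}}$. Now conversely if $\Big( \sum_{i=1}^{\infty} a_{ij} \Big) \in \bar{\Phi}$, we have the iterated sum

$$\sum_{j=1}^{\infty} \sum_{i=1}^{\infty} a_{ij} x_j$$

converging absolutely and hence for every $x \in \Phi$ the sequence

$$\Big( \sum_{j=1}^{\infty} a_{ij} x_j \Big) \in l_1.$$

By Theorem 1 then the transformation is continuous.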
COROLLARY 1. In case the $a_{ij} \ge 0$, the equations (1) define a linear transformation
on $l_p$ (p > 1) to $l_1$ if and only if

$$\Big( \sum_{j=1}^{\infty} \Big( \sum_{i=1}^{\infty} a_{ij} \Big)^{p/(p-1)} \Big)^{(p-1)/p} < \infty.$$

This constant is the norm of the transformation.
COROLLARY 2. In case $a_{ij} \ge 0$, the equations (1) define a linear transformation
on $c_0$ to $l_1$ if and only if

$$\sum_{j=1}^{\infty} \sum_{i=1}^{\infty} a_{ij} < \infty.$$

This constant is the norm of the transformation.
Using these corollaries, the corollary to Theorem 7, and Theorem 2 we get
COROLLARY 3. A linear transformation with $a_{ij} \ge 0$ on $l_p$ $(1 < p < \infty)$
or $c_0$ to $l_1$ is necessarily completely continuous.













Part of this corollary but not the evaluation of the norm given in Corollary 1
has been proved in more general form by H. R. Pitt. He has shown that any
linear transformation on $l_p$ to $l_q$ is necessarily completely continuous if p > q.
Littlewood has given the same result for $p = \infty$, q = 1.

COROLLARY 4. If T is linear on $c_0$ to $l_1$, and $a_{ij} \ge 0$, then the determinant of
I + T is absolutely convergent.

In fact, it follows from Corollary 2 that the determinant of $(\delta_{ij} + a_{ij})$ is a
normal determinant in the sense of von Koch.

We conclude with the representation of the general completely continuous
operation on L (the space of functions $\varphi(P)$ summable on (0, 1)) to $l_q$
$(1 \le q < \infty)$.

THEOREM 15. The function T is a completely continuous linear operation on L
to $l_q$ if and only if it is expressible in the form

(a) $T\varphi = \Big\{ \int_0^1 k_i(P)\varphi(P)\,dP \Big\}$

with measurable functions $k_i(P)$ satisfying

(b) $\operatorname{ess\,sup}_P \Big( \sum_{i=1}^{\infty} |k_i(P)|^q \Big)^{1/q} < \infty$,

(c) $\lim_n \operatorname{ess\,sup}_P \Big( \sum_{i=n}^{\infty} |k_i(P)|^q \Big)^{1/q} = 0$.

The constant in (b) is the norm of the transformation.

Proof. It is known that $T\varphi$ is linear on L to $l_q$ if and only if (a) and (b) are
satisfied and that (b) gives the norm of the transformation. Let $T_n\varphi$ be defined
as the vector

$$\Big( 0, \ldots, 0, \int_0^1 k_n(P)\varphi(P)\,dP, \int_0^1 k_{n+1}(P)\varphi(P)\,dP, \ldots \Big).$$

Then

$$\|T_n\varphi\| \le \operatorname{ess\,sup}_P \Big( \sum_{i=n}^{\infty} |k_i(P)|^q \Big)^{1/q} \|\varphi\|.$$

Corollary 1 of Theorem 2 then shows that (c) is sufficient for the complete continuity
of T. Conversely, if T is completely continuous, we have for every
$\epsilon > 0$ an $n(\epsilon)$ such that for every $\varphi \in L$ with $\|\varphi\| \le 1$

$$\|T_n\varphi\| = \Big( \sum_{i=n}^{\infty} \Big| \int_0^1 k_i(P)\varphi(P)\,dP \Big|^q \Big)^{1/q} \le \epsilon \qquad (n \ge n(\epsilon)).$$


H. R. Pitt, A note on bilinear forms, Journal of the London Math. Soc., vol. 11 (1936),
pp. 174-180.

Loc. cit.

See Dunford, Integration and linear operations, Trans. Amer. Math. Soc., vol. 40 (1936),
p. 4%, or B. Vulich, Sur les opérations linéaires dans l'espace des fonctions sommables,
Mathematica, vol. 14 (1937), p. 12, Theorem 1.




























There exists a $\varphi_n$ in L with $\|\varphi_n\| = 1$ such that

$$|T_n| \le \|T_n\varphi_n\| + \epsilon.$$

Thus for $n \ge n(\epsilon)$

$$|T_n| = \operatorname{ess\,sup}_P \Big( \sum_{i=n}^{\infty} |k_i(P)|^q \Big)^{1/q} \le 2\epsilon,$$

and so (c) is also necessary.


UNIVERSITY OF KENTUCKY AND YALE UNIVERSITY. 













ON PERFECT METHODS OF SUMMABILITY 
By J. D. Hill


1. Introduction. In this paper we are concerned exclusively with Toeplitz
methods of summability in the real domain, and we begin by introducing the
definitions and notations which we shall employ. Being given a matrix
$A = (a_{nk})$ $(k, n = 0, 1, 2, \ldots)$ and a sequence $x = \{s_k\}$, we may form the new
sequence $y = A(x) = \{t_n\}$ provided each of the series $\sum_{k=0}^{\infty} a_{nk} s_k = t_n = A_n(x)$
is convergent. If y belongs to the space (c) of convergent sequences, we say that
x is summable by the method A, or simply A-summable, and we write A-lim x =
lim y. The class [A] of all A-summable sequences is called the convergence-field
of A. If for two methods A and B we have the relation $[A] \subset [B]$, we say
that B is not weaker than A. A and B are said to be consistent if A-lim x =
B-lim x whenever these limits exist. The method I defined by the matrix
$(\delta_{nk})$, where $\delta_{nk}$ is Kronecker's symbol, is called the identical method or the
identity; obviously [I] = (c). Every method A for which $[I] \subset [A]$ is called
convergence-preserving; if, in addition, A is consistent with I, it is said to be
regular. If the matrix $(a_{nk})$ is such that $a_{nk} = 0$ for k > n, A is said to be
triangular; if, furthermore, $a_{nn} \ne 0$ for every n, A is said to be normal. A will
be called reversible if the equation A(x) = y has exactly one solution x, convergent
or not, for each y in (c). For triangular methods the notions of reversibility
and normality are easily seen to be equivalent.

For future reference we list here the following conditions which are necessary
and sufficient for A to be regular:

(1.1) $\lim_{n \to \infty} a_{nk} = 0$ $(k = 0, 1, 2, \ldots)$,

(1.2) $\lim_{n \to \infty} \sum_{k=0}^{\infty} a_{nk} = 1$,

(1.3) $\sum_{k=0}^{\infty} |a_{nk}| \le K$ $(n = 0, 1, 2, \ldots)$.
‘A ’ 


We shall say¹ that A is of type M if the conditions

(1.4) $\sum_{n=0}^{\infty} |\alpha_n| < \infty$, $\sum_{n=0}^{\infty} \alpha_n a_{nk} = 0$ $(k = 0, 1, 2, \ldots)$

always imply

(1.5) $\alpha_n = 0$ $(n = 0, 1, 2, \ldots)$.

Received August 16, 1937. The author gratefully acknowledges his indebtedness to
Professor J. D. Tamarkin, under whose direction this paper was written.

1 Matrices of this type were first introduced by Mazur in connection with normal
methods; see Eine Anwendung der Theorie der Operationen bei der Untersuchung der
Toeplitzschen Limitierungsverfahren, Studia Mathematica, vol. 2 (1930), pp. 40-50. We shall
refer to this paper hereafter as SM.


Banach² calls a method perfect if it is simultaneously regular, reversible, and of
type M. The importance of perfect methods lies in the following theorems.

THEOREM 1 (Mazur).³ In order that a normal regular method A be consistent
with every regular method not weaker than A, it is necessary and sufficient that A
be of type M.

Banach has shown that the sufficiency may be extended to methods which
are not necessarily normal.

THEOREM 2 (Banach).⁴ In order that A be consistent with every regular method
not weaker than A, it is sufficient that A be perfect.

The only known examples of perfect methods are the Cesàro and Euler
methods of all positive orders, a result obtained by Mazur in SM. It is the
purpose of the present paper to find conditions under which the Nörlund,
Hausdorff, and weighted-mean methods will be of type M. We start, however,
with a few theorems of a general nature.


2. Perfect methods in general. Let $A = (a_{nk})$ be a given regular matrix.
If we regard A(x) as an operation on (c), the regularity insures that its range
$R_c$ will be a subset of (c), and we then have the following characterization of
perfect methods.

THEOREM 3. In order that a regular and reversible method A be of type M, it is
necessary and sufficient that $R_c$ be dense in (c).

Proof. The assertion of the necessity is merely an alternative statement of a
result due to Banach.⁵ To establish the sufficiency consider the points $y_i = \{\delta_{ni}\}$
of (c) $(i = 0, 1, 2, \ldots)$. If $R_c$ is dense in (c), there exists corresponding
to each $\epsilon > 0$ and each $i = 0, 1, 2, \ldots$ a convergent (and hence bounded)
sequence $\{s_{ki}\} = x_i$, such that $\|A(x_i) - y_i\| < \epsilon$, which is equivalent to

(2.1) $|A_n(x_i) - \delta_{ni}| < \epsilon$ $(n, i = 0, 1, 2, \ldots)$.

We write (2.1) in the form

(2.2) $A_n(x_i) = \delta_{ni} + \epsilon_{ni}$, where $|\epsilon_{ni}| < \epsilon$,

and assume that (1.4) holds. Then for each fixed i it is clear from (1.3) and
the boundedness in k of $\{s_{ki}\}$ that $\sum_{n,k} \alpha_n a_{nk} s_{ki}$ is absolutely convergent.
Consequently, from (1.4) and (2.2) we have

$$0 = \sum_{k=0}^{\infty} \sum_{n=0}^{\infty} \alpha_n a_{nk} s_{ki} = \sum_{n=0}^{\infty} \alpha_n \Big\{ \sum_{k=0}^{\infty} a_{nk} s_{ki} \Big\} = \sum_{n=0}^{\infty} A_n(x_i)\alpha_n = \sum_{n=0}^{\infty} \delta_{ni}\alpha_n + \sum_{n=0}^{\infty} \epsilon_{ni}\alpha_n = \alpha_i + \sum_{n=0}^{\infty} \epsilon_{ni}\alpha_n.$$

Therefore $|\alpha_i| \le \epsilon \sum_{n=0}^{\infty} |\alpha_n|$, and, $\epsilon > 0$ being arbitrary, it follows that
$\alpha_i = 0$ $(i = 0, 1, 2, \ldots)$. This completes the proof.

2 Théorie des Opérations Linéaires, p. WW.

3 See SM, p. 48, Satz 7. It may be remarked here that Mazur (see Mathematische
Zeitschrift, vol. 28 (1928), pp. 604-605) has constructed two normal regular methods, each
not weaker than the other, which assign different limits to a particular sequence.

4 Loc. cit., p. 95, Théorème 12.

5 Loc. cit., p. 98, Lemme 2.

On the other hand, if we consider A(x) as an operation defined in the space
(m) of bounded sequences, its range $R_m$ lies in (m) on account of (1.3), and it is
readily seen from the foregoing proof that the following theorem holds.

THEOREM 4. In order that a regular and reversible method A be of type M it is
necessary and sufficient that the points $y_i = \{\delta_{ni}\}$ of (m) be limit points of $R_m$
for all $i = 0, 1, 2, \ldots$.

THEOREM 5. The product AB = C of two triangular perfect methods A and B
is also a triangular perfect method.

Proof. If the matrices of A, B, and C are, respectively, $(a_{nk})$, $(b_{nk})$, and
$(c_{nk})$, then $c_{nk} = \sum_{i=k}^{n} a_{ni} b_{ik}$ if $k \le n$, and $c_{nk} = 0$ if k > n, so that C is triangular.
Moreover, it is obvious that C is normal and regular. Conditions (1.4) applied
to C give

(2.3) $\sum_{n=k}^{\infty} \alpha_n c_{nk} = \sum_{n=k}^{\infty} \sum_{i=k}^{n} \alpha_n a_{ni} b_{ik} = \sum_{i=k}^{\infty} \Big( \sum_{n=i}^{\infty} \alpha_n a_{ni} \Big) b_{ik} = 0$ $(k = 0, 1, 2, \ldots)$,

where the interchange of summation signs is permitted by the absolute convergence
of $\sum_{n,i} \alpha_n a_{ni} b_{ik}$. Setting

(2.4) $\beta_i = \sum_{n=i}^{\infty} \alpha_n a_{ni}$ $(i = 0, 1, 2, \ldots)$,

we have $\sum_i |\beta_i| < \infty$ and $\sum_{i=k}^{\infty} \beta_i b_{ik} = 0$ for $k = 0, 1, 2, \ldots$. Since B is of
type M, this implies $\beta_i = 0$ for $i = 0, 1, 2, \ldots$. This in turn, by (2.4) and
the fact that A is of type M, implies $\alpha_n = 0$ for every n. Thus C is also of
type M, and the proof is complete.

THEOREM 6. If the product AB = C of two triangular convergence-preserving
methods A and B is of type M, then A must be of type M.

Proof. If A is not of type M, there must exist a sequence $\{\alpha_n\}$ satisfying
(1.4) but not (1.5). Since (1.3) is also necessary for preservation of convergence,
the relation (2.3) can then be established for the sequence $\{\alpha_n\}$.
This contradicts the assumption that C is of type M.

We shall have occasion later to use the following theorem, the proof of which
is easily supplied.

THEOREM 7. If A is normal and B is triangular, then B is not weaker than A
if and only if $BA^{-1}$ is convergence-preserving.










3. The Nörlund method. The Nörlund method of summation corresponds
to a triangular matrix whose elements are of the form $a_{nk} = p_{n-k}/P_n$ for
$k = 0, 1, \ldots, n$; $n = 0, 1, 2, \ldots$, where $\{p_k\}$ is a given sequence such that
$P_n = p_0 + p_1 + \cdots + p_n \ne 0$ for all n. In particular, for n = 0 we have
$p_0 \ne 0$, and we may always take $p_0 = 1$ since $p_k$ may be replaced by $p_k/p_0$
without affecting the matrix. Then in view of $a_{nn} = 1/P_n \ne 0$ we see that
every Nörlund matrix is normal. Furthermore, since $\sum_{k=0}^{n} p_{n-k}/P_n = 1$
$(n = 0, 1, 2, \ldots)$, the regularity conditions reduce to (1.1) and (1.3) alone.
In the special case where $p_k \ge 0$ for every k, (1.3) is always fulfilled and on
account of $p_{n-k}/P_n \le p_{n-k}/P_{n-k}$ the conditions become simply $\lim p_n/P_n = 0$.

The question naturally arises whether a regular Nörlund matrix is necessarily
of type M, and the following example shows that this is not the case.

Example 1. Let $p_1 = 2$ and $p_k = 0$ for $k \ge 2$; since $p_k \ge 0$ and $p_k/P_k = 0$
if $k \ge 2$, the corresponding method is regular by the above remark. On the
other hand, conditions (1.4), but not (1.5), are satisfied if we let $\alpha_0 = \tfrac{1}{3}$ and
$\alpha_n = (-\tfrac{1}{2})^n$ for $n \ge 1$.
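Example 1 can be verified with exact rational arithmetic. The sketch below is an illustration rather than part of the paper; it assumes the normalization $p_0 = 1$ used in this section and takes $\alpha_0 = 1/3$, the value forced by the k = 0 equation of (1.4). The function names are hypothetical.

```python
from fractions import Fraction

def p(j):
    """The sequence of Example 1: p_0 = 1, p_1 = 2, p_k = 0 for k >= 2."""
    return {0: 1, 1: 2}.get(j, 0)

def norlund_entry(n, k):
    """a_nk = p_{n-k} / P_n for k <= n, and 0 for k > n."""
    if k > n:
        return Fraction(0)
    partial_sum = sum(p(j) for j in range(n + 1))  # P_n
    return Fraction(p(n - k), partial_sum)

def alpha(n):
    """alpha_0 = 1/3 and alpha_n = (-1/2)^n for n >= 1."""
    return Fraction(1, 3) if n == 0 else Fraction(-1, 2) ** n

def column_sum(k, terms=10):
    """sum over n >= k of alpha_n * a_nk; only n = k and n = k + 1 contribute."""
    return sum(alpha(n) * norlund_entry(n, k) for n in range(k, k + terms))
```

Every column sum vanishes although the $\alpha_n$ do not, and $\sum |\alpha_n|$ is a convergent geometric series, so (1.4) holds while (1.5) fails.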

THEOREM 8. For a regular Nörlund method to be of type M it is sufficient that
the sequence

$$D_n = \begin{vmatrix} p_1 & 1 & 0 & \cdots & 0 \\ p_2 & p_1 & 1 & \cdots & 0 \\ \vdots & \vdots & \vdots & & \vdots \\ p_{n-1} & p_{n-2} & p_{n-3} & \cdots & 1 \\ p_n & p_{n-1} & p_{n-2} & \cdots & p_1 \end{vmatrix} \qquad (n = 1, 2, 3, \ldots)$$

be bounded.

Proof. We obtain this result as a consequence of Theorem 4 by imposing the
condition that $R_m$ actually contain the points $y_i$. This is equivalent to requiring
that the solution $\{s_{ki}\}$ of the system of equations

(3.1) $A_n(x) = \sum_{k=0}^{n} (p_{n-k}/P_n) s_{ki} = \delta_{ni}$ $(n = 0, 1, 2, \ldots)$

be bounded in k for each fixed $i = 0, 1, 2, \ldots$. Observing that the first i of
the $s_{ki}$ are zero, we may write (3.1) in the form

(3.2) $\sum_{k=0}^{n} p_{n-k} s_{i+k,i} = P_{i+n}\,\delta_{i+n,i}$ $(n = 0, 1, 2, \ldots)$

from which we obtain by applying Cramer's rule to the first j + 1 of these
equations

(3.3) $s_{i+j,i} = (-1)^j P_i D_j$ $(i, j = 0, 1, 2, \ldots;\ D_0 = 1)$,

and the theorem follows.


By condition (1.1) the regularity of a Nörlund method implies that
$p_{n+1}/P_{n+1} = (P_{n+1} - P_n)/P_{n+1} = 1 - (P_n/P_{n+1}) \to 0$, or that $P_n/P_{n+1} \to 1$
as $n \to \infty$. Thus the power series $\sum_{n=0}^{\infty} P_n z^n$ has a radius of convergence equal to 1,
and consequently the radius of convergence of $p(z) = \sum_{n=0}^{\infty} p_n z^n = (1 - z) \sum_{n=0}^{\infty} P_n z^n$
is at least 1. Combining (3.2) and (3.3) gives the relation $\sum_{k=0}^{n} (-1)^k p_{n-k} D_k = \delta_{n0}$
for $n = 0, 1, 2, \ldots$, from which we obtain the formula

(3.4) $1/p(z) = \sum_{n=0}^{\infty} (-1)^n D_n z^n$,

valid for |z| sufficiently small since p(0) = 1. Hence for each regular Nörlund
method there exist positive constants A and B such that $|D_n| < AB^n$ for all n,
and the condition of Theorem 8 will be satisfied if and only if B can be chosen
$\le 1$. That the latter is by no means a necessary condition is shown by the
following example.

Example 2. For each $r = 1, 2, 3, \ldots$ there exists a perfect Nörlund method
for which the corresponding $D_n$ is precisely $O(n^{r-1})$. For let r be chosen arbitrarily
and fixed. Define $p_n$ as $\binom{r}{n}$ for $n = 0, 1, \ldots, r$ and as zero otherwise.
We then have $p(z) = (1 + z)^r$ and $1/p(z) = \sum_{n=0}^{\infty} (-1)^n \binom{n+r-1}{n} z^n$, so that by
(3.4), $D_n = \binom{n+r-1}{n} = O(n^{r-1})$.

The $p_n$ being non-negative and ultimately zero, the regularity is apparent.
Let us then assume that conditions (1.4) are satisfied. For $k \ge r$ these reduce to

(3.5) $L_r(\alpha_k) = \sum_{i=0}^{r} \binom{r}{i} \alpha_{k+i} = 0$ $(k = r, r+1, r+2, \ldots)$.

The latter implies $L_{r-1}(\alpha_k) = 0$ for all $k \ge r$. For we have evidently

$$L_r(\alpha_k) = L_{r-1}(\alpha_k) + L_{r-1}(\alpha_{k+1}) = 0,$$
$$L_r(\alpha_{k+1}) = L_{r-1}(\alpha_{k+1}) + L_{r-1}(\alpha_{k+2}) = 0,$$

whence $L_{r-1}(\alpha_k) = L_{r-1}(\alpha_{k+2})$. Since $\alpha_n \to 0$ it is clear that $\{L_{r-1}(\alpha_{k+2i})\}$ for
each fixed $k \ge r$ is a sequence of equal terms converging to zero. This establishes
the assertion.

It follows then by induction that conditions (3.5) imply $L_0(\alpha_k) = \alpha_k = 0$
for all $k \ge r$. The first r equations in (1.4) now take a simplified form from
which it is obvious that $\alpha_k = 0$ for $k = 0, 1, \ldots, r - 1$, and the example is
complete.

We conclude this section with a few instances in which the criterion of Theorem
8 applies.

Mazur's result for Cesàro summability $(C, \alpha > 0)$ is immediate. For we have
$p_n = \binom{n+\alpha-1}{n}$, $p(z) = (1 - z)^{-\alpha}$, and $1/p(z) = \sum_{n=0}^{\infty} (-1)^n \binom{\alpha}{n} z^n$; thus
$D_n = \binom{\alpha}{n} \to 0$ as $n \to \infty$.

If $p_n = 1/(n + 1)$, we obtain the so-called logarithmic Nörlund method.
In this case $1/p(z) = -z/\log(1 - z) = \sum_{n=0}^{\infty} (-1)^n D_n z^n$, and it is known that
the coefficients $D_n$ in this expansion are⁶ $O(1/(n \log^2 n))$.

Finally, consider the method defined by the sequence $p_n = 1 + nd$ for a given
d > 0. Since $P_n = (n + 1)(nd + 2)/2$, the regularity follows at once. Moreover,
this method is actually stronger than the identity since one may show
that it assigns the limit $\tfrac{1}{2}$ to the divergent sequence $\{(1 + (-1)^n)/2\}$. We
have here $p(z) = (1 + (d - 1)z)/(1 - z)^2$ and

$$1/p(z) = 1 - (1 + d)z + d^2 \sum_{n=2}^{\infty} (1 - d)^{n-2} z^n,$$

whence $D_n = d^2(d - 1)^{n-2}$ for $n \ge 2$. Thus if $0 < d \le 2$, this method will
be perfect; for other values of d the question remains undecided.
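The determinants $D_n$ in this section can be generated without expanding any determinant, by solving the relation $\sum_{k=0}^{n} (-1)^k p_{n-k} D_k = \delta_{n0}$ recursively. The following sketch is an illustration under the normalization $p_0 = 1$; the function name `d_sequence` and the sample values r = 3, d = 3/2 are assumptions chosen for the check.

```python
from fractions import Fraction
from math import comb

def d_sequence(p, count):
    """Return [D_0, ..., D_{count-1}] for a sequence p with p(0) == 1.

    Solves sum_{k=0}^{n} (-1)^k p(n-k) D_k = 0 for D_n, n >= 1, with D_0 = 1.
    """
    d = [1]
    for n in range(1, count):
        known = sum((-1) ** k * p(n - k) * d[k] for k in range(n))
        d.append((-1) ** (n + 1) * known)
    return d

# Example 2 with r = 3: p_n = C(3, n), expected D_n = C(n + 2, n).
R = 3
binomial_d = d_sequence(lambda n: comb(R, n), 10)

# The final instance with d = 3/2: p_n = 1 + n d,
# expected D_1 = 1 + d and D_n = d^2 (d - 1)^(n - 2) for n >= 2.
D_PARAM = Fraction(3, 2)
linear_d = d_sequence(lambda n: 1 + n * D_PARAM, 10)
```

For d = 3/2 the values $D_n$ decrease geometrically, in line with the boundedness criterion of Theorem 8 on the range $0 < d \le 2$.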


4. The Hausdorff method. Let $\{\mu_n\}$ be an arbitrarily given sequence and
consider the matrices $S = (s_{nk})$, $T = (t_{nk})$, where $s_{nk} = (-1)^k \binom{n}{k}$, $t_{nk} = \mu_n \delta_{nk}$.
Any matrix of the form H = STS is called a Hausdorff matrix. Such a matrix
is clearly triangular, and if $H = (h_{nk})$ we find that

(4.01) $h_{nk} = \sum_{i=k}^{n} (-1)^{i-k} \binom{n}{i} \binom{i}{k} \mu_i$ $(k = 0, 1, \ldots, n;\ n = 0, 1, 2, \ldots)$.

Since⁷ $S^2 = I$ it is easy to verify that

(i) The multiplication of Hausdorff matrices is commutative and the result is
again a Hausdorff matrix.

(ii) The inverse of a normal Hausdorff matrix is also a (normal) Hausdorff
matrix.
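The construction H = STS can be tried on a finite section, since all three factors are lower triangular and truncation therefore commutes with the product. The sketch below is an illustration only; it assumes the moment sequence $\mu_n = 1/(n+1)$ (the Cesàro method of order one), and the helper names are hypothetical.

```python
from fractions import Fraction
from math import comb

def matmul(x, y):
    """Product of two square matrices given as lists of rows."""
    size = len(x)
    return [
        [sum(x[i][m] * y[m][j] for m in range(size)) for j in range(size)]
        for i in range(size)
    ]

def difference_matrix(size):
    """S = (s_nk) with s_nk = (-1)^k C(n, k); comb gives 0 above the diagonal."""
    return [[(-1) ** k * comb(n, k) for k in range(size)] for n in range(size)]

def hausdorff_matrix(mu, size):
    """Finite section of H = S T S, where T is the diagonal matrix of the mu_n."""
    s = difference_matrix(size)
    t = [[mu(n) if n == k else Fraction(0) for k in range(size)] for n in range(size)]
    return matmul(matmul(s, t), s)

def formula_entry(mu, n, k):
    """h_nk computed from the explicit sum (4.01)."""
    return sum(
        (-1) ** (i - k) * comb(n, i) * comb(i, k) * mu(i) for i in range(k, n + 1)
    )

SIZE = 6
S = difference_matrix(SIZE)
IDENTITY = [[int(n == k) for k in range(SIZE)] for n in range(SIZE)]

def cesaro_mu(n):
    """Assumed moment sequence: mu_n = 1 / (n + 1)."""
    return Fraction(1, n + 1)

H = hausdorff_matrix(cesaro_mu, SIZE)
```

With these moments each row n of H carries the constant weight 1/(n + 1) on its first n + 1 entries, which is the arithmetic-mean matrix.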

The question whether or not every normal regular Hausdorff matrix is of 
type M seems to be quite difficult and still remains open. However, we shall 
later exhibit (see Example 3) a regular Hausdorff matrix which is not of type M. 
The next theorem, although based on an unverified hypothesis, seems of suffi- 
cient interest to warrant its inclusion. 

THEOREM 9. If $H_0$ is normal, convergence-preserving, and not of type M, and
if H is not weaker than $H_0$, then H is not of type M.


6 See Tamarkin, Problem 3276, American Mathematical Monthly, vol. 35 (1928), pp. 
497-500; esp. bottom of p. 500. 

7 For a particularly simple proof see Henriksson, Über die Hausdorffschen Limitierungsverfahren,
die schwächer sind als das Abelsche, Mathematische Zeitschrift, vol. 39 (1935),
pp. 501-510; in particular, p. 502.










Proof. By Theorem 7, $H_1 = HH_0^{-1}$ is convergence-preserving and from (i),
(ii) we see that $H_1 = H_0^{-1}H$. Hence $H = H_0H_1$ and the result follows at once
from Theorem 6.

On the other hand, since Cesàro summability is a special case of the Hausdorff,
we know that there exist Hausdorff matrices which are of type M. Concerning
such matrices we have the following result.

THEOREM 10. If H is normal and convergence-preserving, and if $H_0$ is of type M
and not weaker than H, then H is of type M.

Proof. As in the previous theorem we have $H_1 = H_0H^{-1} = H^{-1}H_0$, where
$H_1$ is convergence-preserving. Then $H_0 = HH_1$ and the conclusion follows
again from Theorem 6.

From (i) and Theorem 5 we obtain directly the following theorem. 

THEOREM 11. The product of a finite number of perfect Hausdorff methods is
likewise a perfect Hausdorff method.

We propose next to establish conditions sufficient for a regular Hausdorff
matrix to be of type M. In order that a regular method be defined by (4.01),
it is necessary and sufficient⁸ that $\mu_i$ be of the form $\mu_i = \int_0^1 u^i\,dq(u)$ for
$i = 0, 1, 2, \ldots$, where q(u) is a function of bounded variation on the interval
$U = (0 \le u \le 1)$ which is continuous at the point u = 0 and which satisfies
the condition q(1) - q(0) = 1. It is understood throughout this section that
q(u) will always denote such a function. With this expression for $\mu_i$, (4.01)
reduces to $h_{nk} = \binom{n}{k} \int_0^1 u^k(1 - u)^{n-k}\,dq(u)$, and conditions (1.4) become

(4.02) $\sum_{n=0}^{\infty} |\alpha_n| < \infty$, $\sum_{n=k}^{\infty} \alpha_n \binom{n}{k} \int_0^1 u^k(1 - u)^{n-k}\,dq(u) = 0$ $(k = 0, 1, 2, \ldots)$.

Since

(4.03) $\sum_{k=0}^{n} \binom{n}{k} u^k(1 - u)^{n-k} = 1$ $(n = 0, 1, 2, \ldots)$

and $\int_0^1 dq(u) = q(1) - q(0) = 1$, we observe for future reference that the
following relation is implied by (4.02):

(4.04) $\sum_{k=0}^{\infty} \sum_{n=k}^{\infty} \alpha_n h_{nk} = \sum_{n=0}^{\infty} \Big( \sum_{k=0}^{n} h_{nk} \Big) \alpha_n = \sum_{n=0}^{\infty} \alpha_n = 0$.

We proceed now to reduce (4.02) to a more convenient form. From (4.03)
we have $0 \le \binom{n}{k} u^k(1 - u)^{n-k} \le 1$ on U. Consequently, we see that each
of the series

(4.05) $g_k(u) = \sum_{n=k}^{\infty} \alpha_n \binom{n}{k} u^k(1 - u)^{n-k}$ $(k = 0, 1, 2, \ldots)$


8 See Hausdorff, Summationsmethoden und Momentfolgen, Mathematische Zeitschrift,
vol. 9 (1921), pp. 74-109, 280-299.

























converges absolutely and uniformly on U, and that for every k

(4.06) $|g_k(u)| \le \sum_{n=k}^{\infty} |\alpha_n|$,

(4.07) $g_k(u) = (-1)^k u^k g_0^{(k)}(u)/k!$.

Thus $\{g_k(u)\}$ is a uniformly bounded sequence of functions, continuous on U
and analytic for |u - 1| < 1. On account of the uniform convergence of (4.05)
we may write the second part of (4.02) in the form

(4.08) $\int_0^1 g_k(u)\,dq(u) = 0$ $(k = 0, 1, 2, \ldots)$.

Now for an arbitrarily fixed t on the interval $0 < t \le 1$ it follows from (4.06)
that the series $G(t, u) = \sum_{k=0}^{\infty} g_k(u)(1 - t)^k$ is uniformly convergent on U. Termwise
integration is therefore permissible, and from (4.08) we obtain

(4.09) $\int_0^1 G(t, u)\,dq(u) = 0$.

But by (4.07), for $0 < u \le 1$, we have $G(t, u) = \sum_{k=0}^{\infty} g_k(u)(1 - t)^k = \sum_{k=0}^{\infty} g_0^{(k)}(u)(tu - u)^k/k! = g_0(tu)$.
By (4.04), however, $g_0(0) = \sum_{n=0}^{\infty} \alpha_n = 0$,
and by (4.05), $g_k(0) = 0$ for $k = 1, 2, 3, \ldots$. This shows that the relation
$G(t, u) = g_0(tu)$ holds also for u = 0. Making this substitution in (4.09) we
obtain finally

(4.10) $\int_0^1 g_0(tu)\,dq(u) = 0$ $(0 < t \le 1)$,

and the problem is reduced to finding further conditions on q(u) sufficient to
insure that (4.10) shall imply $g_0(u) \equiv 0$.

As a first step in this direction we state the following definition. 

CONDITION C. q(u) will be said to satisfy Condition C if there is an index
$r \ge 0$ such that $q^{(r+2)}(u)$ exists and is bounded on U and if $q^{(r+1)}(1) \ne 0$. We
shall denote by m the smallest index for which this holds. Obviously we then
have

(4.11) $q^{(i+1)}(1) = 0$ $(i = 0, 1, \ldots, m - 1)$.

If q(u) satisfies this condition, we may form the sequence $q_0(u) = q(u)$,
$q_i(u) = uq_{i-1}'(u)$ for $i = 1, 2, \ldots, m$, and one easily sees that

(4.12) $q_i(u) = \sum_{j=1}^{i} C_{ij} u^j q^{(j)}(u) = d^i q(u)/dz^i$, when $u = e^z$ $(i = 1, 2, \ldots, m)$,

where the $C_{ij}$ are certain constants.










THEOREM 12. If q(u) satisfies Condition C and if constants $c_0, c_1, \ldots, c_m$
exist such that

(4.13) $\int_0^1 \Big| uq_m''(u) + \sum_{i=0}^{m} c_i q_i'(u) \Big|\,du < |q^{(m+1)}(1)|$,

then q(u) defines a Hausdorff matrix of type M.
Proof. From Condition C, (4.11), and the first half of (4.12), it follows that

(4.14) $q_i'(1) = \delta_{im} q^{(m+1)}(1)$,
(4.15) $q_i''(u) = O(1)$ on U $(i = 0, 1, \ldots, m)$.

Setting s = tu for $0 < t \le 1$ and writing $dq(u) = q'(u)\,du$, (4.10) becomes
$\int_0^t g_0(s) q'(s/t)\,ds = 0$. In view of (4.15) for i = 0 we may differentiate this
integral and obtain $\int_0^t g_0(s) q''(s/t)(s/t^2)\,ds = q'(1) g_0(t)$. Recalling (4.14) for
i = 0 and returning to the variable u, we have $\int_0^1 g_0(tu) u q''(u)\,du = 0$. This
added to the original expression (4.10) gives $\int_0^1 g_0(tu) q_1'(u)\,du = 0$. This relation
is simply (4.10) with q(u) replaced by $q_1(u)$, and (4.15) allows us to repeat
this process until we have

(4.16) $\int_0^1 g_0(tu) q_i'(u)\,du = 0$ $(i = 0, 1, \ldots, m)$.

Repeating the process again with i = m and using (4.14) for the index m, we
obtain

(4.17) $\int_0^1 g_0(tu) u q_m''(u)\,du = q_m'(1) g_0(t) = q^{(m+1)}(1) g_0(t)$,

where, by Condition C, $q^{(m+1)}(1) \ne 0$. Multiplying equations (4.16) by the
corresponding constants $c_i$ and adding the results to (4.17) give
$q^{(m+1)}(1) g_0(t) = \int_0^1 g_0(tu) \Big( uq_m''(u) + \sum_{i=0}^{m} c_i q_i'(u) \Big)\,du$ on the interval $0 < t \le 1$. Letting
$\bar{g}_0 = \max |g_0(t)|$ on (0, 1), we obtain the inequality

$$|q^{(m+1)}(1)|\,\bar{g}_0 \le \bar{g}_0 \int_0^1 \Big| uq_m''(u) + \sum_{i=0}^{m} c_i q_i'(u) \Big|\,du.$$

Consequently, if (4.13) holds we must have $\bar{g}_0 = 0$, and the theorem follows.
COROLLARY 1. If q(u) has a bounded second derivative, and if a constant
c < 1 exists such that $uq''(u) + cq'(u) \ge 0$ on U, then the corresponding Hausdorff
matrix is of type M.

Proof. Since c < 1 the relation $0 \le \int_0^1 |uq''(u) + cq'(u)|\,du = \int_0^1 uq''(u)\,du + c\int_0^1 q'(u)\,du = q'(1) - 1 + c$
gives q'(1) > 0, and the hypotheses
of Theorem 12 are therefore satisfied with m = 0, $c_0 = c$.

From this fact we obtain at once the following result. 

COROLLARY 2. If q(u) has a bounded non-negative second derivative on U, then
the corresponding Hausdorff matrix is of type M.

It seems desirable to show as a final corollary that the criterion of Theorem 12,
together with Theorem 10, enables us to reproduce Mazur's result for Cesàro
summability.

COROLLARY 3. The Cesàro matrices of all positive orders are of type M.

Proof. In this case we have $q(u) = 1 - (1 - u)^{\alpha}$, where $\alpha > 0$. We assume
first that $\alpha$ is an integer. Then it is obvious that q(u) satisfies Condition C
with $m = \alpha - 1$. Moreover, $q(u) = \sum_{i=1}^{\alpha} (-1)^{i+1} \binom{\alpha}{i} u^i$ and from the second
part of (4.12), $q_s(u) = \sum_{i=1}^{\alpha} (-1)^{i+1} \binom{\alpha}{i} i^s u^i$ for $s = 0, 1, \ldots, \alpha - 1$. Consequently,
we have

$$uq_m''(u) + \sum_{s=0}^{\alpha-1} c_s q_s'(u) = \sum_{i=1}^{\alpha} (-1)^{i+1}\, i \binom{\alpha}{i} \Big( \sum_{s=0}^{\alpha-1} i^s c_s + (i - 1) i^{\alpha-1} \Big) u^{i-1}.$$

Hence by choosing the c's to satisfy the equations

(4.18) $\sum_{s=0}^{\alpha-1} i^s c_s + (i - 1) i^{\alpha-1} = 0$ $(i = 1, 2, \ldots, \alpha)$

as it is clear we may do, we see that the integrand in (4.13) vanishes identically,
and the result for integral orders is established. But since the strength of the
Cesàro method increases with the index, the general conclusion follows from
this by Theorem 10.

One might be led to suspect that the preceding argument could be applied
to obtain the desired conclusion when q(u) is an arbitrary polynomial. It
turns out that such is not the case, however, since in general the system of
equations corresponding to (4.18) is inconsistent.

THEOREM 13. The function $q(u) = u^p$ for p > 0 defines a perfect Hausdorff
method.

Proof. The normality and regularity are apparent. Let us then assume that
(4.08) holds and replace therein $g_k(u)$ by its expression given in (4.07). Since
$dq(u) = pu^{p-1}\,du$ we find on integrating by parts that

$$0 = (p + k) \int_0^1 u^k g_0^{(k)}(u)\, p u^{p-1}\,du = p g_0^{(k)}(1) - \int_0^1 u^{k+1} g_0^{(k+1)}(u)\, p u^{p-1}\,du.$$

Thus $g_0^{(k)}(1) = 0$ for $k = 0, 1, 2, \ldots$. From this the theorem follows.
It may be remarked that this result for $p \ge 2$ is an immediate consequence
of Corollary 2, and, for $p \ge 1$, of the ensuing Theorem 15.


The next two theorems deal with certain general classes of monotone functions.
To facilitate the statement of the first, we introduce the following
definition.

CONDITION D. Any function q(u) defined as follows will be said to satisfy
Condition D. For an arbitrarily given v, $0 < v \le 1$, let $U_1 = (0 \le u < v)$
and $U_2 = (v \le u \le 1)$. On $U_1$ let q(u) be monotone increasing with q(0) = 0,
and suppose that r exists, 0 < r < 1, such that $q(v - 0) \le r/(1 + r)$. Finally,
at each point of $U_2$ let q(u) be equal to 1.

THEOREM 14. If q(u) satisfies Condition D, then the corresponding Hausdorff
matrix is of type M.

Proof. For each $i = 1, 2, 3, \ldots$ let $0 = u_0^i < u_1^i < \cdots < u_{m_i+1}^i = v$ be a
mode of subdividing the interval $0 \le u \le v$ such that the maximum length
of the subdivisions tends to zero as $i \to \infty$. Let $Q_s^i = q(u_s^i) - q(u_{s-1}^i)$ for
$s = 1, 2, \ldots, m_i + 1$.

Assuming that (4.10) holds, we set

(4.19) $G_i(t) = \sum_{s=1}^{m_i+1} g_0(tu_s^i) Q_s^i$ $(0 < t \le 1)$,

and we have then for each t, $\lim_{i \to \infty} G_i(t) = 0$. By Condition D,

$$Q_{m_i+1}^i = 1 - q(u_{m_i}^i) \ge 1/(1 + r),$$
$$0 \le \sum_{s=1}^{m_i} Q_s^i / Q_{m_i+1}^i = q(u_{m_i}^i)/(1 - q(u_{m_i}^i)) \le r < 1,$$

for $i = 1, 2, 3, \ldots$. Consequently, if we let $\bar{g}_0 = \max |g_0(u)|$ on (0, v), we
obtain from (4.19) the inequality $|g_0(vt)| \le |G_i(t)/Q_{m_i+1}^i| + r\bar{g}_0$. Letting
$i \to \infty$ gives $|g_0(vt)| \le r\bar{g}_0$ for $0 \le vt \le v$. This is not possible unless $\bar{g}_0 = 0$.
Thus $g_0(u) \equiv 0$, and the theorem is proved.

It is evident that no essential change is necessary in the above proof if v < 1
and $U_1 = (0 \le u \le v)$, $U_2 = (v < u \le 1)$.

THEOREM 15. If q(u) is continuous on U and has a derivative for 0 < u < 1
which is non-negative and non-decreasing, then the corresponding Hausdorff matrix
is of type M.

Proof.⁹ Under the given conditions q(u) is absolutely continuous and (4.10)
may be written

(4.20) $\int_0^1 g_0(tu) q'(u)\,du = 0$ $(0 \le t \le 1)$.

If we assume that $g_0(u)$ is not identically zero, it follows that the function

(4.21) $h(u) = \int_0^u g_0(t)\,dt$ $(0 \le u \le 1)$

(which may be continued analytically into the circle |u - 1| < 1) must assume
both positive and negative values. For integrating (4.20) with respect to t
from 0 to v $(0 \le v \le 1)$ and interchanging the order of integration we obtain

(4.22) $\int_0^1 \frac{h(uv)}{u} q'(u)\,du = 0$ $(0 < v \le 1)$.

Since q(1) - q(0) = 1, q'(u) cannot be zero almost everywhere and hence the
assumption that h(u) is of one sign or zero implies that h(u) is zero on some
set of positive measure. This is a contradiction to (4.21) unless $g_0(u)$ is identically
zero.

Let us denote then by -m the minimum value of the function h(t)/t, which
from (4.21) is continuous for $0 \le t \le 1$, if the value 0 is assigned at t = 0.
Suppose that $h(t_0)/t_0 = -m$, where $0 < t_0 \le 1$, and set $p(t) = h(t) + mt$ so
that $p(t) \ge 0$, $p(t_0) = 0$. Then for the function

$$F(t) = \int_0^1 \frac{p(tu)}{u} q'(u)\,du = \int_0^1 \frac{h(tu)}{u} q'(u)\,du + mt \int_0^1 q'(u)\,du$$

we get from (4.22), F(t) = mt(q(1) - q(0)) = mt, and thus

(4.23) $F(t) - F(t_0) = m(t - t_0)$, m > 0.

On the other hand, if $t_0 < 1$ and $t_0 < t < 1$ a simple calculation gives

$$F(t) - F(t_0) = \int_0^{t_0} \frac{p(s)}{s} \Big( q'\Big(\frac{s}{t}\Big) - q'\Big(\frac{s}{t_0}\Big) \Big)\,ds + \int_{t_0/t}^1 \frac{p(tu)}{u} q'(u)\,du.$$

The first integral in view of $p(t) \ge 0$ and the monotone property of q'(u) is less
than or equal to zero. Furthermore, for $t_0/t \le u \le 1$ it follows from (4.21) that

$$p(tu) = p(tu) - p(t_0) = h(tu) - h(t_0) + m(tu - t_0) \le (\max |g_0| + m)(t - t_0).$$

This shows that the second integral is $o(t - t_0)$. These estimates provide a
contradiction to (4.23) and complete the proof in case $t_0 < 1$. A similar argument
applies when $t_0 = 1$.

9 The proof given here parallels an argument ascribed to E. J. McShane, which applies
directly to the function $q(u) = (2/\pi) \sin^{-1} u$. See Bonnesen und Fenchel, Theorie der
Konvexen Körper, p. 138. I am indebted to Professor Hans Lewy for calling this to my
attention.

We conclude our discussion of the Hausdorff method by showing, as pre- 
viously mentioned, that there exists a regular Hausdorff matrix which is not 
of type M. 

Example 3. The polynomial $Q(u) = 16u^3 - 27u^2 + 12u$ defines a regular
Hausdorff matrix which is not of type M. Since Q(1) - Q(0) = 1, the regularity
is clear. Now let us choose $\alpha_0 = 0$, $\alpha_1 = -1$, and $\alpha_n = 1/(n(n - 1))$
for $n \ge 2$. Then we have $\sum_{n=0}^{\infty} |\alpha_n| < \infty$, $g_0(u) = \sum_{n=0}^{\infty} \alpha_n(1 - u)^n = u \log u$,
and (4.07) gives $g_1(u) = -u(1 + \log u)$, $g_k(u) = u/(k(k - 1))$ for $k \ge 2$.
Consequently, conditions (4.08) reduce simply to the two conditions
$\int_0^1 u\,dQ(u) = 0$, $\int_0^1 (u \log u)\,dQ(u) = 0$, and one easily verifies that these are
satisfied. Thus the matrix defined by Q(u) is not of type M. Moreover,
since the diagonal elements of this matrix are given by

$$h_{nn} = \int_0^1 u^n\,dQ(u) = 6(n - 1)^2/((n + 1)(n + 2)(n + 3)) \qquad (n = 0, 1, 2, \ldots),$$

we see that the normality is destroyed by (and only by) the vanishing of $h_{11}$.
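The two integral conditions and the diagonal formula of Example 3 can be confirmed exactly. The sketch below is illustrative and not from the paper; it uses $Q'(u) = 48u^2 - 54u + 12$ together with the standard integrals $\int_0^1 u^b\,du = 1/(b+1)$ and $\int_0^1 u^b \log u\,du = -1/(b+1)^2$. The names `moment` and `log_moment` are hypothetical.

```python
from fractions import Fraction

# Coefficients of Q'(u) = 48 u^2 - 54 u + 12, keyed by the exponent of u.
Q_PRIME = {2: 48, 1: -54, 0: 12}

def moment(n):
    """Integral of u^n dQ(u) over (0, 1)."""
    return sum(Fraction(c, n + e + 1) for e, c in Q_PRIME.items())

def log_moment(a):
    """Integral of u^a log(u) dQ(u) over (0, 1)."""
    return sum(Fraction(-c, (a + e + 1) ** 2) for e, c in Q_PRIME.items())

def diagonal_formula(n):
    """The closed form 6 (n - 1)^2 / ((n + 1)(n + 2)(n + 3)) for h_nn."""
    return Fraction(6 * (n - 1) ** 2, (n + 1) * (n + 2) * (n + 3))
```

Here `moment(0)` is Q(1) - Q(0), `moment(1)` and `log_moment(1)` are the two conditions to which (4.08) reduces, and `moment(n)` is the diagonal element $h_{nn}$.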


5. The weighted-mean method. The weighted-mean method is defined by a triangular matrix whose elements have the form $a_{nk} = p_k/P_n$ for $k = 0, 1, 2, \dots, n$; $n = 0, 1, 2, \dots$, where the sequence $\{p_n\}$ is such that $P_n = p_0 + p_1 + \cdots + p_n \ne 0$ for all n. If such a matrix is normal, we must have $a_{nn} = p_n/P_n \ne 0$, or $p_n \ne 0$ for every n. In this event, conditions (1.4) reduce to

$$\sum_{n=k}^{\infty} a_n/P_n = 0 \qquad (k = 0, 1, 2, \dots),$$

which clearly imply $a_n = 0$ $(n = 0, 1, 2, \dots)$.

Thus we have established the following theorem.

THEOREM 16. Every normal weighted-mean matrix is of type M.
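As an illustration of the definition, the sketch below builds a finite section of a weighted-mean matrix and tests the non-vanishing of its diagonal. The function names are hypothetical, and "normal" is taken here in the sense used in the text (a triangular matrix with non-zero diagonal elements).

```python
from fractions import Fraction

def weighted_mean_matrix(p, size):
    """Rows a[n][k] = p[k] / P[n] for k <= n, and zero above the diagonal."""
    rows = []
    partial = Fraction(0)
    for n in range(size):
        partial += Fraction(p[n])          # P_n = p_0 + ... + p_n
        if partial == 0:
            raise ValueError("P_n must be non-zero for every n")
        rows.append([Fraction(p[k]) / partial if k <= n else Fraction(0)
                     for k in range(size)])
    return rows

def is_normal(rows):
    """True when every diagonal element a[n][n] is non-zero."""
    return all(rows[n][n] != 0 for n in range(len(rows)))

if __name__ == "__main__":
    for row in weighted_mean_matrix([1, 2, 3, 4], 4):
        print([str(entry) for entry in row])
```

Each row sums to one, and the matrix fails to be normal exactly when some $p_n$ vanishes.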


Brown UNIVERSITY AND MICHIGAN STATE COLLEGE. 


























QUASI-UNITARY MATRICES
By John Williamson

Introduction. Let $I_m$ be the n-rowed square matrix

$$I_m = \begin{pmatrix} E_m & 0 \\ 0 & -E_{n-m} \end{pmatrix},$$

where $E_j$ is the unit matrix of order j. Then $I_m$ is the normal form of a non-singular Hermitian matrix of index m under a non-singular conjunctive transformation. A matrix A, whose elements are complex numbers, which satisfies

(1) $A I_m A^* = I_m$,

where $A^* = \bar{A}'$ is the conjugate transposed of A, will be called a quasi-unitary matrix. In particular, if m = n or 0, A is a unitary matrix. A matrix, A, which satisfies (1), is a conjunctive automorph of the Hermitian matrix $I_m$. The conjunctive automorphs of a non-singular Hermitian matrix have been studied by Loewy.¹ He has shown how the nature of the elementary divisors of $A - \lambda E$ is restricted by the index m of the matrix $I_m$. In the following paper we derive normal forms for quasi-unitary matrices under quasi-unitary transformations, and in doing so are inevitably led to Loewy's results. (See, for example, the remark following Theorem 2.) We also determine necessary and sufficient conditions for the similarity of two quasi-unitary matrices under a quasi-unitary transformation. In particular it is shown that two quasi-unitary matrices which are similar are not necessarily similar under a quasi-unitary transformation. In §2 the similar problem for real quasi-orthogonal matrices is considered, and in §4 an interesting property of the elementary divisors of a pencil, whose base is $I_m$ and a canonical quasi-unitary matrix, is deduced.

As many of the proofs are in essence the same, subject to obvious modifica- 
tions, as those in a previous paper,” for the sake of brevity they will be omitted. 


Received May 4, 1937; in revised form September 21, 1937. 

¹ Alfred Loewy, Allgemeine bilineare Formen konjugirt imaginären Variabeln, Abhandlungen der Kaiserlichen Leopoldinisch-Carolinischen Deutschen Akademie der Naturforscher, vol. 71 (1898), pp. 377-446; Mathematische Annalen, vol. 50, pp. 557-576. The second of these papers gives a short account of the results proved in the first. The term quasi-unitary was first used by Harold Hilton, Properties of certain homogeneous linear substitutions, Annals of Mathematics, (2), vol. 15 (1913), pp. 195-201.

² John Williamson, On the normal forms of linear canonical transformations in dynamics, American Journal of Mathematics, vol. 59 (1937), pp. 599-617. This paper will be referred to as W.




1. The problem that we first consider, then, is the following. Let $A_1$ and $A_2$ be two matrices, which satisfy

(2) $A_i I_m A_i^* = I_m$ $(i = 1, 2, 3)$;

to determine necessary and sufficient conditions that a third matrix $A_3$, satisfying (2), exist and satisfy $A_3 A_2 A_3^{-1} = A_1$.

If $A_1$ and $A_2$ are similar and a matrix Q, to be specified later, is similar to $A_1$, then Q is also similar to $A_2$. There accordingly exist two non-singular matrices $R_1$ and $R_2$, such that

$R_i A_i R_i^{-1} = Q$ $(i = 1, 2)$.

The matrices

$S_i = R_i I_m R_i^*$ $(i = 1, 2)$

are Hermitian and are left invariant by Q; that is, they satisfy

$Q S_i Q^* = S_i$ $(i = 1, 2)$.

Accordingly, if Q is any matrix similar to both of the quasi-unitary matrices $A_1$ and $A_2$, there is associated with $A_1$ a Hermitian matrix $S_1$ and with $A_2$ a Hermitian matrix $S_2$, both of which are left invariant by Q.
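The invariance $Q S Q^* = S$ follows from $Q = R A R^{-1}$ and $S = R I_m R^*$ by direct substitution. The following sketch verifies it exactly on a small real example with n = 2, m = 1; the particular matrices A and R are illustrative choices and do not come from the paper.

```python
from fractions import Fraction as Fr

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    """Transpose; equals the conjugate transpose for real entries."""
    return [list(col) for col in zip(*X)]

def inverse_2x2(X):
    (a, b), (c, d) = X
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

I_m = [[Fr(1), Fr(0)], [Fr(0), Fr(-1)]]            # n = 2, m = 1
A = [[Fr(5, 3), Fr(4, 3)], [Fr(4, 3), Fr(5, 3)]]   # satisfies A I_m A* = I_m
R = [[Fr(1), Fr(2)], [Fr(0), Fr(1)]]               # any non-singular matrix

Q = matmul(matmul(R, A), inverse_2x2(R))           # Q = R A R^{-1}
S = matmul(matmul(R, I_m), transpose(R))           # S = R I_m R*

if __name__ == "__main__":
    print("Q S Q* == S:", matmul(matmul(Q, S), transpose(Q)) == S)
```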

The problem under consideration is reduced to a similar but simpler one by 
means of 

THEOREM 1. A necessary and sufficient condition that the quasi-unitary matrix $A_1$ be similar to the quasi-unitary matrix $A_2$ under a quasi-unitary transformation is that there exist a non-singular matrix H, such that

$HQ = QH$,

and that the two Hermitian matrices associated with $A_1$ and $A_2$ satisfy³

$H S_1 H^* = S_2$.

Since, in the above, Q is any matrix similar to $A_1$, we are at liberty to choose Q in a suitable normal form. Then, if S is any Hermitian matrix, which satisfies the equation

(3) $Q S Q^* = S$,

we shall first determine a unique normal form for S under non-singular conjunctive transformations by matrices commutative with Q. If $HQ = QH$ and $HSH^* = T$, we shall call the transformation by the matrix H an admissible transformation and shall write $S \cong T$.

Let the matrix Q, which is a normal form of the quasi-unitary matrix A under similarity transformations, be chosen in the diagonal block form

$Q = [Q_1, Q_2, \dots, Q_k]$,

³ For proof, see W, Theorem 1.




























where no latent root of $Q_1$ has absolute value one, each latent root of $Q_i$, $i > 1$, has absolute value one, and, if $i \ne j$, no latent root of $Q_i$ is the same as a latent root of $Q_j$. Then, if (3) is satisfied, the matrix S is also a diagonal block matrix,

$S = [S_1, S_2, \dots, S_k]$

and

(4) $Q_i S_i Q_i^* = S_i$ $(i = 1, 2, \dots, k)$

(W, Lemma 2). Since (4) is the same as (3) except for the suffix i, we need only consider two special cases of Q:

Case 1. No latent root of Q has absolute value one.

Case 2. Each latent root of Q is equal to $\rho$, where $\rho$ is of absolute value one.

Case 1. Since Q is similar to $(Q^*)^{-1}$, Q is similar to the diagonal block matrix $[F, (F^*)^{-1}]$. As a consequence of the remark following Theorem 1, we may replace Q by this matrix; that is, we may write $Q = [F, (F^*)^{-1}]$. With this value of Q, the matrix S is of the form

$$\begin{pmatrix} 0 & T \\ T^* & 0 \end{pmatrix},$$

where T is a square matrix of the same order as F. The transformation of matrix

$$\begin{pmatrix} T^{-1} & 0 \\ 0 & E \end{pmatrix}$$

is admissible and

(5) $$S \cong \begin{pmatrix} 0 & E \\ E & 0 \end{pmatrix} = G.$$

Hence we have

Result 1. If no latent root of A has absolute value one, the matrix Q may be taken in the form $[F, (F^*)^{-1}]$. Then $S \cong G$.

The matrix F is not unique and may be replaced by any matrix similar to it, the classical canonical form, for instance. As a consequence of Theorem 1 we therefore have

THEOREM 2. If $A_1$ is a quasi-unitary matrix similar to a second quasi-unitary matrix $A_2$ and, if no latent root of $A_1$ is of absolute value one, then $A_1$ is similar to $A_2$ under a quasi-unitary transformation.

The fact that in this case the index of $I_m$ must be one half the order of $I_m$ is a known result.⁴


⁴ Alfred Loewy, loc. cit.













Case 2. Since each latent root of Q is equal to $\rho$, we may take Q in the canonical form

$Q = [P_{e_1}, P_{e_2}, \dots, P_{e_t}]$,

where

(6) $P_{e_i} = \rho E_i + \rho U_i$,

and $E_i$ and $U_i$ are respectively the unit matrix and the auxiliary unit matrix of order $e_i$. The elementary divisors of $A - \lambda E$ are therefore

$(\lambda - \rho)^{e_i}$ $(i = 1, 2, \dots, t;\ e_1 \ge e_2 \ge \cdots \ge e_t)$.

If

(7) $S = (S_{rs})$ $(r, s = 1, 2, \dots, t)$,

is a partition of S similar to that of Q, it can be shown first that $S \cong T = (T_{rs})$, where $T_{11}$ is non-singular (W, §4); then that $T \cong [S_1, S_2, \dots, S_t]$, where

(8) $P_{e_i} S_i P_{e_i}^* = S_i$

(W, Lemma 3). Equations (8) are of two distinct types: type (1), the matrix $P_{e_i} = P$ is of even order 2m; type (2), the matrix P is of odd order $e = 2m + 1$.

Type (1). The reduction, used in W, type 6, shows that

$$S_i \cong W_i = d_i X_i = d_i \begin{pmatrix} 0 & X_{12} \\ X_{21} & 0 \end{pmatrix},$$

where $X_{12}$ is a uniquely determined square matrix, all of whose elements are integers, and $X_{21} = -X_{12}'$. For example,⁵ if m = 4,


Since $W_i^* = W_i$, $d_i = a_i i$, where $a_i$ is a real number different from zero. The admissible transformation by the scalar matrix $E/\sqrt{|a_i|}$ shows that $S_i \cong W_i \cong \epsilon_i i X_i$, where $\epsilon_i = \pm 1$. Therefore we have

Result 2. If $e_i = 2m$, $S_i \cong \epsilon_i i X_i$, where $\epsilon_i = \pm 1$ and $X_i$ is uniquely determined.

Type (2). The matrix $S_i \cong \epsilon_i Y_i$, $\epsilon_i = \pm 1$, where $Y_i = (y_{rs})$ is a uniquely determined matrix, for which

yn = 0 (r,s = 1,2,---,m; yn = 0,7 +8 2; + 2), 


⁵ Cf. Turnbull and Aitken, Canonical Matrices, p. 157.





























(W, type b). For example, if $e_i = 5$,


Consequently, we have

Result 3. If $e_i = 2m + 1$, $S_i \cong \epsilon_i Y_i$, where $\epsilon_i = \pm 1$ and $Y_i$ is uniquely determined.

We see from Results 2 and 3 that, with each elementary divisor $(\lambda - \rho)^{e_i}$ of $A - \lambda E$, where $|\rho| = 1$, is associated an $\epsilon_i$ which has the value $\pm 1$. Therefore, if $(\lambda - \rho)^e$ occurs exactly c times among the elementary divisors of $A - \lambda E$, with this elementary divisor is associated a set of c positive and negative signs. We may call the number of these positive signs the index of the elementary divisor $(\lambda - \rho)^e$. We are now able to prove (W, Theorem 4)

THEOREM 3. Necessary and sufficient conditions that two quasi-unitary matrices $A_1$ and $A_2$ be equivalent under a quasi-unitary transformation are that

(a) the elementary divisors of $A_1 - \lambda E$ be the same as those of $A_2 - \lambda E$, and

(b) the indices of all elementary divisors $(\lambda - \rho)^e$, $|\rho| = 1$, be the same for both pencils.

Theorem 3 includes as a special case the known theorem that two unitary matrices which are similar are similar under a unitary transformation. For, if A is unitary, only elementary divisors of type (2) may occur with $e_i = 1$ and the corresponding indices must all be one (or zero).
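A small illustration of condition (b): for a diagonal quasi-unitary matrix with distinct latent roots of absolute value one, the sign attached to each linear elementary divisor is the sign of the form $I_m$ on the corresponding eigenvector. The sketch below, with hypothetical helper names and an example chosen for this note (n = 2, m = 1), shows two matrices with the same elementary divisors but different signs; by Theorem 3 they would not be similar under a quasi-unitary transformation.

```python
def form_sign(vector, signature):
    """Sign of v* I_m v, where I_m = diag(signature)."""
    value = sum(s * abs(v) ** 2 for v, s in zip(vector, signature))
    return (value > 0) - (value < 0)

def eigenvalue_signs(diagonal, signature):
    """Map each (distinct) diagonal entry to the sign of I_m on its eigenvector."""
    signs = {}
    for position, root in enumerate(diagonal):
        unit = [1 if k == position else 0 for k in range(len(diagonal))]
        signs[root] = form_sign(unit, signature)
    return signs

signature = [1, -1]     # I_m for n = 2, m = 1
A1 = [1j, 1]            # diag(i, 1), quasi-unitary since |i| = |1| = 1
A2 = [1, 1j]            # diag(1, i), similar to A1 by a permutation

if __name__ == "__main__":
    print(eigenvalue_signs(A1, signature))
    print(eigenvalue_signs(A2, signature))
```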


2. Real quasi-orthogonal matrices. The above arguments are valid in the real field, if the complex number i is replaced by the two-rowed real matrix

$$\begin{pmatrix} 0 & 1 \\ -1 & 0 \end{pmatrix}$$

and the complex number $\rho = a + ib$ of unit modulus by the real orthogonal matrix

$$\begin{pmatrix} a & b \\ -b & a \end{pmatrix}.$$

The elementary divisors $(\lambda \pm 1)^{e_i}$ of type (1), $e_i = 2m$, now must be considered separately. In this case, since the matrix $S_{11}$ in (7) is a symmetric matrix of even order, all of whose elements are real numbers, $S_{11}$ is necessarily singular.








However, after at most a rearrangement of the rows and columns of S, we may suppose that

$$\begin{pmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{pmatrix}$$

is non-singular. Then $[P, P]$ may be replaced by $[P, (P^*)^{-1}]$ and

$$\begin{pmatrix} S_{11} & S_{12} \\ S_{21} & S_{22} \end{pmatrix} \cong \begin{pmatrix} 0 & E \\ E & 0 \end{pmatrix}$$

(W, type a). Accordingly, an elementary divisor $(\lambda \pm 1)^{2m}$ must occur an even number of times and no index need be associated with it. We have therefore

THEOREM 4. Two real quasi-orthogonal matrices $A_1$ and $A_2$ are similar under a real quasi-orthogonal transformation, if and only if

(a) the elementary divisors of $A_1 - \lambda E$ are the same as those of $A_2 - \lambda E$, and

(b) the indices associated with each pair of complex elementary divisors $(\lambda - \rho)^e$, $(\lambda - \bar\rho)^e$, $|\rho| = 1$, and with each elementary divisor $(\lambda \pm 1)^{2m+1}$ are the same for both pencils.


3. Normal Forms. In determining possible normal forms for a quasi-unitary matrix under quasi-unitary transformations we first reduce the matrices $X_i$ and $Y_i$ of Results 2 and 3 to simpler forms. In so doing we naturally alter the matrices $P_{e_i}$.

Type (1). Since $e_i = 2m$, we may write

$$P_{2m} = \begin{pmatrix} P_m & R_m \\ 0 & P_m \end{pmatrix},$$

where $R_m$ is a square matrix of order m, whose only non-zero element is the element $\rho$ in the first column and last row. Then, if


E,, 0 
H = me where ¢€; = ¢, 
0 —ieXo; 


0 E.. 1 Pe —et Lin 
HaXH* = = (,, and HP2,,H” = . , 
E, 90 0 (P,)" 


where L,, is the matrix, whose last row is 
(9) (p, —p, p, ---,(—1)" ‘p), 


all other rows being zero. Therefore, since « = +1, we have 
Resutt 2a. If e; = 2m;, Pen, may be replaced by 


Pp; Ln; 
Lom; = * e 
. 


Then Sam; = Gm. 























Type (2). Since $e_i = 2m + 1$, we may write

$$P_{2m+1} = \begin{pmatrix} P_m & R_m \\ 0 & P_{m+1} \end{pmatrix},$$

where $R_m$ is a matrix of m rows and m + 1 columns, whose only non-zero element is the element $\rho$ in the last row and first column. The corresponding matrix Y in Result 3 is of the form


(9 


where D is a non-singular (m + 1)-rowed matrix and C consists of the first m 


rows of D*. If 
(* 0 
H= , 
0 «D' 


a Mn 
HeY H* = [G,,, €—1)"] and HPomiaH” = " , 
0 (Pra) 


where M,, = ¢«R,,D. The first m — 1 rows of M,, are therefore zero and the 
last is 


(10) (Sep, — ep, 2p, x (—1)""* dep, (—1)ep). 


On substituting (—1)"e for « we have 
Resutt 3a. If e; = 2m; + 1, Pe; may be replaced by 


Pn, (—D™ Mn, 
Z., = a 
0 (P41) 
Then S.; = [Gn;, €]. 


If O is the real orthogonal matrix 


then 
(11) OGO’ = [E, —E]. 
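Further, if $Z = [F, (F^*)^{-1}]$, where F is the matrix of Result 1,

(12) $$OZO' = B = \begin{pmatrix} B_{11} & B_{12} \\ B_{21} & B_{22} \end{pmatrix} = \frac{1}{2}\begin{pmatrix} F + (F^*)^{-1} & -F + (F^*)^{-1} \\ -F + (F^*)^{-1} & F + (F^*)^{-1} \end{pmatrix},$$
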
and, if Z has the value given in Result 2a, 
P+(P*)'?+eL —-P+(P*% "+ ) 


(13) 0ZO’ = B= i( - e , 
—P+(P*)'-alL P+(P*)'- al 
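The display defining the orthogonal matrix O is not legible in this copy. One choice that is consistent with both (11) and (12) is $O = 2^{-1/2}\begin{pmatrix} E & E \\ -E & E \end{pmatrix}$; this is an assumption made for illustration, not a statement of the author's matrix. The sketch below checks it exactly for blocks of order one, computing $OXO'$ as $(WXW')/2$ with $W = \sqrt{2}\,O$.

```python
from fractions import Fraction as Fr

def matmul(X, Y):
    """Product of two matrices given as lists of rows."""
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

W = [[1, 1], [-1, 1]]       # sqrt(2) times the assumed O
Wt = [[1, -1], [1, 1]]      # its transpose

def conj_by_O(X):
    """Return O X O' for the assumed O, computed exactly as (W X W') / 2."""
    Y = matmul(matmul(W, X), Wt)
    return [[entry / 2 for entry in row] for row in Y]

G = [[Fr(0), Fr(1)], [Fr(1), Fr(0)]]
F = Fr(3)                                  # illustrative real 1x1 block
Z = [[F, Fr(0)], [Fr(0), 1 / F]]           # Z = [F, (F*)^{-1}]

if __name__ == "__main__":
    print("O G O' =", conj_by_O(G))
    print("O Z O' =", conj_by_O(Z))
```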










In reducing the matrix [G,,, , e] of Result 3a to diagonal form it is necessary to 
make a further partition of the matrix Z,,. Accordingly we write 


>*)—1 
> > mj; a r * . (F ) 0 
P., = P, (—1) M; = Nz, (Pm, +0) = “ ee 
y (p*) 
where N is an m-rowed square matrix, z a matrix of a single column, and y* 
a matrix of a single row. An easy calculation shows that 


(14) [O, 1}[G, [O, 1] = [En , —En, 4], 

and that 

(15) (0, 1]Z[0, 1)’ = B = (B,)) (i, j = 1, 2), 
where 


Bu={P+N4+(P*)"}2", Be=({|-P+(P*%"+Nj2", 22%), 
{—-P — N + (P*)"}27 (P+ (P*)'-—N}2", —224 

Bu ss , Bx = . 
y*2* y*2?, (p*)* 


If « = —1, the matrix on the right of (14) is [E,, —Emyaj. Ife = +1, a 
simple interchange of rows and the same interchange of columns reduces the 
matrix on the right of (14) to [Z,.., —£,,J]. Accordingly, if « = 1, there 
exists a real orthogonal matrix 0, such that 


ONG, 10; = (Ens, —Enl, 


and 
(16) 0,20; = B = (B;;) (é,j = 1, 2), 
where 
(PP4+N4(P*) "12", «2? {-P +N + (P*)"}2" 
By = , By = ’ 
y*2 : (p*) 1 y*2 1 


Bu w (=P —- N+ (PONS, -—a, Bey = {P —N + (P*)‘2". 


Thus each matrix $Z_{e_i}$ in Results 1, 2a and 3a is similar under a real orthogonal transformation to a matrix $B_i$ given by one of the equations (12), (13), (15) or (16) and the corresponding matrix $S_i \cong [E_r, -E_s]$. Let $B_1, B_2, \dots, B_k$ be the complete set of matrices $B_i$, described above, obtained from a quasi-unitary matrix A, and let

$B_s = (B_{s,ij})$ $(i, j = 1, 2;\ s = 1, 2, \dots, k)$.

Then, if

(17) $C = (C_{ij})$ $(i, j = 1, 2)$,

where

(18) $C_{ij} = [B_{1,ij}, B_{2,ij}, \dots, B_{k,ij}]$,

it follows that A is similar to C and that $S \cong I_m$. We have therefore proved














THEOREM 5. A quasi-unitary matrix A is similar under a quasi-unitary transformation to one and essentially only one of the matrices C defined by (17) and (18).


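4. Elementary divisors. We now prove three lemmas.

Lemma 1. The elementary divisors of $[P_m, (P_m^*)^{-1}] - \lambda G_m$ are all linear and of the form $\lambda - \omega$, where $\omega$ is of absolute value one.

Let

$$\Delta = \begin{vmatrix} P_m & -\lambda E_m \\ -\lambda E_m & (P_m^*)^{-1} \end{vmatrix}.$$

Then⁶

$$\Delta = |P_m (P_m^*)^{-1} - \lambda^2 E_m|.$$

But, if $a = \rho/\bar\rho = \rho^2$, it follows from (6) that

$$P_m (P_m^*)^{-1} = \begin{pmatrix} 0 & a & 0 & \cdots & 0 \\ 0 & 0 & a & \cdots & 0 \\ \cdot & \cdot & \cdot & \cdots & \cdot \\ 0 & 0 & 0 & \cdots & a \\ (-1)^{m-1}a & (-1)^{m-2}a & (-1)^{m-3}a & \cdots & a \end{pmatrix}.$$

Therefore, if $\lambda^2 = a\mu$,

$$\Delta = (-1)^m a^m [\mu^m - \mu^{m-1} + \mu^{m-2} - \cdots + (-1)^m] = (-1)^m a^m (\mu^{m+1} + (-1)^m)/(\mu + 1).$$

The roots of $\Delta = 0$ are accordingly all distinct and of absolute value one and the lemma is proved.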
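The shape of $P_m(P_m^*)^{-1}$ used above can be checked without inverting anything: dividing out the factor a, the claim is that $(E + U) = C\,(E + U')$, where C has ones on the superdiagonal and last row $((-1)^{m-1}, \dots, -1, 1)$. The sketch below (helper names are illustrative) verifies this identity in integers and confirms numerically that the alternating polynomial vanishes at the m points $\mu = -\zeta$, $\zeta^{m+1} = 1$, $\zeta \ne 1$, all of absolute value one.

```python
import cmath

def shift_matrices(m):
    """Return E + U and E + U' as integer matrices (U = auxiliary unit matrix)."""
    upper = [[1 if j in (i, i + 1) else 0 for j in range(m)] for i in range(m)]
    lower = [list(col) for col in zip(*upper)]
    return upper, lower

def companion(m):
    """Ones on the superdiagonal; last row (-1)^(m-1-j) for columns j = 0..m-1."""
    C = [[1 if j == i + 1 else 0 for j in range(m)] for i in range(m - 1)]
    C.append([(-1) ** (m - 1 - j) for j in range(m)])
    return C

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def alternating_poly(mu, m):
    """mu^m - mu^(m-1) + ... + (-1)^m."""
    return sum((-1) ** k * mu ** (m - k) for k in range(m + 1))

def roots(m):
    """Roots of mu^(m+1) + (-1)^m = 0 other than mu = -1."""
    return [-cmath.exp(2j * cmath.pi * k / (m + 1)) for k in range(1, m + 1)]

if __name__ == "__main__":
    upper, lower = shift_matrices(4)
    print(matmul(companion(4), lower) == upper)
    print([abs(alternating_poly(mu, 4)) < 1e-9 for mu in roots(4)])
```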

We have as an immediate 

COROLLARY. The matrix F of Result 1 may be so chosen, that the elementary divisors of $[F, (F^*)^{-1}] - \lambda G$ are all linear and of the form $\lambda - \omega$, where $\omega$ is of absolute value one.

Lemma 2. If $Z_m$ is the matrix of Result 2a, the elementary divisors of $Z_m - \lambda G_m$ are all linear and of the form $\lambda - \omega$, where $\omega$ is of absolute value one.

If A = |Z, — AG, |, then® A = | Pa(Pa) + Ain — NEw |. 

Since p is of absolute value 1, p = e“ and (p*)"' = p. Hence on substituting 
for L, its value given by (9) and on writing \ = (—1)™ ‘eieu and f = 1 + u, 
we find that 


u l 0 ved 0 
i ois emo 0 u 1 eee 0 
(— 1)” 7 (— 1)” . (— 5)" ‘f ve uw 4 f 
ae” af. Pa 1 we a 2 + er + u + 1). 
Consequently the roots of $\Delta = 0$ are all distinct and of absolute value one.


⁶ J. Williamson, The expansion of determinants of composite order, American Mathematical Monthly, vol. 40 (1933), p. 67, formula 7.








Lemma 3. If Z is the matrix of Result 3a, the elementary divisors of $Z - \lambda[G_m, \epsilon]$ are all linear and of the form $\lambda - \omega$, where $\omega$ is of absolute value one.

If p = e” and A is the determinant of this pencil, we deduce, as in Lemmas 
1 and 2, that A is equal to a determinant of order m + 1; in fact 


=i ” a ees 0 0 

0 —» oe. 0 0 
rs (rs... 0 a |’ 
(—1f"e”  (—17"*e” «.. =< Pade 


where 8B = &” — ee”®d/2. On writing eA = ye” and adding 8 times the last 
row to the last but one, we see that 


= 1] ats 0 0 
0 —p +--+ 0 0 
(19) Aa emre a con hen ees iki |. 
0 O ve =p 14a + be! | 
(—1)” (-1)"" --- —1 l—4u 
On replacing the last row of the determinant on the right of (18) by 
(—1)"row, + (—1)" "row, + --- —rowm + (u° + 1)roWms1, we obtain 
my 1 ose 0 0 
. ‘a 0 --w --- O 0 : 
(1 + wa — 2m+1)i6 ‘ | ak ‘a 
0 Ov =e L- gu + he 
(—1)" 0O cae 0 7 


where y = (1 + u»)(1 — uw) — (1 — 4u + 4x’) = u(hu — § — ws’). Hence 
o(u) = (—1)"yw"™ +1 — du + be, 
(—1)" we? — da + + Be — J +1. 


We now proceed to show that the equation 




(20) o(u) = 0, 
has 2m + 3 distinct roots of absolute value one. If m is odd and $\mu = e^{2it}$, (20) is equivalent to
(21) f(t) = cos (2m + 3)t — 4 cos (2m + 1)t + § cos (2m — 1)t = 0. 
Let

$$t_k = \frac{k\pi}{2m + 3} \qquad (k = 0, 1, 2, \dots, 2m + 3).$$














If k is odd, $\cos (2m + 3)t_k = -1$, while $|\cos (2m + 1)t_k| < 1$ and $|\cos (2m - 1)t_k| < 1$.

Accordingly, when k is odd, $f(t_k)$ is negative. Similarly, when k is even, $f(t_k)$ is positive. Consequently, f(t) has one zero between $t_k$ and $t_{k+1}$ and therefore f(t) has at least 2m + 3 distinct zeros between 0 and $\pi$. Hence there are at least 2m + 3 distinct values of t between 0 and $\pi$, such that $e^{2it}$ is a zero of $\phi(\mu)$. There are therefore 2m + 3 distinct roots of $\phi(\mu) = 0$ which have absolute value one, and, since $\phi(\mu)$ is of degree 2m + 3, all the roots of $\phi(\mu) = 0$ are distinct and of absolute value one. By a slight modification of the above argument we arrive at the same result when m is even. Therefore the roots of $\phi(\mu) = 0$ are all distinct and have absolute value one. Since the final reduction to normal form in §3 was orthogonal, it is a consequence of Lemmas 1, 2 and 3 that, if C is the matrix defined by (17), the elementary divisors of $C - \lambda I_m$ are all linear and of the form $\lambda - \omega$, where $\omega$ is of absolute value one. Accordingly we have proved

THEOREM 6. If A is a quasi-unitary matrix (unitary with respect to $I_m$), then A is similar under a quasi-unitary transformation to a matrix C, where the elementary divisors of $C - \lambda I_m$ are all simple and of the form $\lambda - \omega$, where $\omega$ is of absolute value one.

If A is unitary, that is, if $I_m = E$, Theorem 6 remains true when C is replaced by A, and we obtain the theorem that the latent roots of a unitary matrix are all of absolute value one. That no such simplification is possible for quasi-unitary matrices is shown by the following example.

If F is an arbitrary non-singular matrix, then

$$\begin{pmatrix} 0 & F \\ (F^*)^{-1} & 0 \end{pmatrix} \begin{pmatrix} 0 & E \\ E & 0 \end{pmatrix} \begin{pmatrix} 0 & F^{-1} \\ F^* & 0 \end{pmatrix} = \begin{pmatrix} 0 & E \\ E & 0 \end{pmatrix}$$

and the elementary divisors of

$$\begin{pmatrix} 0 & F \\ (F^*)^{-1} & 0 \end{pmatrix} - \lambda \begin{pmatrix} 0 & E \\ E & 0 \end{pmatrix}$$

are the elementary divisors of $F - \lambda E$ together with those of $(F^*)^{-1} - \lambda E$. If O is the orthogonal matrix which reduces

$$\begin{pmatrix} 0 & E \\ E & 0 \end{pmatrix}$$

to $I_m$, the matrix

$$O \begin{pmatrix} 0 & F \\ (F^*)^{-1} & 0 \end{pmatrix} O' = A$$

is quasi-unitary, and the elementary divisors of $A - \lambda I_m$ are still the elementary divisors of $F - \lambda E$ and $(F^*)^{-1} - \lambda E$. Hence the elementary divisors of $A - \lambda I_m$ need not be simple or even of the type $(\lambda - \omega)^e$, where $\omega$ is of absolute value one.
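The first identity of the example can be confirmed exactly for a concrete F. The sketch below takes F to be a real two-rowed Jordan block with latent root 2 (an illustrative choice, so that $F - \lambda E$ has the non-simple elementary divisor $(\lambda - 2)^2$ with a root not of absolute value one) and checks that $Z G Z^* = G$.

```python
from fractions import Fraction as Fr

def matmul(X, Y):
    return [[sum(X[i][k] * Y[k][j] for k in range(len(Y)))
             for j in range(len(Y[0]))] for i in range(len(X))]

def transpose(X):
    """Transpose; equals the conjugate transpose for real entries."""
    return [list(col) for col in zip(*X)]

def inverse_2x2(X):
    (a, b), (c, d) = X
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def anti_block(top_right, bottom_left):
    """Assemble the 4x4 matrix [[0, top_right], [bottom_left, 0]] from 2x2 blocks."""
    zero = [Fr(0), Fr(0)]
    return ([zero + row for row in top_right] +
            [row + zero for row in bottom_left])

E = [[Fr(1), Fr(0)], [Fr(0), Fr(1)]]
F = [[Fr(2), Fr(1)], [Fr(0), Fr(2)]]        # illustrative Jordan block, root 2
F_star_inv = transpose(inverse_2x2(F))      # (F*)^{-1}; F is real here
Z = anti_block(F, F_star_inv)
G = anti_block(E, E)

if __name__ == "__main__":
    print("Z G Z* == G:", matmul(matmul(Z, G), transpose(Z)) == G)
```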


THE JOHNS HOPKINS UNIVERSITY.








STABLE LAWS OF PROBABILITY AND COMPLETELY MONOTONE 
FUNCTIONS 


By S. Bochner


In discussing stability of laws of probability other than the Gaussian, P. 
Lévy’ has proved the following two statements about the Fourier transform 


a 


I. If $0 < p \le 1$, $V_p(\alpha)$ is non-negative for all real values of $\alpha$.

II. If $1 < p < \infty$, $V_p(\alpha)$ assumes in $-\infty < \alpha < \infty$ both positive and negative values.

It is not hard to prove statement II. As for statement I, a simple proof is available in case $0 < p \le \frac{1}{2}$, but this proof cannot be extended to cover the case $\frac{1}{2} < p < 1$.

In the present note we shall give two new proofs for statement I. They are 
not of the easiest type, perhaps, but they do not distinguish between the two 
cases and they lead to more general classes of functions having non-negative 
Fourier transforms. 

We shall consider the class $\mathfrak{P}$ of positive-definite functions

$$f(x) = \int_{-\infty}^{\infty} e^{i\alpha x}\,dV(\alpha),$$

which are Fourier transforms of bounded non-negative distributions. They have the following properties:

1. if $f_1, f_2 \subset \mathfrak{P}$, $a_1 \ge 0$, $a_2 \ge 0$, then $a_1 f_1 + a_2 f_2 \subset \mathfrak{P}$;

2. if $f_1, f_2 \subset \mathfrak{P}$, then $f_1 f_2 \subset \mathfrak{P}$;

3. if $f_n \subset \mathfrak{P}$ and $\lim f_n$ exists uniformly in every finite interval, then $\lim f_n \subset \mathfrak{P}$.³

First proof. Excluding the trivial case p = 1, we have to prove that

$$f_p(x) = \exp\{-|x|^{2p}\}$$

belongs to $\mathfrak{P}$ for $0 < p < 1$. Since, for these values of p,


es 2p 1 
2p a da ; 
£ C, - G > @, 


Received September 20, 1937.

¹ P. Lévy, Calcul des Probabilités, 1925, pp. 252-277.

² G. Pólya, Herleitung des Gaussschen Fehlergesetzes aus einer Funktionalgleichung, Math. Zeitschrift, vol. 18 (1923), p. 109.

³ P. Lévy, loc. cit., Chapitre II; S. Bochner, Vorlesungen über Fouriersche Integrale, 1932, pp. 64-77.































f,(x) is, on every finite z-interval, the uniform limit of functions of the type 


n 


exp| - —", 
~ 2s (‘) 
b 


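Hence, by properties 3 and 1, it is sufficient to prove that

$$\exp\left\{-\frac{c x^2}{x^2 + b^2}\right\} = e^{-c} \exp\left\{\frac{c b^2}{x^2 + b^2}\right\}$$

belongs to $\mathfrak{P}$. But

$$\exp\left\{\frac{c b^2}{x^2 + b^2}\right\} = \sum_{n=0}^{\infty} \frac{c^n b^{2n}}{n!} (x^2 + b^2)^{-n}.$$

Therefore, because of properties 1, 2, 3, our assertion follows from the elementary fact that $(x^2 + b^2)^{-1}$ belongs to $\mathfrak{P}$, namely,

$$(x^2 + b^2)^{-1} = \frac{1}{2b} \int_{-\infty}^{\infty} e^{i\alpha x} e^{-b|\alpha|}\,d\alpha.$$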
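The last formula can be checked numerically. The sketch below is an illustration only: it uses the evenness of the integrand to reduce the integral to $2\int_0^\infty \cos(\alpha x) e^{-b\alpha}\,d\alpha$, truncates at $\alpha = 40/b$ (where the exponential factor is below $e^{-40}$), and applies Simpson's rule; the function name and step count are arbitrary choices.

```python
import math

def transform_of_exponential(x, b, steps=20_000):
    """Approximate (1/2b) * integral of exp(i a x) exp(-b|a|) over all real a."""
    upper = 40.0 / b                    # truncation point of the half-line
    h = upper / steps                   # steps must be even for Simpson's rule
    total = 0.0
    for k in range(steps + 1):
        alpha = k * h
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * math.cos(alpha * x) * math.exp(-b * alpha)
    half_line = total * h / 3           # integral over 0 <= a <= upper
    return 2 * half_line / (2 * b)

if __name__ == "__main__":
    x, b = 1.5, 2.0
    print(transform_of_exponential(x, b), 1 / (x * x + b * b))
```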
A general class of functions including $f_p(x)$, for which our argument remains in force, are


f(x) = exp {—2°¢(\x))}, 


where 
+ dy(a) = 0. 
Pe 


These functions g(x) are a type of completely monotone functions which have been recently investigated by D. V. Widder and R. P. Boas.⁴

Second proof. We shall use the following lemma.⁵ If a function f(y) is completely monotone in $0 \le y < \infty$, and $\psi(y)$ is a function vanishing at the origin whose derivative $\psi'(y)$ is completely monotone in $0 < y < \infty$, then $f(\psi(y))$ is again completely monotone in $0 \le y < \infty$. Therefore,

$$f(\psi(y)) = \int_0^{\infty} e^{-ty}\,d\gamma(t), \qquad d\gamma(t) \ge 0.$$

An admissible substitution is

$$\psi(y) = y^p, \qquad 0 < p < 1,$$

since

$$\psi'(y) = p y^{p-1} = \frac{p}{\Gamma(1 - p)} \int_0^{\infty} e^{-yt} t^{-p}\,dt.$$
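The representation of $\psi'(y)$ rests on the classical integral $\int_0^\infty e^{-yt} t^{-p}\,dt = \Gamma(1-p)\,y^{p-1}$. The following sketch checks it numerically under stated assumptions: the substitution $t = s^{1/(1-p)}$ (chosen here to remove the singularity at the origin), truncation where $e^{-yt} < e^{-40}$, and Simpson's rule. The function name and step count are illustrative.

```python
import math

def laplace_of_power(y, p, steps=20_000):
    """Approximate the integral of exp(-y t) t^(-p) over t > 0, for 0 < p < 1.

    With t = s^(1/(1-p)) the integrand becomes exp(-y s^(1/(1-p))) / (1 - p).
    """
    exponent = 1.0 / (1.0 - p)
    upper = (40.0 / y) ** (1.0 - p)     # s-value at which t = 40 / y
    h = upper / steps                   # steps must be even for Simpson's rule
    total = 0.0
    for k in range(steps + 1):
        s = k * h
        weight = 1 if k in (0, steps) else (4 if k % 2 else 2)
        total += weight * math.exp(-y * s ** exponent)
    return total * h / 3 / (1.0 - p)

def psi_prime(y, p):
    """Right-hand side p / Gamma(1 - p) times the integral above."""
    return p / math.gamma(1.0 - p) * laplace_of_power(y, p)

if __name__ == "__main__":
    y, p = 2.0, 0.5
    print(psi_prime(y, p), p * y ** (p - 1))
```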


⁴ D. V. Widder, The iterated Stieltjes transform, Proc. Nat. Acad. Sc., vol. 23 (1937), pp. 242-244.

⁵ Completely monotone functions of the Laplace operator for torus and sphere, this Journal, vol. 3 (1937), pp. 488-508.








Putting $f(y) = e^{-y}$, we obtain

$$e^{-y^p} = \int_0^{\infty} e^{-ty}\,d\gamma(t),$$

or

$$e^{-|x|^{2p}} = \int_0^{\infty} e^{-tx^2}\,d\gamma(t).$$

Obviously the right side is the uniform limit of finite sums of the type

$$\sum_{\nu=1}^{n} b_\nu e^{-t_\nu x^2}.$$

But $e^{-tx^2}$ belongs to $\mathfrak{P}$ and therefore, by properties 1, 2, 3, the function $f_p(x)$ does also.


PRINCETON UNIVERSITY. 



















ON SOME GENERALIZATIONS OF A THEOREM OF A. MARKOFF
By Einar Hille, G. Szegö, and J. D. Tamarkin


I. Introduction


1. The theorems of A. Markoff and of S. Bernstein concerning the derivative of a rational or of a trigonometric polynomial state that if $\|f\|$ denotes the maximum of the absolute value of a rational polynomial f(x) over a finite interval (a, b), or of a trigonometric polynomial over its interval of periodicity, then for the derivative f'(x) we have

(1.1) $\|f'\| \le A n^2 \|f\|$ or $\|f'\| \le A n \|f\|$,

respectively, where n is the degree of f(x) and A is a constant which does not depend on n or on f, but only on (b - a). In fact A = 1 in the case of rational polynomials considered on (-1, +1) and also in the case of trigonometric polynomials of period $2\pi$. These results can be stated in "abstract" form if we consider f(x) as an element of the space C of continuous functions and interpret $\|f\|$ as the "norm" of this element. A natural question arises then whether estimates similar to (1.1) hold if f(x) is considered as an element of other function spaces with different definition of the norm. The purpose of the present note is to answer this question for rational polynomials in the case of the space $L_p$, $p \ge 1$, where the norm is defined by

$$\|f\|_p = \left\{\frac{1}{b - a} \int_a^b |f(x)|^p\,dx\right\}^{1/p}.$$

2. The corresponding problem for trigonometric polynomials was solved in a much more general case by Zygmund¹ by using an important interpolation formula of M. Riesz.² According to this formula we have, for an arbitrary trigonometric polynomial of degree n and of period $2\pi$,

(1.2) $$|f'(x)| \le \sum_{\nu=1}^{2n} \rho_\nu^{(n)} |f(x + \theta_\nu^{(n)})|,$$

where $\rho_\nu^{(n)}$, $\theta_\nu^{(n)}$ are certain numbers which do not depend on f(x) and which satisfy

(1.3) $$\rho_\nu^{(n)} > 0, \qquad \sum_{\nu=1}^{2n} \rho_\nu^{(n)} = n,$$

(1.4) $$0 < \theta_1^{(n)} < \theta_2^{(n)} < \cdots < \theta_{2n}^{(n)} < 2\pi.$$
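The text does not give the numbers $\rho_\nu^{(n)}$, $\theta_\nu^{(n)}$ explicitly. In the usual statement of M. Riesz's formula, which is assumed here and is not taken from this paper, one has $\theta_\nu = (2\nu - 1)\pi/(2n)$, $\rho_\nu = 1/(4n \sin^2(\theta_\nu/2))$ and $f'(x) = \sum_\nu (-1)^{\nu+1} \rho_\nu f(x + \theta_\nu)$, from which (1.2) follows by the triangle inequality. The sketch below checks (1.3) and the formula on a sample trigonometric polynomial of degree 2.

```python
import math

def riesz_nodes(n):
    """Assumed nodes theta_k and weights rho_k, k = 1..2n (see the lead-in)."""
    nodes = []
    for k in range(1, 2 * n + 1):
        theta = (2 * k - 1) * math.pi / (2 * n)
        rho = 1.0 / (4 * n * math.sin(theta / 2) ** 2)
        nodes.append((theta, rho))
    return nodes

def riesz_derivative(f, x, n):
    """Derivative of a trigonometric polynomial of degree <= n at x."""
    return sum((-1) ** (k + 1) * rho * f(x + theta)
               for k, (theta, rho) in enumerate(riesz_nodes(n), start=1))

def sample_polynomial(x):
    return 3 * math.cos(2 * x) + math.sin(x) - 0.5

def sample_derivative(x):
    return -6 * math.sin(2 * x) + math.cos(x)

if __name__ == "__main__":
    print(sum(rho for _, rho in riesz_nodes(2)))
    print(riesz_derivative(sample_polynomial, 0.7, 2), sample_derivative(0.7))
```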


Received October 2, 1937.

¹ A remark on conjugate functions, Proceedings of the London Math. Soc., (2), vol. 34 (1932), pp. 392-400, esp. pp. 394-396.

² Eine trigonometrische Interpolationsformel und einige Ungleichungen für Polynome, Jahresbericht der Deutschen Math. Verein., vol. 22 (1914), pp. 354-368, esp. p. 356, (9), (10).




Now let $\varphi(u)$ be any non-negative, convex, and non-decreasing function of $u \ge 0$. If we use fundamental properties of convex functions, it is readily seen from (1.2) that

$$\int_0^{2\pi} \varphi(|f'(x)|/n)\,dx \le \int_0^{2\pi} \sum_{\nu=1}^{2n} \frac{\rho_\nu^{(n)}}{n}\,\varphi(|f(x + \theta_\nu^{(n)})|)\,dx = \int_0^{2\pi} \varphi(|f(x)|)\,dx.$$

The result for $L_p$ can be derived from this immediately by choosing $\varphi(u) = u^p$, whence

(1.5) $$\|f'\|_p \le n \|f\|_p.$$

This is a complete analogue of S. Bernstein's classical theorem which can be derived from (1.5) by allowing $p \to \infty$.


3. The corresponding problem for rational polynomials seems to be more complicated because no analogue of (1.2) is known in this case. We show, however, that A. Markoff's theorem still can be extended to the space $L_p$ by proving the following

THEOREM. Let $p \ge 1$ and let f(x) be an arbitrary rational polynomial of degree n. Then

(1.6) $$\left\{\int_{-1}^{+1} |f'(x)|^p\,dx\right\}^{1/p} \Big/ \left\{\int_{-1}^{+1} |f(x)|^p\,dx\right\}^{1/p} \le A n^2,$$

where A is a constant which depends only on p, but not on f(x) or on n.

For each n there exist polynomials f(x) of degree n such that the left member of (1.6) is $\ge B n^2$, where B is a constant of the same nature as A.

A. Markoff's theorem (with a less precise value of the constant A) is obtained from (1.6) by allowing $p \to \infty$. Another important case, namely, p = 2, was treated some time ago by E. Schmidt.³ The treatment of this special case given in Part V below is, as we understand, essentially identical with Schmidt's line of argument. In this special case the constant A above can be characterized in a more precise fashion than in the general case.


4. Neither customary methods used for the proof of A. Markoff's original theorem⁴ ($p = \infty$) nor E. Schmidt's elegant method (p = 2) seems to be applicable in the general case. In Parts II and III we give two variants of our proof of (1.6) (the first of them valid only for p > 1). Both may present interest even in the limiting case $p = \infty$. In Part IV we show that the "order"

³ Die asymptotische Bestimmung des Maximums des Integrals über das Quadrat der Ableitung eines normierten Polynoms, dessen Grad ins Unendliche wächst, Sitzungsberichte der Preussischen Akademie, 1932, p. 287. This note contains a statement of the result without proof.

⁴ See, e.g., G. Pólya and G. Szegö, Aufgaben und Lehrsätze aus der Analysis, Berlin, 1925, vol. 2, pp. 91, 287, problem 23.














$n^2$ of the bound in (1.6) is "the best possible". The determination of the precise value of the constant A seems to be a more difficult problem.


II. General case. First method


1. Our first proof of the inequality (1.6) is based on the following two lemmas, one of which is due to Gabriel and the other is an extension of a classical theorem of S. Bernstein-M. Riesz.⁵

Lemma 2.1. If $\Gamma$ is any convex closed curve in a complex z-plane and C any convex curve inside $\Gamma$, and if F(z) is regular inside and on $\Gamma$, then

(2.1) $$\int_C |F(z)|^\lambda\,|dz| \le G \int_\Gamma |F(z)|^\lambda\,|dz|.$$

Here $\lambda$ is any number $\ge 0$ and G an absolute constant.

Let C be any simple closed rectifiable Jordan curve in a complex z-plane and let

$$z = \psi(w) = cw + c_0 + c_1 w^{-1} + \cdots + c_n w^{-n} + \cdots, \qquad c > 0,$$

be the function which maps conformally the simply connected infinite domain exterior to C into the exterior of the unit circle $|w| = 1$ in the w-plane. Let $C_R$ be the image in the z-plane of the circle $|w| = R$. With this notation we have

Lemma 2.2. If f(z) is any polynomial of degree n, then

(2.2) $$\int_{C_R} |f(z)|^p\,|dz| \le R^{np+1} \int_C |f(z)|^p\,|dz|, \qquad p > 0.$$

Consider the function

$$\varphi(w) = w^{-n} f(\psi(w)) (\psi'(w))^{1/p}.$$

It is clear that this function is regular for $|w| > 1$ (including the point at infinity) and therefore the integral

$$I_R = \int_0^{2\pi} |\varphi(Re^{i\theta})|^p\,d\theta \uparrow \int_0^{2\pi} |\varphi(e^{i\theta})|^p\,d\theta$$

as $R \downarrow 1$.⁷ Here $\varphi(e^{i\theta}) = \lim_{R \to 1} \varphi(Re^{i\theta})$. Now

$$\int_{C_R} |f(z)|^p\,|dz| = R^{np+1} \int_0^{2\pi} |\varphi(Re^{i\theta})|^p\,d\theta = R^{np+1} I_R,$$

$$\int_C |f(z)|^p\,|dz| = \int_0^{2\pi} |\varphi(e^{i\theta})|^p\,d\theta,$$

and (2.2) follows at once.


⁵ Cf. M. Riesz, Über einen Satz des Herrn Serge Bernstein, Acta Mathematica, vol. 40 (1916), pp. 337-347.

⁶ R. M. Gabriel, Concerning integrals of moduli of regular functions along convex curves, Proceedings of the London Math. Soc., (2), vol. 39 (1935), pp. 216-231, esp. p. 229.

⁷ Concerning the monotony property of $I_R$ cf. Pólya-Szegö, loc. cit., vol. 1, p. M4 and p. 380, problem 310. The fact that the limit function $\varphi(e^{i\theta})$ exists and belongs to $L_p$ over $(0, 2\pi)$ is well known and is trivial in the case which will occur in the subsequent discussion.







Remark. It is clear that the preceding argument can be applied in the case where C is a rectifiable Jordan arc. The only change will consist in replacing $\int_C$ by $2\int_C$. Thus we have

(2.3) $$\int_{C_R} |f(z)|^p\,|dz| \le 2R^{np+1} \int_C |f(z)|^p\,|dz|.$$


2. We now are prepared to give a proof of (1.6) in the case p > 1; this case will be assumed to hold throughout the remainder of this part. Let f(x) be an arbitrary polynomial of degree n. By Cauchy's formula we have

(2.4) $$f'(x) = \frac{1}{2\pi i} \int_{C_x} \frac{f(z)}{(z - x)^2}\,dz;$$

here $-1 \le x \le +1$ and the contour of integration $C_x$, which may depend on x, will be specified later.⁸ Hölder's inequality yields

(2.5) $$|f'(x)|^p \le (2\pi)^{-p} \left\{\int_{C_x} |f(z)|^p\,|dz|\right\} \left\{\int_{C_x} |z - x|^{-2p'}\,|dz|\right\}^{p/p'},$$

where $1/p + 1/p' = 1$.

Let R > 1 be arbitrary and let $E_R$ denote the ellipse with foci at $\pm 1$ and semi-axes

(2.6) $a = \frac{1}{2}(R + R^{-1})$, $b = \frac{1}{2}(R - R^{-1})$.

An elementary discussion⁹ gives the following expression for the shortest distance $D = D(x, R)$ of x from $E_R$:

(2.7) $$D(x, R) = \begin{cases} b(1 - x^2)^{1/2}, & \text{if } |x| \le a^{-1}, \\ a - |x|, & \text{if } a^{-1} \le |x| \le 1. \end{cases}$$

Now choose for $C_x$ the circle with the center x and radius D; this circle is internally tangent to $E_R$. Then the last factor in the right member of (2.5) does not exceed $(2\pi D^{1-2p'})^{p/p'} = (2\pi)^{p-1} D^{-p-1}$. As for the first expression in braces in (2.5), a successive application of Lemma 2.1 and of the remark following Lemma 2.2 yields

$$\int_{C_x} |f(z)|^p\,|dz| \le G \int_{E_R} |f(z)|^p\,|dz| \le 2GR^{np+1} \int_{-1}^{+1} |f(x)|^p\,dx.$$
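Formula (2.7) follows from the identity in the footnote: on the ellipse the squared distance to (x, 0) equals $(a^{-1}u - ax)^2 + b^2(1 - x^2)$, which is minimized at $u = a^2 x$ when $|x| \le a^{-1}$ and at $u = \pm a$ otherwise. The sketch below compares (2.7) with a brute-force minimum over sampled points of $E_R$; the function names, the sample count and the test values of R are arbitrary choices made for illustration.

```python
import math

def semi_axes(R):
    """Semi-axes (2.6) of the ellipse E_R with foci at +1 and -1."""
    return 0.5 * (R + 1.0 / R), 0.5 * (R - 1.0 / R)

def distance_formula(x, R):
    """Shortest distance from the real point x in [-1, 1] to E_R, as in (2.7)."""
    a, b = semi_axes(R)
    if abs(x) <= 1.0 / a:
        return b * math.sqrt(1.0 - x * x)
    return a - abs(x)

def distance_brute(x, R, samples=20_000):
    """Minimum distance to sampled points (a cos t, b sin t) of E_R."""
    a, b = semi_axes(R)
    return min(math.hypot(a * math.cos(2 * math.pi * k / samples) - x,
                          b * math.sin(2 * math.pi * k / samples))
               for k in range(samples))

if __name__ == "__main__":
    for x in (0.0, 0.5, 0.9, 0.95, 1.0):
        print(x, distance_formula(x, 1.5), distance_brute(x, 1.5))
```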


⁸ An analogous argument, in the essentially simpler case $p = \infty$, was used by Montel, Sur les polynomes d'approximation, Bulletin de la Société Mathématique de France, vol. 46 (1918), pp. 151-192, esp. pp. 160-161.

⁹ For $b^2u^2 + a^2v^2 = a^2b^2$ we have $\{(u - x)^2 + v^2\}^{1/2} = \{(a^{-1}u - ax)^2 + b^2(1 - x^2)\}^{1/2}$.








ase 
ing 


ase 


1z 


iS- 





SOME GENERALIZATIONS OF THEOREM OF A. MARKOFF 733 
On substituting these results in (2.5) we have 


+1 tl +1 
(2.8) / f(x) "dr Sr cr | | f(x) rar. | | D(x, R)}-? "de. 
il a" =" 


3. To obtain the most favorable estimate for the right-hand member of 
(2.8) let R | 1, so that 


a—12}(R - 1)’, b~yR- 1. 
Then 


" l/a 
il [ D(x, R)}-" "da = avr [ (Q— 2)" dz 


(2.9) ; 
+ 2 | (a — x)” ‘dz, 
l/a 


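where

$$b^{-p-1}\int_0^{1/a}(1 - x^2)^{-\frac12(p+1)}\,dx \cong 2^{-\frac12(p+1)}\,b^{-p-1}\int_0^{1/a}(1 - x)^{-\frac12(p+1)}\,dx \cong 2^{-\frac12(p+1)}\,\frac{2}{p-1}\,(1 - 1/a)^{-\frac12(p-1)}(R - 1)^{-p-1} \cong \frac{(R - 1)^{-2p}}{p - 1},$$

$$\int_{1/a}^{1}(a - x)^{-p-1}\,dx = p^{-1}(a - 1)^{-p}\{1 - (1 + 1/a)^{-p}\} \cong 2^{p}p^{-1}(1 - 2^{-p})(R - 1)^{-2p},$$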
so that finally

$$\int_{-1}^{+1}\{D(x, R)\}^{-p-1}\,dx = O\{(R - 1)^{-2p}\}.$$

On putting R = 1 + 1/n we obtain an upper bound for the right member in
(2.9), which is $O\{(R - 1)^{-2p}\} = O(n^{2p})$. On substituting into (2.8) we obtain
inequality (1.6) in the case p > 1.
It should be observed that in the case p = 1 the same method yields an
estimate $O(n^2 \log n)$ for (2.9). This leads to a result much less precise than
that obtained in Part III by the second method.


III. General case. Second method
1. Our second method in turn is based on two lemmas which may be of
interest in themselves and on the corresponding result for trigonometric
polynomials, which was stated in Part I.
LEMMA 3.1. If $p \ge 1$ and f(x) is an arbitrary polynomial ($\not\equiv 0$) of degree n,
then

$$\int_{-1}^{+1}|f(x)|^p(1 - x^2)^{-\frac12}\,dx < 2(np + 1)^{np+1}(np)^{-np}\int_{-1}^{+1}|f(x)|^p\,dx. \tag{3.1}$$
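As a sanity check of Lemma 3.1, one can test it on the monomial f(x) = x^n, for which both integrals have closed forms: the weighted integral is the Beta function B((np+1)/2, 1/2) and the unweighted one is 2/(np+1). This is a minimal sketch with illustrative function names; it checks one family of polynomials only and is not a proof.

```python
import math

def weighted_integral(n, p):
    """Integral of |x|^(n p) (1 - x^2)^(-1/2) over [-1, 1], i.e. B((np+1)/2, 1/2)."""
    s = n * p
    return math.exp(math.lgamma((s + 1) / 2) + math.lgamma(0.5)
                    - math.lgamma(s / 2 + 1))

def lemma_bound(n, p):
    """Right member of Lemma 3.1 for f(x) = x^n: 2 (np+1)^(np+1) (np)^(-np) * 2/(np+1)."""
    s = n * p
    constant = 2.0 * math.exp((s + 1) * math.log(s + 1) - s * math.log(s))
    return constant * 2.0 / (s + 1)
```

For this family the right member simplifies to 4(1 + 1/(np))^{np}, which stays between 8 and 4e, while the left member never exceeds 2 when np ≥ 1, so the inequality holds with room to spare.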











734 EINAR HILLE, G. SZEGO, AND J. D. TAMARKIN 


LEMMA 3.2. Under the assumptions of Lemma 3.1 we have¹⁰

$$\int_{-1}^{+1}|f(x)|^p\,dx < \Big(\frac{2}{p-1}\Big)^{p-1}(np + p)^{np+p}(np + 1)^{-np-1}\int_{-1}^{+1}|f(x)|^p(1 - x^2)^{\frac12(p-1)}\,dx. \tag{3.2}$$


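In what follows we shall use the elementary transformation $x = \tfrac12(w + w^{-1})$
which for $w = e^{i\theta}$ reduces to $x = \cos\theta$. We shall also use integrals extended
over certain circles $|w| = \rho$. We therefore shall write $w = \rho e^{i\theta}$ and integrate
with respect to $\theta$ over $0 \le \theta \le 2\pi$. Now, to prove Lemma 3.1, let $0 < r < 1$.
Then¹¹

$$\frac12\int_{|w|=r}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,\Big|\frac{w^2 - 1}{2}\Big|\,d\theta < \frac12\int_{|w|=1}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,\Big|\frac{w^2 - 1}{2}\Big|\,d\theta$$

$$= \frac12\int_0^{2\pi}|f(\cos\theta)|^p\,|\sin\theta|\,d\theta = \int_{-1}^{+1}|f(x)|^p\,dx = I_1.$$

Since $|w^2 - 1| \ge 1 - r^2$ for $|w| = r$, we have

$$\int_{|w|=r}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,d\theta < \frac{4}{1 - r^2}\,I_1,$$

whence

$$\int_{-1}^{+1}|f(x)|^p(1 - x^2)^{-\frac12}\,dx = \frac12\int_0^{2\pi}|f(\cos\theta)|^p\,d\theta = \frac12\int_{|w|=1}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,d\theta$$

$$< \frac12\int_{|w|=r^{-1}}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,d\theta = \frac12\,r^{-2np}\int_{|w|=r}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,d\theta < \frac{2r^{-2np}}{1 - r^2}\,I_1.$$

Lemma 3.1 follows immediately if we put $1 - r^2 = (np + 1)^{-1}$.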


¹⁰ The factor $\big(\frac{2}{p-1}\big)^{p-1}$ should be replaced by 1 when p = 1.

¹¹ To derive this inequality it is perhaps simplest to observe that if $F_1(z), \cdots, F_m(z)$
are any set of regular analytic functions, then

$$F(z) = |F_1(z)|^{p_1} \cdots |F_m(z)|^{p_m},\qquad p_1 \ge 0, \cdots, p_m \ge 0,$$

is subharmonic, and to use well-known properties of subharmonic functions. Cf. Radó,
Subharmonic functions, Ergebnisse der Mathematik, vol. 5, Berlin, 1937; see p. 8, 2.4.













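A similar argument can be used in proving Lemma 3.2. Indeed, we have

$$\int_{-1}^{+1}|f(x)|^p\,dx = \frac12\int_0^{2\pi}|f(\cos\theta)|^p\,|\sin\theta|\,d\theta$$

$$= \frac12\int_{|w|=1}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,\Big|\frac{w^2 - 1}{2}\Big|\,d\theta < \frac12\int_{|w|=r^{-1}}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,\Big|\frac{w^2 - 1}{2}\Big|\,d\theta$$

$$= \frac12\,r^{-2np-2}\int_{|w|=r}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,\Big|\frac{1 - w^2}{2}\Big|\,d\theta$$

$$\le \frac14\,r^{-2np-2}(1 - r^2)^{1-p}\int_{|w|=r}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,|1 - w^2|^p\,d\theta$$

$$< \frac14\,r^{-2np-2}(1 - r^2)^{1-p}\int_{|w|=1}\big|w^n f\{\tfrac12(w + w^{-1})\}\big|^p\,|1 - w^2|^p\,d\theta$$

$$= 2^{p-1}r^{-2np-2}(1 - r^2)^{1-p}\,I_2,$$

where

$$I_2 = \int_{-1}^{+1}|f(x)|^p(1 - x^2)^{\frac12(p-1)}\,dx.$$

Lemma 3.2 follows immediately if we write here

$$1 - r^2 = (p - 1)(np + p)^{-1}.$$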


2. Inequality (1.6) now is readily derived by combining Lemmas 3.1 and
3.2 with Zygmund’s result (1.5). Indeed, if we put again

$$I_1 = \int_{-1}^{+1}|f(x)|^p\,dx,$$

we have by Lemma 3.1

$$\int_0^{2\pi}|f(\cos\theta)|^p\,d\theta < 4(np + 1)^{np+1}(np)^{-np}\,I_1,$$

whence, by (1.5),

$$\int_0^{2\pi}|f'(\cos\theta)\sin\theta|^p\,d\theta < 4(np + 1)^{np+1}(np)^{-np}\,n^p\,I_1.$$

On replacing n by (n − 1) in Lemma 3.2 we now have

$$\int_{-1}^{+1}|f'(x)|^p\,dx < \Big(\frac{2}{p-1}\Big)^{p-1}(np)^{np}(np - p + 1)^{-np+p-1}\cdot 2(np + 1)^{np+1}(np)^{-np}\,n^p\,I_1$$

$$= 2^p(p - 1)^{1-p}(np + 1)^{np+1}(np - p + 1)^{-np+p-1}\,n^p\,I_1.$$








The coefficient of $I_1$ is $\cong (2ep)^p(p - 1)^{1-p}n^{2p}$ as $n \to \infty$ for p fixed. This
proves (1.6).
On the other hand, if we allow $p \to \infty$ for n fixed and observe that

$$\lim_{p\to\infty}\big\{2^p(p - 1)^{1-p}(np + 1)^{np+1}(np - p + 1)^{-np+p-1}n^p\big\}^{1/p} = 2n^2\Big(1 - \frac1n\Big)^{-n+1} < 2en^2,$$

we obtain in the classical problem of A. Markoff the estimate

$$\|f'\| < 2en^2\,\|f\|.$$
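The two limiting statements about the coefficient of I_1 can be checked numerically. The sketch below assumes the coefficient has the form 2^p (p−1)^{1−p} (np+1)^{np+1} (np−p+1)^{−np+p−1} n^p, as obtained by combining Lemmas 3.1 and 3.2; the function names are illustrative. Logarithms are used so that large n or p do not overflow.

```python
import math

def log_coefficient(n, p):
    """log of 2^p (p-1)^(1-p) (np+1)^(np+1) (np-p+1)^(-np+p-1) n^p, for p > 1."""
    s = n * p
    return (p * math.log(2.0) + (1.0 - p) * math.log(p - 1.0)
            + (s + 1.0) * math.log(s + 1.0)
            - (s - p + 1.0) * math.log(s - p + 1.0)
            + p * math.log(n))

def coefficient_root(n, p):
    """p-th root of the coefficient, i.e. the factor multiplying the L^p norm of f."""
    return math.exp(log_coefficient(n, p) / p)

def large_n_constant(p):
    """Limit of coefficient_root(n, p) / n^2 as n grows: 2 e p (p-1)^((1-p)/p)."""
    return 2.0 * math.e * p * (p - 1.0) ** ((1.0 - p) / p)

def large_p_limit(n):
    """Limit of coefficient_root(n, p) as p grows: 2 n^2 (1 - 1/n)^(-n+1), for n >= 2."""
    return 2.0 * n * n * (1.0 - 1.0 / n) ** (-(n - 1))
```

For instance, `coefficient_root(10**6, 2.0) / 10**12` should be close to `large_n_constant(2.0)`, and `coefficient_root(5, 1e6)` close to `large_p_limit(5)`, which in turn is below 2e·25.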
IV. Exactness of the order $n^2$
1. Let $f(x) = P_n^{(\alpha,\alpha)}(x)$ be Jacobi’s polynomial in the “ultraspherical”
case.¹² Then, as $n \to \infty$,

$$\int_{-1}^{+1}|f(x)|^p\,dx \cong n^{\alpha p-2}\,2^{\alpha p+1}\int_0^{\infty}|u^{-\alpha}J_\alpha(u)|^p\,u\,du, \tag{4.1}$$

provided that $(\alpha + \tfrac12)p > 2$. Here $J_\alpha(u)$ is Bessel’s function of order $\alpha$.
The special case p = 1 of this formula can be found in a paper by Szegő.¹³

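The following line of argument is slightly simpler than the one used there.
We use the formula of “Mehler’s type”

$$\lim_{n\to\infty} n^{-\alpha}P_n^{(\alpha,\alpha)}\Big(\cos\frac{u}{n}\Big) = (u/2)^{-\alpha}J_\alpha(u), \tag{4.2}$$

which holds uniformly over every finite interval, and the estimate¹⁴

$$P_n^{(\alpha,\alpha)}(\cos\theta) = \theta^{-\alpha-\frac12}\,O(n^{-\frac12}),\qquad n^{-1} \le \theta \le \pi/2. \tag{4.3}$$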


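2. Let now $\omega$ be a fixed positive number. Then

$$\int_{-1}^{+1}|P_n^{(\alpha,\alpha)}(x)|^p\,dx = 2\int_0^{\pi/2}|P_n^{(\alpha,\alpha)}(\cos\theta)|^p\sin\theta\,d\theta = 2\int_0^{\omega/n} + 2\int_{\omega/n}^{\pi/2}.$$

The first term of the last sum according to (4.2) equals

$$2\int_0^{\omega}\Big|P_n^{(\alpha,\alpha)}\Big(\cos\frac{u}{n}\Big)\Big|^p\sin\frac{u}{n}\,\frac{du}{n} \cong n^{\alpha p-2}\,2^{\alpha p+1}\int_0^{\omega}|u^{-\alpha}J_\alpha(u)|^p\,u\,du.$$

The second term according to (4.3) is

$$O(n^{-\frac12 p})\int_{\omega/n}^{\pi/2}\theta^{-(\alpha+\frac12)p+1}\,d\theta = O(1)\,n^{\alpha p-2}\,\omega^{2-(\alpha+\frac12)p}.$$

Since $\omega$ can be taken arbitrarily large, this proves (4.1).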
¹² Cf. the notation in Pólya-Szegő, loc. cit., vol. 2, pp. 93, 94, 292, 293, problem 98.
¹³ Asymptotische Entwicklungen der Jacobischen Polynome, Schriften der Königsberger
Gelehrten Gesellschaft, 1933, pp. 35-112, esp. p. 88.
¹⁴ Cf. Szegő, loc. cit., pp. 74, 77.










3. Now we observe that¹⁵

$$f'(x) = \frac{d}{dx}P_n^{(\alpha,\alpha)}(x) = \big(\alpha + \tfrac12(n + 1)\big)P_{n-1}^{(\alpha+1,\alpha+1)}(x),$$

so that, by (4.1),

$$\int_{-1}^{+1}|f'(x)|^p\,dx \cong (n/2)^p\,n^{(\alpha+1)p-2}\,2^{(\alpha+1)p+1}\int_0^{\infty}|u^{-\alpha-1}J_{\alpha+1}(u)|^p\,u\,du, \tag{4.4}$$

and the ratio of the integrals in the left members of (4.4) and (4.1) remains
$> Bn^{2p}$, as stated in the theorem of Part I, §3.
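The differentiation formula for ultraspherical Jacobi polynomials used above can be checked numerically. The sketch below evaluates the Jacobi polynomial from its standard finite-sum representation with generalized binomial coefficients (an assumption of this example, not something stated in the paper) and compares a central difference with the right member of the identity. All function names are illustrative.

```python
import math

def gen_binom(z, k):
    """Generalized binomial coefficient C(z, k) for real z > k - 1 and integer k >= 0."""
    return math.gamma(z + 1) / (math.gamma(k + 1) * math.gamma(z - k + 1))

def jacobi(n, alpha, beta, x):
    """Jacobi polynomial P_n^(alpha,beta)(x) via the explicit finite sum."""
    total = 0.0
    for s in range(n + 1):
        total += (gen_binom(n + alpha, n - s) * gen_binom(n + beta, s)
                  * ((x - 1) / 2) ** s * ((x + 1) / 2) ** (n - s))
    return total

def derivative_gap(n, alpha, x, h=1e-5):
    """|central difference of P_n^(a,a) at x  -  (a + (n+1)/2) P_{n-1}^(a+1,a+1)(x)|."""
    numeric = (jacobi(n, alpha, alpha, x + h) - jacobi(n, alpha, alpha, x - h)) / (2 * h)
    exact = (alpha + 0.5 * (n + 1)) * jacobi(n - 1, alpha + 1, alpha + 1, x)
    return abs(numeric - exact)
```

With alpha = 0 the function `jacobi` reduces to the Legendre polynomial, which gives a quick independent check of the representation.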
V. E. Schmidt’s case p = 2

1. Without loss of generality we can confine ourselves to real polynomials
f(x). Indeed, if we write f(x) = g(x) + ih(x), where g(x) and h(x) have real
coefficients, we have

$$\int_{-1}^{+1}|f(x)|^2\,dx = \int_{-1}^{+1}\{g(x)\}^2\,dx + \int_{-1}^{+1}\{h(x)\}^2\,dx,$$

$$\int_{-1}^{+1}|f'(x)|^2\,dx = \int_{-1}^{+1}\{g'(x)\}^2\,dx + \int_{-1}^{+1}\{h'(x)\}^2\,dx. \tag{5.1}$$


Let now $f(x) = a_0 + a_1x + a_2x^2 + \cdots + a_nx^n$, $n \ge 2$. Then the determination
of the maximum of $\int_{-1}^{+1}\{f'(x)\}^2\,dx$ under the condition $\int_{-1}^{+1}\{f(x)\}^2\,dx = 1$
is a characteristic value problem leading to the system of equations

$$\frac{\partial}{\partial a_\nu}\Big[\int_{-1}^{+1}\{f'(x)\}^2\,dx - \lambda\int_{-1}^{+1}\{f(x)\}^2\,dx\Big] = 0 \qquad (\nu = 0, 1, 2, \cdots, n). \tag{5.2}$$

This system is equivalent to the condition that

$$\int_{-1}^{+1}f'(x)q'(x)\,dx - \lambda\int_{-1}^{+1}f(x)q(x)\,dx = 0 \tag{5.3}$$

be satisfied by an arbitrary polynomial q(x) of degree n. Integration by parts
gives

$$\int_{-1}^{+1}\{f''(x) + \lambda f(x)\}q(x)\,dx = f'(1)q(1) - f'(-1)q(-1). \tag{5.4}$$


2. Introducing Legendre polynomials $\{P_\nu(x)\}$ in the usual notation, we put
in (5.3)

$$q(x) = \sum_{\nu=0}^{n}(\nu + \tfrac12)P_\nu(x)P_\nu(y), \tag{5.5}$$

¹⁵ Szegő, loc. cit., p. 38, (5).










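where y is a parameter. By using familiar properties of Legendre polynomials,
we obtain

$$f''(y) + \lambda f(y) = f'(1)\sum_{\nu=0}^{n}(\nu + \tfrac12)P_\nu(y) - f'(-1)\sum_{\nu=0}^{n}(-1)^\nu(\nu + \tfrac12)P_\nu(y)$$

$$= \tfrac12 f'(1)\{P_n'(y) + P_{n+1}'(y)\} + (-1)^n\tfrac12 f'(-1)\{P_n'(y) - P_{n+1}'(y)\}. \tag{5.6}$$

Now it is helpful to write

$$f(y) = \alpha_0P_{n+1}'(y) + \alpha_1P_n'(y) + \alpha_2P_{n+1}'''(y) + \alpha_3P_n'''(y) + \cdots. \tag{5.7}$$

This is obviously possible if the real constants $\alpha_0, \alpha_1, \alpha_2, \cdots, \alpha_n$ are suitably
chosen. On substituting (5.7) into (5.6) and comparing the coefficients we get

$$\lambda\alpha_0 = \tfrac12 f'(1) - \tfrac12(-1)^nf'(-1) = \alpha_0P_{n+1}''(1) + \alpha_2P_{n+1}^{(4)}(1) + \cdots,$$
$$\lambda\alpha_1 = \tfrac12 f'(1) + \tfrac12(-1)^nf'(-1) = \alpha_1P_n''(1) + \alpha_3P_n^{(4)}(1) + \cdots, \tag{5.8}$$
$$\lambda\alpha_\nu + \alpha_{\nu-2} = 0 \qquad (\nu = 2, 3, \cdots, n).$$

This system is equivalent to (5.2) and readily furnishes the characteristic values
and functions. Indeed, since $\lambda$ is real and positive, we have

$$\alpha_{2\nu} = (-\lambda)^{-\nu}\alpha_0,\qquad \alpha_{2\nu+1} = (-\lambda)^{-\nu}\alpha_1 \qquad (\nu \ge 1), \tag{5.9}$$

and

$$\lambda\alpha_0 = \alpha_0\{P_{n+1}''(1) - \lambda^{-1}P_{n+1}^{(4)}(1) + \lambda^{-2}P_{n+1}^{(6)}(1) - \cdots\},$$
$$\lambda\alpha_1 = \alpha_1\{P_n''(1) - \lambda^{-1}P_n^{(4)}(1) + \lambda^{-2}P_n^{(6)}(1) - \cdots\}. \tag{5.10}$$

Thus the set of characteristic values is obtained from the two equations

$$P_{n+1}(1) - \lambda^{-1}P_{n+1}''(1) + \lambda^{-2}P_{n+1}^{(4)}(1) - \cdots = 0,$$
$$P_n(1) - \lambda^{-1}P_n''(1) + \lambda^{-2}P_n^{(4)}(1) - \cdots = 0. \tag{5.11}$$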


3. Observe that we have for all »v and n 


a (n + 2r)! 1 3 


(5. "Pe () = 
5.12) n'P, (1) =n (n — 2Qv)! 22(2v)! ~~ 2(2v)!’ 


while for $\nu$ fixed

$$\lim_{n\to\infty} n^{-4\nu}P_n^{(2\nu)}(1) = \frac{1}{2^{2\nu}(2\nu)!}. \tag{5.13}$$

Let $\lambda_{n0}, \lambda_{n1}, \cdots, \lambda_{nn}$ denote the characteristic values in decreasing order. If
we observe (5.12), (5.13), and apply Hurwitz’ theorem to each of the equations
(5.11) we readily see that the limit

$$\lim_{n\to\infty} n^{-4}\lambda_{nk} = \lambda_k \qquad (k = 0, 1, 2, \cdots) \tag{5.14}$$










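exists for each fixed k and that furthermore the set $\{\lambda_k\}$ represents the set of all
the roots of

$$\sum_{\nu=0}^{\infty}\frac{(-\lambda^{-1})^\nu}{2^{2\nu}(2\nu)!} = \cos\big(\tfrac12\lambda^{-\frac12}\big) = 0, \tag{5.15}$$

so that

$$\lambda_k = \pi^{-2}(2k + 1)^{-2} \qquad (k = 0, 1, 2, \cdots). \tag{5.16}$$

If now $M_n^2$ denotes the maximum of the ratio of the integrals $\int_{-1}^{+1}\{f'(x)\}^2\,dx$
and $\int_{-1}^{+1}\{f(x)\}^2\,dx$, we have the final result¹⁶

$$\lim_{n\to\infty} n^{-2}M_n = \lim_{n\to\infty} n^{-2}\lambda_{n0}^{\frac12} = \lambda_0^{\frac12} = \pi^{-1}. \tag{5.17}$$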
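The limit (5.17) can be explored numerically from the secular equations (5.11). The sketch below relies on the closed form P_n^{(k)}(1) = (n+k)! / ((n−k)! 2^k k!) for the derivatives of Legendre polynomials at 1, treats each equation of (5.11) as a polynomial in μ = 1/λ, and locates its smallest positive root by a scan followed by bisection. The function names and the scan step are illustrative choices; taking the smaller of the two roots avoids assuming in advance which equation supplies the largest characteristic value.

```python
import math

def legendre_derivative_at_one(n, k):
    """P_n^(k)(1) = (n+k)! / ((n-k)! 2^k k!) for 0 <= k <= n, and 0 for k > n."""
    if k > n:
        return 0.0
    return math.factorial(n + k) / (math.factorial(n - k) * 2 ** k * math.factorial(k))

def secular(n, mu):
    """Left member of an equation of type (5.11) with mu = 1/lambda."""
    return sum((-mu) ** nu * legendre_derivative_at_one(n, 2 * nu)
               for nu in range(n // 2 + 1))

def smallest_positive_root(n):
    """Smallest positive root in mu of secular(n, mu), for n >= 2."""
    step = 0.03 / legendre_derivative_at_one(n, 2)
    lo = 0.0
    while secular(n, lo + step) > 0.0:   # secular(n, 0) = 1 > 0
        lo += step
    hi = lo + step
    for _ in range(200):                 # bisection on the bracketing interval
        mid = 0.5 * (lo + hi)
        if secular(n, mid) > 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

def largest_characteristic_value(n):
    """Largest lambda solving either equation of (5.11), for n >= 2."""
    mu = min(smallest_positive_root(n + 1), smallest_positive_root(n))
    return 1.0 / mu

def normalized_constant(n):
    """pi * n^(-2) * sqrt(lambda_n0); by (5.17) this should tend to 1."""
    return math.pi * math.sqrt(largest_characteristic_value(n)) / (n * n)
```

For n = 2 the computation gives λ = 15, which agrees with a direct calculation for f(x) = x² − 1/3. The quantity `normalized_constant(n)` approaches 1 only slowly, roughly like (1 + 3/(2n))², so moderate n still show a visible excess over 1.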

Remark. It can be shown that the largest characteristic value $\lambda_{n0}$ necessarily
is a root of the first of the equations (5.11). Indeed let $f_{n+1}(\lambda)$ and $f_n(\lambda)$ denote
the left-hand members of the two equations (5.11), respectively. A simple
calculation shows that


l n+1) n— 
(5.18) KM fara) + FO] = (mn + DNOPFA). 
If fasa(Ano) # 0, we necessarily have frsi(Ano) > 0, fnsa(Ano) > 0, fn (no) > 0, 
so that the left-hand member of (5.18) is positive for 4 = Ano, whereas 


fn(An0) = 0. This is a contradiction. 
In fact it is easy to show that $\lambda_{n0}$ satisfies only the first of the two equations
(5.11).
YALE UNIVERSITY, WASHINGTON UNIVERSITY, AND Brown UNIVERSITY. 


¹⁶ There is a slight discrepancy between this result and that of E. Schmidt according
to which this limit would be 1 instead of $\pi^{-1}$.








(n — 1)-DIMENSIONAL CHARACTERISTIC STRIPS OF A FIRST 
ORDER EQUATION AND CAUCHY’S PROBLEM 


By E. W. Titt 


Consider the first order partial differential equation

$$F(x^\alpha \mid z \mid p_\alpha) = 0 \qquad (p_\alpha = \partial z/\partial x^\alpha) \tag{a}$$

in one unknown z and n independent variables $x^\alpha$ ($n \ge 3$). The purpose of
the present paper is to generalize to the case of more than two independent
variables the usual geometrical discussion of Cauchy’s problem showing the
manifolds for which the problem is indeterminate.¹ In doing this we introduce
the concept of an (n − 1)-dimensional characteristic strip and study its relation
to the one-dimensional characteristic strips.


1. We first recall the geometrical approach to the one-dimensional characteristic
strip. If we assume that the space $S^{n+1}$ with coördinates $x^1, \cdots, x^n, z$
is Euclidean with rectangular Cartesian coördinates, the problem of integrating
the equation (a) is that of determining a hypersurface²

$$z = z(x^1, \cdots, x^n) \tag{1.1}$$

in $S^{n+1}$ such that the direction ratios $p_1 : \cdots : p_n : -1$ of the normal to (1.1)
satisfy the condition (a) at each point P of (1.1). The geometrical configuration
consisting of a point $P(x^1, \cdots, x^n, z)$ and a hyperplane passing through P,
namely,

$$Z - z = p_\alpha(X^\alpha - x^\alpha), \tag{1.2}$$

is called an element. In general, the integral elements at P, i.e., those which
are possible tangent hyperplanes to integral hypersurfaces at P, envelope a
hypercone T with vertex at P. It then follows easily that an integral element
(1.2) is tangent to the hypercone T along the generator given by³

$$\frac{X^1 - x^1}{F_{p_1}} = \cdots = \frac{X^n - x^n}{F_{p_n}} = \frac{Z - z}{p_\alpha F_{p_\alpha}}. \tag{1.3}$$

Received June 16, 1937; developed in part while the author was a National Research 


Fellow and mentioned in a paper presented to the American Mathematical Society, Feb- 
ruary 23, 1935. See Abstract 41-3-103, Bull. Amer. Math. Soc., vol. 41 (1935), p. 182. 

¹ See, for example, E. Goursat, Cours d’Analyse Mathématique, Tome II, 1924, Chapter
22. We do not discuss the regularity requirements on F or the initial manifold. For a
treatment of this question in the case of two independent variables the reader is referred
to G. A. Bliss, Princeton Colloquium Lectures, Amer. Math. Soc., 1913, p. 98.

² In $S^{n+1}$ we shall call an n-dimensional spread, a hypersurface, and an (n − 1)-dimensional
spread, an edge. For linear spreads we shall use the terminology hyperplane and
plane edge.

³ Cf. Goursat, loc. cit., p. 616.



The characteristic curves are the curves traced on an integral hypersurface
which are tangent at each point P to the generator (1.3) determined by the
integral element at P. As is well known, these curves are determined by the
system of ordinary equations

$$\frac{dx^1}{F_{p_1}} = \cdots = \frac{dx^n}{F_{p_n}} = \frac{dz}{p_\alpha F_{p_\alpha}} = \frac{dp_1}{-F_{x^1} - F_zp_1} = \cdots = \frac{dp_n}{-F_{x^n} - F_zp_n} = dv^1 \tag{1.4}$$

without knowledge of an integral hypersurface. The geometrical configuration
consisting of a curve together with a strip of elements along it satisfying the
system (1.4) will be called a one-dimensional characteristic strip.


2. Now let us turn to the consideration of Cauchy’s problem for equation (a),
i.e., the problem of passing an integral hypersurface (1.1) through an arbitrary
edge

$$x^\alpha = \xi^\alpha(v^1, \cdots, v^{n-1});\qquad z = \zeta(v^1, \cdots, v^{n-1}). \tag{2.1}$$

Generalizing the idea of a characteristic curve, by a characteristic edge we shall
mean an edge which lies on an integral hypersurface and which has the property
that its tangent plane edge at each point P contains the generator (1.3) determined
by the integral element at P. We shall now proceed to find for our
characteristic edge a system of partial differential equations, which is independent
of the integral hypersurface.

If the plane edge, tangent to the characteristic edge (2.1) at the point P,
is to contain the generator (1.3) associated with P, then the rank of the matrix

$$\begin{Vmatrix} F_{p_1} & \cdots & F_{p_n} & p_\alpha F_{p_\alpha}\\ x^1_1 & \cdots & x^n_1 & z_1\\ \vdots & & \vdots & \vdots\\ x^1_{n-1} & \cdots & x^n_{n-1} & z_{n-1} \end{Vmatrix}, \tag{2.2}$$

where the subscripts on the x’s and z denote partial derivatives with respect to
the v’s, must be less than n.⁴ Since the characteristic edge (2.1) lies on an
integral hypersurface, the integral element at P must contain the tangent plane
edge at P, i.e.,

$$z_a = p_\alpha x^\alpha_a. \tag{2.3}$$

Multiplying each of the first n columns of (2.2) by $p_\alpha$ and subtracting from the
last, we have on account of (2.3) that the condition on (2.2) is completely
equivalent to

$$F_{p_\alpha}D_\alpha = 0, \tag{2.4}$$

⁴ In what follows, when we use the term edge we always imply that the matrix obtained
by striking out the first row and last column of (2.2) is of maximum rank. Throughout
Greek letters $\alpha, \beta, \gamma, \cdots$ have the range $1, \cdots, n$; Latin letters $a, b, c, \cdots$, the range
$1, \cdots, n - 1$; and Latin letters $i, j, k$ the range $2, \cdots, n - 1$.










where the $D_\alpha$ are the cofactors of $F_{p_\alpha}$ in the determinant of the first n columns
of (2.2). Differentiating the equation (a) with respect to $x^\alpha$ we obtain

$$F_{p_\beta}p_{\alpha\beta} + F_zp_\alpha + F_{x^\alpha} = 0 \qquad \Big(p_{\alpha\beta} = \frac{\partial^2 z}{\partial x^\alpha\,\partial x^\beta}\Big). \tag{2.5}$$

If we adopt the notation $A^a_\alpha$ for the cofactor of the element $x^\alpha_a$ in the determinant
of the first n columns of (2.2), then we have immediately the relations

$$F_{p_\alpha}D_\beta + x^\alpha_aA^a_\beta = 0. \tag{2.6}$$

If $\alpha = \beta$, then (2.6) is equivalent to (2.4); and the other equations (2.6) are
obvious. We then multiply the equations,

$$\frac{\partial p_\alpha}{\partial v^a} = p_{\alpha\gamma}x^\gamma_a,$$

through by $A^a_\beta$, sum on a, and make use of (2.5) and (2.6), getting

$$A^a_\beta\frac{\partial p_\alpha}{\partial v^a} - D_\beta(F_zp_\alpha + F_{x^\alpha}) = 0. \tag{2.7}$$

If in particular $D_1 \ne 0$, the equations (2.7) with $\beta = 1$ imply the remainder.
For it follows from (2.6) that any vector $A^1_\beta, \cdots, A^{n-1}_\beta, D_\beta$ ($\beta \ne 1$) is either
a zero vector or proportional to $A^1_1, \cdots, A^{n-1}_1, D_1$. Let us associate with
each point of the characteristic edge (2.1) the integral element containing the
tangent plane edge. Then any set of (2n + 1) functions,

$$x^\alpha = \xi^\alpha(v),\qquad z = \zeta(v),\qquad p_\alpha = \pi_\alpha(v), \tag{2.8}$$

with $D_1 \ne 0$, which satisfy the system⁵

$$\text{(a)}\ F_{p_\alpha}D_\alpha = 0,\qquad \text{(b)}\ z_a - p_\alpha x^\alpha_a = 0,\qquad \text{(c)}\ A^a_1\frac{\partial p_\alpha}{\partial v^a} - D_1(F_zp_\alpha + F_{x^\alpha}) = 0, \tag{2.9}$$

will be called an (n − 1)-dimensional characteristic strip.
We have immediately that over an (n − 1)-dimensional characteristic strip
the function F = constant. For if we multiply the equation (2.9c) through by
$x^\alpha_b$, sum on $\alpha$ and make use of the equation (2.9b) we get

$$x^\alpha_aA^a_1\frac{\partial p_\alpha}{\partial v^b} - D_1(F_zp_\alpha + F_{x^\alpha})x^\alpha_b = 0.$$

This becomes

$$D_1\frac{\partial F}{\partial v^b} = 0$$

when use is made of (2.6) and (2.9b).

⁵ In connection with (2.9a) let us notice that an edge lying on an integral hypersurface
of a second order equation which satisfies the condition $F_{p_{\alpha\beta}}D_\alpha D_\beta = 0$ is the well-known
characteristic surface. Let us also notice that in case n = 2 the system (2.9) reduces
to (1.4).


































3. Now let us show that an (n − 1)-dimensional characteristic strip is composed
of an (n − 2)-parameter family of one-dimensional characteristic strips.
In order to show that each one-dimensional characteristic strip determined by an
element of our (n − 1)-dimensional characteristic strip lies entirely on our
(n − 1)-dimensional strip, we consider the system of ordinary equations

$$\frac{dv^a}{d\bar v^1} = -\frac{A^a_1}{D_1}, \tag{3.1}$$

where the right members are functions of $v^a$ alone in virtue of (2.8). By the
existence theorem for ordinary equations the system (3.1) has a unique solution,

$$v^a = V^a(\bar v^1 \mid v^b_0), \tag{3.2}$$

which reduces to $v^a_0$ for $\bar v^1 = 0$. The set of (2n + 1) functions of $\bar v^1$ obtained
by substituting (3.2) into (2.8) constitutes a one-dimensional characteristic
strip. For multiplying the equations (3.1) by $x^\alpha_a$, summing on a, and making
use of (2.9a) or (2.6) we find that the first n equations (1.4) are satisfied. The
next equation (1.4) is a consequence of the first n and the equation (2.9b).
The remaining equations (1.4) follow from (2.9c) and the equations (3.1).
Now let us consider any set of (n — 1) functions, 


(3.3) 6 = GP, --- , 8’), 
which for the set of values 3° = 0 make the determinant 


Aj[v9(0)] --- At “[vs(0)] 


av} (0) _ avo (0) 

(3.4) av" av" ~ 0. 
av}(0) ave '(0) 
op"! op"! 


On account of (3.4) the result of replacing the arbitrary constants vp in (3.2) 
by (3.3), namely, v* = v°("), can be regarded as a change of parameter in the 
neighborhood of the set of values «*° = 0. In the transformed strip (2.8) the 
one-dimensional strips along which é' varies are characteristic. Let the result 
of setting s' = 0 in the transformed strip (2.8) be denoted by 


(3.5) 250, --- , 8"); z(8", --- , 8"); pal®, ---,8"”). 


In the neighborhood of the set of values 6° = 0 the set of functions (3.5) satisfy 
the conditions 


(a) Flag | 20| paol = eonst., 


(3.6) azo are 










and make the rank of the matrix 


ee ¢ Poe 

ax) mle axy 

(3.7) av" av" 
ax) axe 
on"! ap"! 


equal to (n — 1). 

Conversely, let us show that through any (n − 2)-dimensional strip (3.5)
satisfying the conditions (3.6) and (3.7) there passes a unique (n − 1)-dimensional
characteristic strip. Suppose that we have an integral of the system (1.4)
depending on an auxiliary variable $\bar v^1$ and (2n + 1) arbitrary constants, $x^\alpha_0$,
$z_0$, $p_{\alpha 0}$, which reduces to $x^\alpha_0$, $z_0$, $p_{\alpha 0}$ for $\bar v^1 = 0$. Let the result of substituting
the (2n + 1) functions (3.5) into this integral be denoted by S; we must show
that S is an (n − 1)-dimensional characteristic strip. S satisfies the equation
(2.9a) as a direct consequence of the first n equations (1.4). Also on account
of the same equations (1.4) the quantities $A^a_1$ which do not vanish identically
will be $A^1_1$. In particular, let us consider $A^1_1 \ne 0$. Since $A^1_1 = -D_1$, the
equations (2.9c) are identical with the last n equations (1.4). When a = 1,
the equation (2.9b) follows from the first (n + 1) equations (1.4). In order
to show that the remaining equations (2.9b) are satisfied we put

$$V_i = z_i - p_\alpha x^\alpha_i.$$

Differentiate this relation with respect to $\bar v^1$, make use of (2.9b) for a = 1 and
equations (1.4), and obtain
(3.8) or oF, 


ao re agi + PePati + Peedi. 


From (3.6a) we have dF'/a0' = 0, and equations (3.8) become 


ak 3 
ao" 
Since V; = 0 for o' = 0 by (3.6b), we have V,; = 0. 

The proof that the strip S is unique consists in showing that any other 
(n — 1)-dimensional characteristic strip S* containing the (nm — 2)-dimensional 
strip (3.5) must contain all of the (n — 2)-parameter family of one-dimensional 
characteristic strips which constitute S. By assumption, the initial element 
of any one of the one-dimensional characteristic strips in S lies also on S*. 
By the argument in the first part of §3, the strip S* consists of an (n — 2)- 
parameter family of one-dimensional characteristic strips. Since a one-dimen- 
sional characteristic strip is determined by its initial element, the argument is 
complete. 

The construction of our (n — 1)-dimensional characteristic strip can be given 

















































a geometrical interpretation. Let an initial (n — 2)-dimensional manifold m 
be given by the first (n + 1) equations (3.5). Suppose that there exists an 
element E which contains the (n — 2)-spread | tangent to m at P (0' = 0) and 
which is tangent to the hypercone 7 with vertex at P. Suppose also that / 
does not contain the generator through P determined by EZ. Then the equa- 
tions (3.6) determine a one-parameter family of strips (3.5) containing the 
manifold m. Then as E describes one of these strips containing m, the (mn — 2)- 
parameter family of one-dimensional characteristic strips thus determined con- 
stitutes the (n — 1)-dimensional characteristic strip. 


4. Let us recall that Cauchy’s method of integrating the equation (a) is to 
replace the arbitrary constants in a solution of (1.4) by (2n + 1) functions of 
(n — 1) parameters, namely, 


ry = E(v',---, 0"); zo = FO", --- , 0"); 
(4.1) _— 
Pad = Fav, cede ho bs 
satisfying the conditions 


F [x¢ | z0| pao] = 0, 


(4.2) 20 ed axe (o =2,..-- n) 
au? 8? aye? aes 


We now make a few remarks to show how the above theory can be applied
in a discussion of Cauchy’s problem. Let the initial edge M be given by the
first (n + 1) equations (4.1) and let us suppose that there exists a set of values
$p_{\alpha 0}$ which satisfy the system (4.2) at some point $P(v_0)$ of M. First consider the
case in which

$$F_{p_\alpha}D_\alpha \ne 0 \tag{4.3}$$

for the set of values $p_{\alpha 0}$, $\xi^\alpha(v_0)$, $\zeta(v_0)$. Hence we can solve equations (4.2)
for the quantities $p_{\alpha 0}$ as functions of the v’s. On account of (4.3) the strip
thus obtained will give us a solution of Cauchy’s problem in the form (1.1).
Geometrically we have assumed that there exists an element E, which contains
the plane edge L tangent to M at P, and which is tangent to the hypercone T
with vertex at P. We have supposed also that L does not contain the generator
through P determined by E. Then the (n − 1)-parameter family of one-dimensional
characteristic strips, determined by E as P describes a neighborhood
of itself on M, comprise our integral hypersurface. This integral hypersurface
can also be thought of as generated by the one-parameter family of
(n − 1)-dimensional characteristic strips determined by any one-parameter
family of (n − 2)-dimensional spreads lying on the edge M.

Next let us consider the case where M bears an (n — 1)-dimensional char- 
acteristic strip (4.1) and Cauchy’s problem becomes indeterminate. Select any 
(n — 2)-dimensional spread m lying on M such that at any point P of m the 










(n − 2)-spread l tangent to m at P does not contain the generator through P
determined by E. Through P on m pass a line t, on E, which with l determines
a plane edge that does not contain the generator determined by E. Let Γ
be a curve through P with t for its tangent line. Select a differentiable (n − 2)-parameter
family of such curves Γ, one through each point of m, which taken
together comprise an edge $\mathfrak{M}$. The plane edge tangent to $\mathfrak{M}$ at any point P
of m lies in E but does not contain the generator through P determined by E.
Therefore the integral hypersurface passing through $\mathfrak{M}$ must contain the
(n − 1)-dimensional characteristic strip through m, i.e., the strip (4.1).

Any edge M lying on an integral hypersurface, whose tangent plane edge L 
at each point P contains the generator through P determined by E but which 
does not bear an (n — 1)-dimensional characteristic strip, must contain singu- 
larities of the integral hypersurface. For if the integral hypersurface possessed 
second derivatives at each point of M, we should conclude by the argument 
in §2 that M bears an (n — 1)-dimensional characteristic strip. 


PRINCETON UNIVERSITY. 



























THEOREMS ON FOURIER SERIES AND POWER SERIES 
By H. R. Pitt
1. Introduction 
1.1. Notation. If p > 0, we write 


T,[e,] = ( i (2 | Cn | ry dz)’, 
M, [FO] = ( [ \F@) ao)” 


If g(z) is a regular analytic function for | z| < 1, we write 


® l/p 
,[g(2)] = lim ( [ lore") a) 


(The limit exists, since the expression in the bracket increases with r.) 
1.2. Suppose that F(6) is periodic and integrable and that g(z) is regular in 
|z| <1. Let 


(1.2.1) F@) ~ > a,e"" (ao = 0), 
(1.2.2) g(z) = > Cy 2” (jz| <1). 


We shall prove that if p, q, a satisfy certain conditions, 


(1.2.3) Sa [lenn™] S K9H,[g(z)(Ql — z)"), 
(1.2.4) S,la,n™] < KM, [F(0)e*), 
where 
(1.2.5) ee cP ee 

p q 


and the constants K(p, q, a) are independent of g(z) and F(@). 
Special cases of these inequalities, due to Hausdorff¹ and Hardy and Littlewood,²
are well known. They can be derived from Theorems 1 and 2 of this
paper by substituting appropriate values of p, q and α.

Received October 16, 1937.
¹ Hausdorff [5], Theorem II. (Numbers in brackets refer to the references at the end
of the paper.) This is the case α = γ = 0 of (1.2.4).
2. A theorem for power series 


2.1. Our principal result is as follows. 
THEOREM 1. Let


x 


g(z) = C,2" (jz| <1); 
1 


r= 


»>q2p>Q0, a 20, 


bat é: 40.886 
-. 


Then 
(A) F,[e.n‘] S KHp[g(z)(1 — z)*], 
(B) Z,le.n****"] < KH,[9(z) — =)"), 


whenever the right side is finite. 
We shall denote the inequalities (A) and (B), for particular values of p, q
and α, by A[p, q, α] and B[p, q, α], and use the symbol ⊃ to show relations
of inclusion between them. For example, we write

A[p, q, α] ⊃ B[r, s, β],

if B[r, s, β] can be deduced from A[p, q, α].
2.2. Lemma 1. 


(a) Let 
epesi. «<¢h <i; 
c, =n" Fb 
; v=l y(n — v) 
Then 
S,le.] S K(a, b, q) S,[b,J. 
(b) Let 
> 5, o> §, s<5. 6a5=5, 
8s 8s r 


Os de+eu 5+ 5 1: 
8 r 


Ss bb, 
ya ¥*(n — vj)? 


C, =n 


n 


² Hardy and Littlewood [1] and [4]. The former covers the cases α = 0 or γ = 0, while
the latter deals with the case α = 1/p′, λ = 1/q of (1.2.3).










Then 


=,[c,] < K(a, r, s)S?[b,]. 


These results are special cases of a very general inequality of Hardy and 
Littlewood, Theorem 1 of [2]. We obtain (a) (with b/, , ci, instead of b, , en) 
by writing 


P= % r=q% a6, B=b+", y=atb+ 7-1, 


a,=n", b, = bn”, auntie. 
We obtain (b) by writing 
p=q=8, a=B8=a+-, yar-¢ 
a, = b, = ban, Ch = c,n'. 
2.3. LEMMA 2. 
(a) Alp, q, 0] is true if 
2» >q2zp>l, eer s 
Pp 4 
(b) Al p, q,1 — i. ‘] is true if 
P 4 
2 >q2p>l, wary! 
P @q 


These results follow at once from Theorems 9 and 10 of [1]. In fact, the 
latter results are true for general Fourier series, whereas we are concerned here 
only with Fourier power series. 

2.4. Lemma 3. 

A[p, q, a] > Bip, q, a] fq>1; 
B\p, 9, a] > Alp, q, a] ifq <1. 


These are immediate consequences of the following inequalities of Hardy and 
: 3 
Littlewood. 


Tylen) S K(QSalenn™ J ifq>1; 
Salen’ "| < K(q)Ta len] ifg <1. 


2.5. Lemma 4. If ~ 2q 21,% >p>0O0,€ > 0,a 20, then 
Alp, q, a] > Alp, q, a + €]. 
³ Hardy and Littlewood [3], Theorems 3 and 11. (There is a misprint in Theorem 11.
The exponent (p + q − pq)/q on the left of (4.11) should be negative.) The case q = 1
is trivial.













o(z) = Dd n*'2” 
1 


(jz| < 1). 


4 ° ° 
Then’ ¢(z) is regular for _z| < 1, z ¥ 1, and has no zero, except at z = 0, in 


z! < 1. Moreover, | ¢(z)(1 — z)‘|’ is bounded in |z| S$ 1. Hence, if we 
write 
zw(z) = g(z)/o(z) = Dd b,2" (jz| < 0), 
1 
we have 
aia n—l b.. 
e mn pi- ’ 


H,[w(z)2(1 — z)*] S K(e)Dplg(z)(1 — z)*"*. 


We can write 


n—l 


or a b,_.(n — v)* 
alt = % 
o=1 v(m — v)-’ 
and since 
1 1 ] 
LaHet nha age ses, 
q P 4 


it follows from Lemma 1(a) that 


S,[e.n |] < KS,[b.n “I. 
Hence, assuming A[p, q, a], we have 
S,[ce.n”‘] S KS,[b.n *] S KH,[w(z)e(1 — z)*] S KH,[g(z)(1 — z)*"'h. 
This gives A[p, g, a + él. 
2.6. Lemma 5. Let 


(2.6.1) q > 1, s> 1, s2p> 0, a = 0, 
: ~i1s . < 2 
s ~ | $ 

Then 


Alp, 8, a] D A[4p, 4, 2a]. 


By using a theorem of F. Riesz,⁵ we may suppose that g(z) has no zeros in
|z| < 1, except at z = 0, and write


zq(z) [w(z) |, w(z) : F b,2’, 
i 


⁴ See Hardy and Littlewood [4], page 467.
⁵ F. Riesz [6]. See also page 207 of [1].
















so that 


n—l 


C1 = b, b,- . 


v=l1 


We suppose that A[p, s, a] is true. Then 


(2.6.2) S.[b.n “] S KH,[w(z)(1 — z)*], 
where 

p 8s 
Let 

A= 3 + ts 2a — 1 

P @q 

Then 
Cnn” = a - bv“ bn—(n oem v)* 


1 ov #(n — wv) 
It is plain from (2.6.1) that 
—1<5, -1S5-5, 
It follows from Lemma 1(b) that 
Selenin ) S$ KSi[b,n™“] S KG} [w(z)(1 — 2)*), 
by (2.6.2). Hence 
Salenan”] S KHpp[w(2)*(1 — z)**] = KHppl9(2)( — 2)**1, 


which is equivalent to A[}p, q, 2a]. 
2.7. Lemma 6. If © >q > p>O0,a 20, then 


B[p, q, a] > B[}p, 4¢, 2a]. 


As in Lemma 5, we may’* suppose that g(z) # 0 for |z| < 1, z ¥ 0, and write 


2 n—l 
zg(z) = [w(z)]’, w(z) = > b,2’, Cra = Dy by da. 


v= v= 
We suppose that B[p, q, a] is true, so that 
T,[b,.n “"**""] < KH, [o(z)(1 — z)*), 
where 


l l 
w= -+-+a—-l1. 
P @q 


⁶ We use the fact that $\mathfrak{M}_r[\varphi + \psi] \le A(r)[\mathfrak{M}_r(\varphi) + \mathfrak{M}_r(\psi)]$ for any functions φ, ψ and
any positive r. If r ≥ 1, this is Minkowski’s inequality. If r < 1, it follows from the
inequalities $a + b \le (a^r + b^r)^{1/r} \le 2^{1/r}(a + b)$.






















Let 
N= + + 2a — 1. 
Then 
X Jena] neat sD we Sb, | ba 


n=l n= 


oO o 


u > |b, | 2° | bn | x"(n + aati 


n=1 v=1 


(> |b, | 2°y ees 


vel 


lA 


since 


A+1—4/q = Au+1— 2/9) 20 


and (a + b)* = ab for any positive a, b. Hence 


Tielenan '*")] < Tilb.n” "| < KH}; [w(z)(1 — z)*] 
= K§,,[w(z)*(1 — z)**] = K&,,[9(z)2(1 — z)**), 


and this is equivalent to B[4p, 4q, 2a]. 

2.8. Proof of Theorem 1. We suppose first that » > q 2p> 1. In view 
of Lemma 3, it is sufficient to prove Alp, q, a], and this follows at once from 
Lemmas 2 and 4. 

Next, let = >q>12p>0. Because of Lemmas 3 and 4, it is sufficient 
to prove A[p, g, 0]. We suppose first thatg > 1 2 p > 3. We can choose s 
so that 


get etaiat Pe3 ec! 
> = q q s p 
Then 
eH tek: s22p> 1. 
5 q s 


so that 
A[2p, s, 0] > Alp, q, 9), 


by Lemma 5. Moreover, since 
‘ l . 
“>s22p>1, + = 3s, 
s 


it follows from Lemma 2(a) that A[2p, s, 0] is true. Hence Alp, q, 0] is true 
forg > 1 2p > 3}. If we now put s = q in Lemma 5, we can prove that 
Alp, 4, 0] is true for 
I 4 
. gr ge SPF ee 


> p> 0. 





successively, and it follows that the result is true generally for gq > 1 










Finally, we have to consider the case 1 ≥ q ≥ p > 0. Because of Lemma 3,
it is sufficient to prove B[p, q, α]. We choose an integer k so that $2 \ge 2^kq > 1$.
Then $B[2^kp, 2^kq, \alpha]$ is true for α ≥ 0, by what we have already proved, and the
conclusion follows by repeated application of Lemma 6. This completes the
proof of Theorem 1.
2.9. We can express Theorem 1 in a slightly different form. We know 


that if ,[g(z)(1 — z)*] is finite, then g(z) has a boundary function G(6) such 
that the ratio 

H,[g(z)(1 — z)“]: M,[G(6)e*] 
lies between positive bounds K(p, a). Conversely, if Dt,[G(6)6*] is finite, then 
(7(@) is the boundary function of an analytic function g(z), and the same relation 
holds. It follows that we may replace §,[g(z)(1 — z)*] by 92,[@(@)é*] in the 
conclusion of Theorem 1. 


3. Theorems for Fourier series 


3.1. We shall now show that the inequality (1.2.4) can be deduced from 
Theorem 1 when p, qg and a satisfy certain extra conditions. 
THEeorEeM 2. Suppose that F(@) is integrable and periodic. Let 


F@ ~ Dd a,e"”, ad = 0; 
> I > 
o>q2p>il, reas 


bw id tiene SRE 
Pp 4 


Then 
(a) S,[a,n |] < KM,[F()e*], 
(b) mM,[F(0)e “| < KS,fa,n"). 


Let $G(\theta)$ be the conjugate of $F(\theta)$. Then $F(\theta) + iG(\theta)$ and $F(\theta) - iG(\theta)$
are boundary functions of $2\sum a_n z^n$ and $2\sum a_{-n} z^n$, respectively, and the
conclusion (a) follows from Theorem 1 and the remarks of 2.9 if we show that

$$M_p[G(\theta)\theta^a] \le K M_p[F(\theta)\theta^a].$$

This has been proved by Hardy and Littlewood$^8$ when $p > 1$, $-1/p < a < 1/p'$,
and these conditions are plainly satisfied here.
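
The statement about boundary functions can be checked formally on the Fourier series. The sketch below assumes the usual convention for the conjugate series, $G(\theta) \sim \sum (-i\,\mathrm{sgn}\, n)\, a_n e^{ni\theta}$, which is not spelled out in this section.

```latex
% Formal check, assuming the standard conjugate series
% G(\theta) \sim \sum (-i \operatorname{sgn} n)\, a_n e^{ni\theta}, with a_0 = 0.
\begin{align*}
  F(\theta) + iG(\theta)
    &\sim \sum_{n \ne 0} (1 + \operatorname{sgn} n)\, a_n e^{ni\theta}
     = 2 \sum_{n=1}^{\infty} a_n e^{ni\theta}, \\
  F(\theta) - iG(\theta)
    &\sim \sum_{n \ne 0} (1 - \operatorname{sgn} n)\, a_n e^{ni\theta}
     = 2 \sum_{n=1}^{\infty} a_{-n} e^{-ni\theta}.
\end{align*}
% The first series is 2 \sum a_n z^n on |z| = 1; the second becomes
% 2 \sum a_{-n} z^n on |z| = 1 after the reflection \theta \to -\theta.
```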

The inequality (b) can be deduced from (a) by a "conjugacy" argument. Let


$$Q(\theta) = \sum c_n e^{ni\theta}$$


$^7$ F. Riesz [6], Theorems II and III. See also §4.1 of [4].
$^8$ Hardy and Littlewood [4], §6.3.


























be a polynomial. Then 





F(6) Q(6) dé = Doane, 


/ F(0)6* Q(8) & de > a,n*c,.n-* | 


lA 


S,la,n*] Splen n “| 
< Sa, n|KM,-[Q() &) 
by Theorem 2(a), since 


a=>+54h-120, 1>r220, o>p'2q'>1. 
P q q 


It follows from the converse of Hölder's inequality that


M[F (ee) < KS,[a,n"]. 


This is what we require. 
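
The converse of Hölder's inequality used in this step can be stated in the following standard form. The exponents $r$, $r'$, the functions $f$, $g$ and the constant $C$ are generic notation introduced for this sketch, not the paper's own; in the argument above the role of $g$ is played by a function built from the polynomial $Q$.

```latex
% Converse of H\"older's inequality (standard form); r, r', f, g, C are generic.
% Let 1 < r < \infty, 1/r + 1/r' = 1, and let f be integrable on (-\pi, \pi).
% If, for every bounded measurable g,
\[
  \Bigl| \int_{-\pi}^{\pi} f g \, d\theta \Bigr|
    \le C \Bigl( \int_{-\pi}^{\pi} |g|^{r'} d\theta \Bigr)^{1/r'} ,
\]
% then
\[
  \Bigl( \int_{-\pi}^{\pi} |f|^{r} d\theta \Bigr)^{1/r} \le C .
\]
% Idea of proof: take g = |f_N|^{r-1} \overline{\operatorname{sgn} f}, where
% f_N is f truncated at height N, and let N \to \infty.
```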

3.2. We can deduce from Theorem 1 the following extensions of Theorems
9 and 10 of [4].

THEOREM 3. Suppose that $G(\theta)$ is the boundary function of an analytic
function $g(z) = \sum a_n z^n$, that
1 
s,(z) = bm af: 
1 
and that $p$, $q$, $a$ satisfy the conditions of Theorem 1. Then
Z,[(s,(z) — s(z))n “] S KM,[(F(« + 6) — s(x))0*""], 


whenever the right side is finite. 
THEOREM 4. Suppose that $F(\theta)$ is integrable and

$$F(\theta) \sim \sum a_n e^{ni\theta}, \qquad a_0 = 0.$$

$$\phi(x, \theta) = \tfrac12 [F(x + \theta) + F(x - \theta) - 2s(x)];$$

$$\infty > q \ge p > 1,$$







Then 


S[(s.(xz) — s(x))n™] S KM,[o(x, 0)0°™'), 


whenever the right side is finite. 

We obtain Theorem 3 immediately on applying Theorem 1 to the function
$(g(z) - s(0))(1 - z)^{-1}$. Theorem 4 follows from Theorem 1 by the argument
given in §5.2 of [4].
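
The reason for dividing by $1 - z$ is the elementary identity below, written out as a sketch. It assumes that $s_n$ denotes the $n$-th partial sum of the coefficients of $g$ (with $s_0 = 0$) and that $s$ is a constant; both are assumptions about the notation.

```latex
% Assumes g(z) = \sum_{n \ge 1} a_n z^n, s_n = a_1 + \dots + a_n, s_0 = 0, |z| < 1.
\begin{align*}
  \frac{g(z)}{1 - z} &= \sum_{n=0}^{\infty} s_n z^n ,
  \qquad
  \frac{s}{1 - z} = \sum_{n=0}^{\infty} s\, z^n , \\
  \frac{g(z) - s}{1 - z} &= \sum_{n=0}^{\infty} (s_n - s)\, z^n .
\end{align*}
```

So the coefficients of the function to which Theorem 1 is applied are the differences $s_n - s$ that appear on the left side of Theorem 3.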

The Hardy-Littlewood theorems are given, in each case, by a = 1/p’. 


REFERENCES 


1. G. H. HARDY AND J. E. LITTLEWOOD, Some new properties of Fourier constants, Math.
Ann., vol. 97 (1926), pp. 159-209.

2. G. H. HARDY AND J. E. LITTLEWOOD, An inequality, Math. Zeitschr., vol. 40 (1935),
pp. 1-40.

3. G. H. HARDY AND J. E. LITTLEWOOD, Elementary theorems concerning power series with
positive terms, Journ. für Math., vol. 157 (1927), pp. 141-158.

4. G. H. HARDY AND J. E. LITTLEWOOD, Some more theorems concerning Fourier series and
Fourier power series, this Journal, vol. 2 (1936), pp. 354-382.

5. F. HAUSDORFF, Eine Ausdehnung des Parsevalschen Satzes über Fourierreihen, Math.
Zeitschr., vol. 16 (1923), pp. 163-169.

6. F. RIESZ, Über die Randwerte einer analytischen Funktion, Math. Zeitschr., vol. 18
(1923), pp. 87-95.


PETERHOUSE, CAMBRIDGE.








ACKNOWLEDGMENT 


In bringing to a close the third volume of the Duke Mathematical Journal 
the editors wish to state that their task in launching the new periodical has 
been greatly lightened by the whole-hearted coöperation received from many
sources: from Vice-President Flowers and other administrative officers of Duke
University, who have provided the necessary funds in somewhat difficult times;
from the mathematical public, which has aided by contributing articles,
refereeing them and subscribing; and from the Waverly Press, which has cheerfully
modified many of its practices to suit our needs. We are particularly indebted
to Messrs. A. Cohen, J. D. Tamarkin and S. Lefschetz, who by arranging the
transfer of papers from the established American journals helped us to publish 
our first number earlier than would otherwise have been possible; to Mr. R. E. 
Langer, who acted as editor for analysis during the academic year 1935-1936 
when Mr. Widder was absent in Europe; to those mathematicians who, although 
not members of our board, have served as referees, namely, 


J. W. Alexander,      E. Hopf,              I. J. Schoenberg,
H. Bateman,           V. A. Hoyle,          W. Seidel,
P. Bernays,           E. V. Huntington,     A. Sinkov,
S. Bochner,           N. Jacobson,          N. E. Steenrod,
D. G. Bourgin,        I. kk. Johnston,      M. H. Stone,
H. R. Brahana,        S. Lefschetz,         G. Szegö,
A. B. Brown,          N. Levinson,          J. D. Tamarkin,
H. B. Curry,          D. C. Lewis, Jr.,     H. S. Vandiver,
T. L. Downs,          H. W. March,          R. J. Walker,
W. B. Ford,           M. Morse,             M. J. Weiss,
C. A. Garabedian,     F. D. Murnaghan,      H. Whitney,
V. G. Grove,          S. B. Myers,          O. Zariski,
E. Hille,             W. C. Randels,        L. Zippin,


and to Misses Hilda Howes and Alta Odoms, who have done the secretarial 
work and proof reading. To all of these we express our sincere thanks. 






















